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© Errors? Feedback? Email me! © 


With your help, I plan to keep improving this textbook. 


Please refer to the latest version and also 
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The first thing to understand is that mathematics is an art. 


— Paul Lockhart (2009). 


A mathematician, like a painter or a poet, is a maker of patterns. If his patterns 
are more permanent than theirs, it is because they are made with ideas. ... Beauty 
is the first test: there is no permanent place in the world for ugly mathematics. 


— G.H. Hardy (1940). 


Le savant n’étudie pas la nature parce que cela est utile; il ’étudie parce qu’il y 
prend plaisir et il y prend plaisir parce qu’elle est belle. Si la nature n’était pas 
belle, elle ne vaudrait pas la peine d’étre connue, la vie ne vaudrait pas la peine 
d’étre vécue. 


The scientist does not study nature because it is useful to do so. He studies it 
because he takes pleasure in it, and he takes pleasure in it because it is beautiful. 
If nature were not beautiful it would not be worth knowing, and life would not be 
worth living. 


— Henri Poincaré (1908, 1914t). 


whoever does not love and admire mathematics for its own internal splendours, 
knows nothing whatever about tt. 


— Michael Polanyi (1959). 
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About this Book 


This textbook is for H2 Maths students in Singapore (hence the occasional Singlish and 
TLAs?). It exactly follows the latest 9758 syllabus.’ (Of course, I hope that anyone else 
in the world will also find this useful!) 


e FREE! This book is free. But if you paid any money for it, I certainly hope your money 
is going to me! This book is free because 


1. It is a shameless advertising vehicle for my awesome tutoring services. 


2. The marginal cost of reproducing this book is zero and I am a benevolent maximiser of 
social welfare. If you don’t understand what that last sentence means, you should read 
my economics textbooks.* (Quick translation: I’m a very nice guy. ©) 


¢ HELP ME IMPROVE THIS BOOK! Feel free to email me if 


1. There are any errors in this book. Please let me know even if it’s something as trivial 
as a spelling mistake, an extra space, a grammatical error, or an incorrect /broken link. 


2. You have absolutely any suggestions for improvement. 


3. Any part of this book is less than crystal clear. 


If at any point in this textbook, you’ve read the same passage a few times, tried to reason 
it through, and still find things confusing, then it is a failure on my part—lI have failed 
to explain things clearly and simply. Please let me know and I will try to rewrite it so that 
it’s clearer and simpler. (There is also the possibility that I simply made some mistake or 
typo! So please let me know if there’s anything confusing!) 


I deeply value any feedback, because I’d like to keep improving this textbook 
for the benefit of everyone! 1 am very grateful to all the kind folks who’ve already 
written in, allowing me to rid this book of more than a few embarrassing errors. 


¢ LyX rocks!? A big thank you to all who’ve contributed to the LyX project. 





?Three Letter Abbreviations. In the US for example, abbreviations are viewed by some as dumbing down. 
In contrast, in Singapore, the ability to use as many abbreviations as possible and even create one’s own 
abbreviations is nearly a mark of intelligence. There shall therefore be very many abbreviations in this 
textbook. 

3In 2017, the current 9758 syllabus was examined for the first time and the previous 9740 syllabus for the 
last time. 

“I’m working on these. You can find half-completed versions on my website. 

°This book was written using LyX. FTX is the typesetting program used by most economists and 
scientists. But ETR@X can be annoying to use. LyX is a user-friendly, GUI, for-dummies version of ATRX. 
With LyX, you can actually clearly see on screen the equations you're typing, as you’re typing them. 

It is quite pointless to try working out one’s maths in HTX because you can’t clearly tell what you’re 
writing. In contrast, it is perfectly feasible and indeed easy to work out one’s maths in LyX. For example, 
if you have countless lines of tedious algebra to do, you can do it in LyX and copy-paste/document every 
step of the way. Otherwise you’d probably be doing it on pen and paper which is messy and which you'll 
probably misplace. 

LyX has boosted my productivity by countless hours over the years and you should use LyX too! 
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« Maths, math, and matzz 


This book uses British English. And so the word mathematics shall be abbreviated as maths 
(and not math). By the way, Singaporeans used to pronounce maths as matzz (similar to 
how they pronounce clothes as klotes). But at some point between 2005 and 2015, perhaps 
after watching too many American TV and movies, Singaporeans decided they’d switch to 
the American math. Perhaps in 50 years, we'll all be Amos Yees trying to speak annoying 
pseudo American English. But in the meantime, I’ll continue to say matzz. This is my way 
of promoting and preserving Singlish (and also sticking it to the ghost of LKY). 


e Hyperlinks 


This book has clickable hyperlinks that bring you to some other location in this book, a 
web-page, or email link. 

After clicking on any such hyperlink, clicking on “Back” or “Previous View” available on 
many PDF readers (on desktop Adobe Acrobat, “Alt-Left Arrow” works for me) should go 
back to where you were previously. Try it! 
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Tips for the Student 


e Read maths slowly 


Reading maths is not like reading Harry Potter. Most of Harry Potter is fluff. You can 
skip 100 pages of Harry Potter without missing a beat.® 


In contrast, maths has little fluff. Skip or misapprehend one line and it will cost you. 


So, go slowly. Dwell upon and carefully consider every sentence in this textbook. Make 
sure you completely understand what each statement says and why it is true. Reading 
maths is very different from reading most other subject matter. 


If you don’t quite understand some material, you might be tempted to move forward anyway. 
Don’t. In maths, later material usually builds on earlier material. So, if you simply move 
forward, then yea sure, this may save you some time and frustration in the short run, but 
it will almost always cost you far more in the long run. 


Better then to stop right there. Keep working on it until you “get” it. Help is all around— 
ask a friend or a teacher. Feel free to even email me! (I’m always interested to know what 
the common points of confusion are and how I can better clear them up.) 


e Do every example and exercise 


Carefully work through every example and exercise. Merely moving your eyeballs is not 
the same as working. Working means having pencil and paper by your side and going 
through each example/exercise word-by-word, line-by-line. 


For example, we might write, “x?-y? =0. So, (x-y)(x+y) =0. Thus, x= y or x =-y.” If 


it’s not obvious why the first sentence implies the second and the third, stop right there and 
work on it until you understand why. And again, if you can’t figure it out yourself, don’t 
be shy about asking someone for help. Don’t just let your eyeballs fly over these sentences 
and pretend that your brain “gets” it. Such self-deceit will only cost you in the long run. 


The Chinese believe in “eating bitterness” and work for work’s sake. That is not my view. 
The exercises in this textbook are not to make you suffer or somehow strengthen your moral 
fibre. Instead, as with learning to ride a bike or swim, practice makes perfect. The best 
way to learn and master any material is by practising, doing, and occasionally failing. You 
may struggle initially and get a few bruises, but eventually, you’ll get good. 


To repeat, I strongly advise that you do every exercise in this textbook. They'll help 
you learn. They’ll also serve as a check that you’ve actually “got it”. And if you haven’t, 
then well, as mentioned, either keep working until you get it or go out and get help. ‘The 
seemingly easy way out of pretending you’ve got it and flipping to the next page is actually 
the hard way out, because it will only cost you more grief in the long run. 


« Construct your own examples 


A good stock of examples, as large as possible, is indispensable for a thorough 
understanding of any concept, and when I want to learn something new, I make it 
my first job to build one. 


— Paul Halmos (1983). 





®Spoiler: Voldemort dies, Harry Potter lives happily ever after. 
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the surest kind of knowledge is what you construct yourself. 


— Judea Pearl (2018). 


¢ Confused? Good! 


MOE! MZ, BRERA, FAH. 


shall I teach you what wisdom means? To know what you know and know what 
you do not know—this then is wisdom. 


— Analects of Confucius (1998t). 


he fancies he knows something, although he knows nothing, whereas I, as I do not 
know any thing, so I do not fancy I do. In this trifling particular, then, I appear 
to be wiser than him, because I do not fancy I know what I do not know. 


— Plato’s Socrates (4th century BC, 1854t). 


he discovered that, when he imagined his education was completed, it had in fact 
not commenced; and that, although he had been at a public school and a university, 
he in fact knew nothing. To be conscious that you are ignorant is a great step to 
knowledge. 


— Benjamin Disraeli (1845). 


erste Vorbedingung des Lernens: das Wissen des Nichtwissens. 


The first prerequisite for learning anything is .... the knowledge that we do not 
know. 


— Gottlob Frege (1884, 1950t). 


There are always some students who, when probed, say they are not at all confused and 
have no questions to ask. In my experience, these are usually precisely the worst students. 
These students have such poor understanding of the material that they do not know 
what they do not know. 


And so, when you find yourself confused, don’t panic or despair. Your confusion is actually 
good news! To be confused is to know that you do not know and thus to have taken your 
first step towards wisdom. You now know what you need to work on and what questions 
to ask your friends and teachers. 


Of course, merely knowing what you do not know is not enough. You need to actually act 
on this as well. Be proactive: your education and your life are in your own hands! 
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e Made a mistake? Good! 


Making mistakes is the key to making progress. ... 


Mistakes are not just opportunities for learning; they are, in an important sense, 
the only opportunity for learning ... 


The chief trick to making good mistakes is not to hide them—especially not from 
yourself. Instead of turning away in denial when you make a mistake, you should 
become a connoisseur of your own mistakes, turning them over in your mind as if 
they were works of art, which in a way they are. The fundamental reaction to any 
mistake ought to be this: “Well, I won’t do that again!” 


— Daniel Dennett (2013). 
¢ If you can’t explain something simply, you don’t understand it well enough’ 


[Richard] Feynman was once asked by a Caltech faculty member to explain why 
spin 1/2 particles obey Fermi-Dirac statistics. He gauged his audience perfectly 
and said, “I’ll prepare a freshman lecture on it.” But a few days later he returned 
and said, “You know, I couldn’t do it. I couldn't reduce it to the freshman level. 
That means we really don’t understand it.” 


— David L. Goodstein (1989). 


if I can’t explain something I’m doing to a group of bright undergraduates, I don’t 
really understand it myself 


— Daniel Dennett (2013). 


There is a view in some philosophical circles that anything that can be understood 
by people who have not studied philosophy is not profound enough to be worth 
saying. To the contrary, I suspect that whatever cannot be said clearly is probably 
not being thought clearly either. 

— Peter Singer (2016). 


The above saying is useful for the teacher. But it also yields the learner the following useful 
corollary (in mathematics, a corollary is a statement that follows readily from another): 


To test whether you understand something well, try explaining it simply. 


This learning technique may be dubbed learning by teaching.® 





’This or some similar saying is often misattributed to Einstein. But as Einstein himself once said, “73% 
of Einstein quotes are misattributed.” 
8In Latin, Docendo discimus—by teaching, we learn. 
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I have learned much from my teachers and even more from my friends, but from 
my students I have learned more than from all of them. 


— Rabbi Hanina (Talmud). 


When I write equations, I have a very clear idea of who my readers are. Not so 
when I write for the general public—an entirely new adventure for me. Strange, 
but this new experience has been one of the most rewarding educational trips of 
my life. The need to shape ideas in your language, to guess your background, your 
questions, and your reactions, did more to sharpen my understanding of causality 
than all the equations I have written prior to writing this book. 


— Judea Pearl (2017). 


One way to implement this technique? is for you and your friends to get together explain 
concepts to each other aloud. For this technique to work, you and your friends must be 
demanding of and challenge each other. Do not rest until all are satisfied that the concept 
has been clearly, simply, and correctly explained. 


e« Lessons from the science of learning 


Unfortunately, the science of learning is still very much in its infancy. But according to 
this short review of the literature—“How We Learn: What Works, What Doesn’t” (2013), 
the two most effective study/learning techniques are 


1. Self-testing; and 


2. Distributed or spaced practice.'® 

The next three are 

3. Elaborative interrogation; 

4. Self-explanation; and 

5. Interleaved practice." 

Two commonly used but ineffective techniques are (a) highlighting; and (b) rereading. 


Again, the science of learning is very much in its infancy, so you should take with a large 
dose of salt any advice (including mine about learning by teaching). Nonetheless, there’s 
probably no harm trying out different techniques and seeing what works for you. 





° Another way is to hand over the classroom to students. But this is probably too adventurous in Singapore, 

where the closest thing is probably the officially sanctioned and of course graded project work presentation. 

10Say a student has a 20-minute session of PE thrice a week for three weeks. Plan A: An hour of aerobics 
during Week 1, an hour of basketball during Week 2, and an hour of cricket during Week 3. Plan B 
(distributed or space practice): Each week, do 20 minutes of each sport. 

“Say a student has five addition problems and five multiplication problems. Plan A: Do them in this 
order—A, A 2A3A,A5M,M)M3M,Ms. Plan B (interleaved practice): Do A,M, AgM,A3M3A,4M,A5Ms. 
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Miscellaneous Tips for the Student 


« During the A-Level exam, you get a “List MF26: List of Formulae and 
Statistical Tables for Mathematics and Further Mathematics” (PDF) 


So there’s no need to memorise all the formulae that are already on this list. (Note though 
that your JC may or may not give you List MF26 during your JC common tests and exams. ) 


e Remember your O-Level Maths & ‘A’ Maths? 


You’ve probably forgotten some (or most?) of it. But unfortunately, you are still assumed 
to know all of O-Level Maths (2019 syllabus) and “some” (OK, more like a lot) of Additional 
Maths (2019 syllabus). See your A-Level syllabus (pp. 14-15) for what you need to know 
from ‘A’ Maths. (To take H2 Maths, most JCs require that you at least passed ‘A’ Maths.) 


Littered around this textbook are occasional “O-Level Reviews” (e.g. Ch. 5). These reviews 
will usually be very quick and hopefully you’ll have no difficulty with them. But if you do, 
go back and review your O-Level Maths and ‘A’ Maths! 


e Web-based calculators 


Google is probably the quickest for simple calculations. Type anything into your browser’s 
Google search bar and the answer will instantly show up: 


5*5+32= 
G Google Search 


= 57 


Wolfram Alpha is somewhat more advanced (but also slower). Enter sin x for example and 
you'll get graphs, the derivative, the indefinite integral, the Maclaurin series, and a bunch 
of other stuff you neither know nor care about. In this textbook, you’ll sometimes see 
(usually at the end of an example or an exercise answer) a clickable Wolfram Alpha logo 
that will bring you to the relevant computation on Wolfram Alpha. 


Symbolab is a much less powerful alternative to Wolfram Alpha. However, it’s perfectly 
good for simple algebra and somewhat quicker, so you may sometimes prefer using it. 


The Derivative Calculator and the Integral Calculator are probably unbeatable for the spe- 
cific purposes of differentiation and integration. Both give step-by-step solutions for any- 
thing you want to differentiate or integrate. As with Wolfram Alpha, you’ll sometimes see 
clickable logos ® and ® that'll bring you to the relevant computations. (Note that 
anor Ue ately, after clicking on these logos, you’ll also have to either click “Go!” or hit 
() cpl eee ) 





Enter} 








I also made this Collection of Spreadsheets. These are for doing tedious and repetitive 
calculations you’ll often encounter in H2 Maths (with vectors, complex numbers, etc.).\4 





Some kiasu JCs, like HCI, even require that you got at least a B3 for both Maths & Additional Maths. 

13The site author informed me that not having a direct link was deliberate and defensive. 

4 As with anything I do, I welcome any feedback on these spreadsheets. (Perhaps in the future I will make 
a more attractive version.) 
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e Other online resources 


There are way too many websites that try to cover primary, secondary, and lower-level 
undergraduate maths. Unfortunately, some of them are awful and get things wrong. 


Three websites I like (though are probably a bit advanced for JC students) are 
1. The Stack Exchange (SE) & family of Q&A websites.’ 


Just for maths alone, there are three SE sites!!® 


(a) At $ MATHEMATICS, you can ask questions and often get them answered fairly promptly. 
Note though that this site is mostly frequented by fairly advanced users of maths (in- 
cluding many mathematicians), so they can be pretty impatient and quick to downvote 
questions they perceive to be “stupid”. Nonetheless, if you make an effort to write down 
a carefully crafted question and show also that you’ve made some effort to look for an 
answer (either on your own or online), they can be very helpful. 


(b) Mathematics Educators »-\- focuses on pedagogy (teaching and learning). Unfortunately, 
it’s not as active as $$ MATHEMATICS. Nonetheless, many of the discussions there are 
filled with insights—insights that I’ve tried to incorporate into this textbook. 


(c) [mathoverflow is for research-level mathematics and is thus way beyond anything 
that’s of use to us. 


In this textbook, you'll occasionally see % and & logos. Click/touch them and you'll be 
sent to related SE discussions. 


2. ProofWiki gives succinct and rigorous definitions and proofs. Unfortunately it is very 
incomplete. 


3. Mathworld.Wolfram is also great, but at times excessively encyclopaedic, at the cost of 
clarity and brevity. 


And of course, you can find countless free maths textbooks online (some less legal than 
others). One wonderful and (mostly) legal resource is the Internet Archive, which has 
many old books—including very many that were scanned by Google but which are no 
longer available on Google Books. 


Two totally illegal!” resources are Library Genesis for books and Sci-Hub for articles.'° And 
of course, an old reliable is BitTorrent. (I have, of course, never used any of these illegal 
resources, but I hear they are great.) 





The flagship SE site is Sstackoverflow where you can ask any programming question and (often) see it 
answered amazingly quickly. For computing in general, there are also many other SE sites. 


SE sites are like Yahoo! Answers or Quora, but less stupid. The worst SE site is probably Politics o> > , 
but even there, the average question or answer is probably better than that on Yahoo! Answers or Quora. 
‘©There are also many other wonderful SE sites that you should explore. Unfortunately, despite my 
magnificent contributions, Economics >> is not exactly thriving. It seems that economists, unlike 
programmers or mathematicians, have learnt all too well that contributing to the public good is folly. 

‘Well, depending on which jurisdiction you live in. Of course, in Singapore, unless told otherwise, you 
should assume that everything is illegal. 

'8Note though that these sites are constantly playing whac-a-mole with the fascist authorities and so the 
URLs often change. If the links given here aren’t correct, the first thing you should do is to let me know 
so I can correct these broken links. Then simply google to find the current working URLs. For Sci-Hub, 
the WIKIPEDIA page usually lists the latest up-and-running URLs. 
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Use of Graphing Calculators 


You are required to know how to use a graphing calculator.” 


This textbook will give only a few examples involving graphing calculators. 
There is no better way of learning to use it than to play around with it yourself. By the 
time you sit down for your A-Level exams, you should have had plenty of practice with it. 


You can also use any of the seven calculators in the following list:?° 
LIST OF APPROVED GRAPHING CALCULATORS' 
The following graphing calculator models are approved for use ONLY in subjects examined at H1, H2 and 


H3 Levels of the A-Level curriculum. 
Note: All graphing calculators must be reset prior to any examination. 









































S/N | Calculator Brand | Calculator Model Operating Approved Period’ 
System 
1 CASIO FX-9860GIls OS 2.04 2014 — 2022 
OS 2.09 
2 TI-84 Plus OS 2.43 2006 — 2024 
3 ee TI-84 Plus Silver Edition | OS 2.53MP 2006 — 2024 
4 TI-84 Plus Pocket SE ene 2012 - 2021 
5 TI-84 Plus C Silver OS 4.0 2015 — 2023 
Edition OS 4.2 
6 TI-84 Plus CE OS 5.0.1 2017 — 2024 
OS 5.1.5 
OS 5.2.0 
OS 5.2.2 
OS 5.3.0 
OS 5.3.1 
OS 5.4.0 























This textbook will stick with the TI-84 PLUS Silver Edition, OS 2.55MP?! which I’ll simply 
call the TI84. (My understanding is that most students use a TI calculator and that the 
five approved TI calculators are pretty similar.) 


I'll always start each example with the calculator freshly reset.” 





Pretty bizarre that in this age of the smartphone, they want you to learn how to use these clunky and 
now-useless devices from the ’80s and ’90s. It is the equivalent of learning to program a VCR. (The TI-81 
was designed in the 1980s and first sold in 1990. The TI-84 PLUS was first sold in 2004 and represents 
only a modest improvement over the original TI-81.) 

IMHO it’d be much better to teach you to some simple programming or Excel (or whatever spreadsheet 
program). “B-b-but ... how would such learning be tested in an exam format?” Ay, there’s the rub. In 
the Singapore education system, anything that cannot be “examified” is not worth learning. 

Of course, there are some folks over in Texas who don’t mind. Nor do those lucky few MOE teachers 
and administrators who get to go on all-expenses-paid “business” trips to Texas to “learn” more about 
the calculators. 

°This list was last updated by SEAB on 2019-10-31. 

21You can download the operating system through the TI website. 

22T’ve never actually bought or owned a graphing calculator. All my screenshots here of the TI84 are 
actually from the emulator Wabbitemu. (Yup, you can download Atari or DOS emulators to play 
decades-old games; and you can likewise download an emulator for this decades-old piece of junk.) 
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Exam Tips for Towkays 


Use your graphing calculator as much as possible. You are always allowed to use your 
graphing calculator. 


Some exam questions will explicitly instruct you not to use your calculator, but this just 
means that your written answer should not include any hint that you used your calculator. 
(Nonetheless, you can still cheat and use your calculator for guidance and to check your 
answet. ) 


Instructions to not use your calculator include: 


“Do not use a calculator in answering this question.” 
“Without using your calculator ...” 
“Use a non-calculator method ...” 


“Find the exact value of ...” 


“Express your answer in terms of V3 or 71.” 





In Spider-Man: Homecoming (2017), Spider-Man uses a TI calculator to bust out of the 
“most secure facility on the Eastern Seaboard”—the Damage Control Deep Storage Vault 
(also known as the Department of Damage Control Vault).”° 





3This scene takes place at about 57:30-58:00 of the movie (YouTube clip). 
They removed the brand name and also the model name, so I can’t quite tell what model it is. One 
website claims it’s a TI-86, while another claims it’s an exact match to the TI-83 PLUS. 
Zooming in, it looks like Spider-Man is doing some sort of combinatorics on his notepad. I can’t tell 
what exactly’s on the calculator screen. Many of the buttons on the calculator seem to be permanently 
depressed, which is weird. 
Slowing down, we see that he first punches in (i then a moment later (or maybe (2) (6))—so 
it’s probably all just rubbish that he’s punching in. The latter keystrokes do get him out of the place 
though. (That’s all for my brilliant movie analysis of the week.) 
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Apps You Can Use on Your TI Calculator 


On 2020-04-03, I received this information from SEAB: 


The following applications for Texas Instruments graphing calculators are 
approved for use in the GCE A-Level national examinations: 


. Finance 
. Conics 
. Inequalz 


. PlySmlt2 
. Transfrm 


oF W NH Fe 


Please ensure that your graphing calculator is installed with the approved 
operating system. The Singapore Reset (pressing “On” key while holding the 
“2” and “8” keys) will reset the calculator without deleting these applications. 


On my (emulated) TI-84 Plus Silver Edition, the Finance app was already pre-installed. 
So I simply had to google, download, install the other four apps.” 


After doing so, here’s what my “APPLICATIONS” screen looks like after executing these 
two steps: 


1. Press to turn on your calculator. 
2. Press GUQRS to bring up the “APPLICATIONS?” screen. 


Step 1. Step 2. 





(Screenshots are of the end of each Step.) 





4\Vebpages to download these four apps for the TI-83 PLUS or TI-84 PLUS: Conics, Inequalz (seems like 
we’re supposed to use the “International” version), PlySmlt2, and Transfrm (links retrieved on 2020-04- 
04)—ZIP file containing these four apps and the accompanying guidebooks (all written in 2001 or 2002!). 
(Unfortunately, the guidebook they have for PlySmlt2 seems to be for an old version of the same app.) 
Note that if you have the TI-84 PLUS CE, you may have to look for different webpages from which to 
download the appropriate apps. 
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Preface/Rant 


When you’re very structured almost like a religion .... Uniforms, uniforms, uni- 
forms ... everybody is the same. Look at structured societies like Singapore where 
bad behaviour isn’t tolerated. You are extremely punished. Where are the creative 
people? Where are the great artists? Where are the great musicians? Where are 
the great singers? Where are the great writers? Where are the athletes? All the 
creative elements seem to disappear. 


— Steve Wozniak (2011).”° 


That ghost, authoritarianism, sees education as a way to instill in all students the 
same knowledge and skills deemed valuable by the authority. 


— Zhao Yong (2014). 


The most dangerous man, to any government, is the man who is able to think 
things out for himself. 


— H.L. Mencken (1919). 


Divide students into two extremes: 


e« Type 1 students are happy to learn absolutely nothing, so long as they get an A. 

e¢ Type 2 would rather learn a lot, even if this means getting a C. 

The good Singaporean is taught that pragmatism is the highest virtue (obedience is second). 
She is thus also trained to be a Type 1 student (and indeed a Type 1 human being). 


If you’re a Type 1 student, then this textbook may not be the best use of your time,”° 
though you may still find the exercises and Ten Year Series (TYS) questions useful. (But 
do read Why Even Type 1 Pragmatists Should Read This Textbook below.) 


Of course, any careful student of this textbook will be rewarded with an A. But getting an A 
is not the goal of this textbook. Instead, the goal is to impart genuine understanding. 


Contrast this with the goal of the Singapore education system: Create a docile labour 
force that generates GDP growth. 


At first glance, these two goals do not seem to be in conflict. After all, we’d expect a 
student who genuinely understands her H2 Maths to also contribute to GDP growth. 





©2011 BBC interview. Also quoted in Zhao (2012, p. 103). Unfortunately, the audio at the given BBC 
link is broken. (Over the years, I’ve reported this broken audio to the BBC at least thrice, but it’s never 
been fixed. I have not been able to find this audio anywhere else.) 

*6The efficient Type 1 student may be interested in these resources: (a) The H2 Mathematics CheatSheet, 
all the formulae you'll ever need on two sides of an A4 sheet of paper (this was written in 2016 and I 
hope to update it “soon”); (b) The Hi Mathematics Textbook, which is written more simply and covers 
a subset of the H2 syllabus; (c) The H2 Maths Exercise Book (coming “soon”), which teaches you how 
to mindlessly apply formulae and give the “correct” answer to every exam question; (d) My totally 
awesome tuition classes! 
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The conflict only arises with the keyword docile. An education system that imparts genuine 
understanding tends also to encourage independent thinking and discourage docility. 


On the one hand, to maximise GDP growth, Gahmen wants “creative” innovators. On the 
other, it doesn’t want too much of a challenge to the status quo (especially politically). Its 
goal is thus to turn docile test-taking drones into docile creative innovators. 


Unfortunately, docility and creative innovation are not compatible. A populace trained to 
avoid the slightest transgression is not one that is capable of producing anything new. The 
result is lip service to buzzwords like “creativity” and half-hearted education reform. Once 
a decade or so, some technocrat comes up with an inane four-letter campaign (FLC) like 
TSLN 1997 and TLLM 2005 that brings us precisely nowhere.” 


To this ambivalence and pussy-footing, add (a) the deep-rooted East Asian love of exams 
and rote-learning; and (b) the elitist British educational system we inherited.*® Altogether, 
despite superficial appearances to the contrary, we’ve had very little change over the years. 
Administrators, teachers, and students alike remain completely fixated on exams.?? 


Singapore produces world champion test-takers and gold medallists at the various Inter- 
national Olympiads. But as currently constituted, the Singapore education system will 
never produce a Fields Medallist or a Nobel Laureate. And as Steve Wozniak sug- 
gests, Singapore will never produce a world-beating innovator like an Apple or a 
Google. The reason is that unlike taking tests (be it your J1 Promos or your IMO), such 
endeavours require more than mere monkey-see-monkey-do mimicry.°” 





2"In 1997, Thinking Schools, Learning Nation (or, as was joked, Sinking Schools, Burning Nation). In 
2006, Teach Less, Learn More. These campaigns may now be found in the ash heap of history (alongside 
such gems as Goal 2010). The current FLC is probably ESGS (Every School a Good School). 

8From a 2013-04-12 Straits Times interview with Tharman: 
ST: DPM, why do you think we are the way we are? 
Tharman: Well, we inherited the British system, which is quite academically biased and in Britain, 
of course, quite an elitist system. We also inherited a Chinese education culture, which is also quite 
academically oriented, a strong emphasis on values and character education but quite academically 
oriented and quite test-oriented. And I think the combination of a British and East Asian educational 
ethos has created a particular form of meritocracy which achieved a lot in 40 years. But as we go forward 
and we think about the type of inclusive society we want, it’s not just about wages, which we are working 
on, it’s about how you view yourself and others at the workplace, wherever you live, how we view fellow 
Singaporeans, do you view them as equals, do you do things together. That has to start from young and 
it has to continue through life. 

2°To be fair, I should highly commend the recent Learn for Life changes. These changes were announced 
in late 2018 and will be implemented in 2019-21. They include the removal of all weighted assessments 
for Pl and P2; and mid-year exams for P3, P5, S1, and S3. Also, in changes that had already been 
announced earlier, from 2021, PSLE T-scores will be replaced by wider scoring bands. In my opinion, 
all of these changes reduce the obsession with exams and should ipso facto be commended. 
Unfortunately, to my knowledge, Gahmen has yet to announce any such changes for JC students, Singa- 
pore university admissions, and Singapore universities. In 2019, the Singapore Management University 
(SMU)—which has always tried to brand itself as hip, relaxed, and American—even went out of its 
way to condemn and criticise a Canadian SMU professor for giving As to all 169 of his students. (see 
2019-05-25 Straits Times story reproduced at The Star and archived here). 

3°Here are two common excuses for why Singapore has produced no Nobel Laureates: Singapore (a) has a 
small population; and (b) was until fairly recently very poor. But consider Denmark (population 5.8M), 
Finland (5.5M), and Norway (5.3M), whose populations are similar to or even smaller than Singapore’s 
(5.6M) and that were producing Nobel Laureates when they were far poorer than Singapore is today 
(whether in absolute terms or relative to, say, the US). We could also point to tiny Saint Lucia (180,000) 
with her two Nobel Laureates. Iam thus accepting bets for this proposition: “By 2050, no born-and-bred 
Singaporean will have won a Fields medal or a Nobel Prize (Peace excluded).” (We can work out what 
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How and Why JC Maths Fails Our Students 


I have a study in mind: Gather all the students who got As for their A-Level maths exams 
x = 5,10,15,20,... years after the exam. Get them to do that exact same A-Level exam 
they took x years ago. Also, ask if they remember anything from their JC maths education 
or if they believe it had any value whatsoever. I suspect that most will score close to 
zero, remember absolutely nothing, and consider their JC maths education to have been 
completely worthless. If these suspicions are correct, then JC maths education has no value, 
except as a selection device. 


(But of course, selection devices are of paramount importance in elitist, social-Darwinist 
Singapore. Grades help differentiate the President’s Scholar from the “mere” PSC scholar, 
the lowly McDonald’s employee from the dalit cleaner, and those who should reproduce 
from those who shouldn’t.?! Grades even decide whether a sex offender can avoid prison.*”) 


How is it that the vast majority of even those who got As on their A-Level maths exams x 
years ago can remember nearly nothing whatsoever? What explains this colossal failure of 
JC maths education? The explanation is simple: 


In Singapore, testing isn’t everything; it’s the only thing. 


Testing does have a place in the educator’s toolkit. But it shouldn’t be the only tool, as is 
the case in Singapore (and the rest of Confucian East Asia). 


Moreover, as currently constructed, Singapore’s system of testing does not test for gen- 
uine understanding. Instead, students are tested on whether they’ve mastered the East 
Asian “skills” of mimicry, following instructions, and reproducing recipes, formulae, and 
algorithms. In other words, students are tested on whether they’re well-trained, obedient 
monkeys capable of performing tricks they’ve practised over and over and over again. 


In A Mathematician’s Lament, Paul Lockhart describes maths education in the US as 
“stupid and boring”, “formulaic”, and “mindless” “pseudo-mathematics”.*? The same may 
be said of maths education in Singapore. 





exactly “born-and-bred” means. ) 

31Qne will recall the Graduate Mothers’ Priority Scheme and Small Family Incentive Scheme. While those 
two Orwellian/Nazi schemes have since been scrapped, the Social Development Unit (SDU) lives on, 
though now rebranded the Social Development Network (SDN). The SDU was established “to encour- 
age social interaction and marriage among graduate singles” at around the same time (1984) as the 
aforementioned schemes . 

32In 2018, NUS student Terence Siow Kai Yuan molested a woman thrice. He first molested her twice on an 
MRT train. When she alighted at Serangoon station, he too alighted and followed her in order to molest 
her a third time. Siow also admitted to having committed similar acts since at least 2016 and admitted 
to being “unable to recall the number of times he had committed such acts”. On 2019-09-25, noting 
that Siow’s “academic results show he has the ‘potential to excel in life’”, District Judge Jasvender Kaur 
sentenced him to only 21 months probation and 150 hours community service. 
Also in 2018, another NUS student Nicholas Lim filmed a female student Monica Baey in the shower 
without her consent. The police gave him a conditional warning, because a “prosecution, with a possible 
jail sentence, will likely ruin his entire future, with a permanent criminal record.” This incident came to 
light in early 2019 only because the victim Baey relentlessly called for attention. Importantly, Baey had 
the aid of social media—in 2005 say, this incident would not have seen the light of day, much less shame 
NUS and the police into giving any response. 

33By the way, Lockhart explains why this is so and what maths really is far more eloquently and clearly than 
I ever could. I strongly recommend that every student and instructor of maths read A Mathematician’s 
Lament. There are two versions—a 2002 25-page PDF that circulated online and a 2009 book version. 
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As currently taught and tested for, JC maths is a mere and mindless collection of isolated 
recipes. Little or no understanding is required. Where do these mysterious recipes come 
from? Why do they “work”? No matter. Instead, all that’s required is for the student to 
mug®* and reproduce these recipes during exams. 


Imagine a maths education system in which students were simply required to mug a million- 
digit number. Those who correctly write out the first 10000 digits get an A; those who 
manage only 5000 get a B; etc. Those who manage 100000 digits get government scholar- 
ships, with the n President’s Scholarships going to the top n muggers. 


This imaginary system sounds absurd, but isn’t really all that different from the existing 
system (or how the Chinese imperial examinations worked for many centuries). Indeed, 
it'd probably be an improvement. If we replaced the existing maths “education” system 
with the simple goal of mugging a million-digit number, then 


e Students would understand no less of maths than they do now. 
e Students would find maths no more “stupid and boring” than they do now. 


e We’d have a selection device that’s no worse than the present one. (Under the present 
system, students are selected and sorted based on their ability to mug and regurgitate 
isolated recipes that are no less random or arbitrary than a million-digit number.) 


e Resources currently dedicated to JC maths education would be freed for other uses. 


“(Memory works far better when you learn networks of facts rather than facts in isolation,” 
writes the British mathematician Timothy Gowers (2012) in a critique of A-Level maths. 
I agree. When we learn a network of facts and seek to understand why something works, 
we are far more likely to remember what we’ve learnt and make use of what we’ve learnt. 


Moreover, further learning also becomes easier. It becomes easier to acquire and assimilate 
even more facts, knowledge, and wisdom. 


In contrast, when we mug isolated facts and recipes that we do not understand, we aren’t 
likely to remember them mere months after our A-Level exam. When in a few years, we 
actually have to make use of the material that we were supposed to have learnt in JC, we 
find ourselves having to learn everything from scratch. What then was the point? We may 
as well have spent those two years mugging a random million-digit number. 





34To mug is to study hard and, especially, to engage in rote learning or memorisation. I consider mug to 
be Singlish and so italicise it. 
It seems though that the phrase mug up may have originated in Britain. To my knowledge, this phrase 
isn’t in current usage in Britain. I have however come across South Asians using mug up in this sense. 
(In efficient Singlish though, the preposition up is simply dropped.) 
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Work Work Work Work Work Work 


Pre-tertiary maths education in Singapore is no more “stupid and boring” than in the 
US (or elsewhere). The difference is that by the time the typical student in the US (or 
elsewhere) completes high school, she will only have squandered a very small portion of her 
life on such “mindless” and “formulaic” “pseudo-maths”. 


The same cannot be said for the typical Singaporean student. By the time she turns 18, 
she will have—just for the single subject of maths alone—clocked many thousands of hours 
attending school and tuition classes; doing homework, practice exam questions, assessment 
books, and TYS; taking common tests, promos, prelims, mid-year exams, and end-of-year 
exams; ad infinitum, ad nauseam. 


The Confucian East Asian countries®’ perform splendidly on international tests. For ex- 
ample, in the 2018 Programme for International Student Assessment (PISA) Maths test, 
they swept the top seven spots (see p. i).° 


“What,” the Western educator enquires, “is the magic here?”®’ But there is no magic. To 
me, the explanation for why East Asian students do so well on these tests is obvious: 


They’re forced to work their butts off. 


While the American teen is “wasting” her time on typical, “useless” teenager-ly pursuits, 
the Singaporean teen is seated obediently in front of his desk, doing yet another soul- 
crushing TYS question. And once in a while, children as young as ten commit suicide due 
to poor exam results.*® Kids the world over commit suicide for a variety of reasons, but 
only in East Asia do they regularly do so because of exams and schoolwork. 


In South Korea, legislators have even passed (ill-enforced) laws barring hagwons (private 
cram schools) from operating past 10 p.m. To the American teen, it is mind-blowing that 
(a) anyone would be in school past 10 p.m.; or that (b) this practice would grow so common 
that legislators saw fit to take action. But to the East Asian, this isn’t at all strange. 


To me, the fact that East Asian students bust their butts is obviously the single most 
important explanation for why they do so well on international tests. Yet strangely, in the 
countless papers and books that I’ve come across seeking to explain why some countries do 
better than others, this explanation is rarely ever considered.®” 





3°China, Hong Kong, Japan, Korea, Macao, Singapore, Taiwan, and Viet Nam. 

3©Viet Nam’s results have been omitted from the headline charts in the 2018 edition. I’m not sure why 
exactly and haven’t looked into this matter. But we have for example this somewhat-cryptic statement: 
“the statistical uniqueness of Viet Nam’s response data implies that performance in Viet Nam cannot 
be validly reported on the same PISA scale as performance in other countries” (“PISA 2018 Results, 
Volume I, What Students Know and Can Do”, 2019, p. 190, PDF). 

37Tn the US, “Singapore Math” has acquired something of a mythical status. As with weight loss, Americans 
are constantly on the lookout for some magic, painless solution to their mediocre education systems. 

38Tn 2001, ten-year-old Lysher Loh jumped to her death from her fifth floor apartment. She “had been 
disappointed with her mid-year examination results and had found the workload heavy.” She had also 
“told her maid Lorna Flores two weeks before her death she did not want to be reincarnated as a human 
being because she never wanted to have to do homework again.” In 2016, an 11-year-old “killed himself 
over his exam results by jumping from his bedroom window in the 17th-storey flat”. More suicides here. 

3°This explanation is neglected in part because the relevant statistics are difficult and expensive to collect. 
But a bigger reason for such neglect is probably that writers on education (be it in the West or in East 
Asia) are simply not quite aware of how much harder students in East Asia are made to work. 
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Taking Tests Seriously 


A closely related explanation is that East Asians are trained from young to take every test 
seriously. In contrast, US kids couldn’t care less about some inconsequential PISA test.*° 


And yes, the PISA test is truly inconsequential. Each individual student’s results are never 
revealed (not to the student herself, her teachers, her parents, her school, or anyone else). 
Students are simply forced to take a two-hour test*! whose results they’ll never know and 
which will have absolutely no consequence on their lives. 


Under such circumstances, I suspect the truly intelligent and educated kid would simply 
click through the test as quickly as possible. In contrast, the dull-witted, obedient, and 
well-trained student would actually take the test seriously. 


This explanation has received some academic attention. For example, in “Measuring Suc- 
cess in Education: The Role of Effort on the Test Itself” (2019), experimenters went to two 
high schools in the US® and four in Shanghai.*® Students at these schools** were made to 
do a 25-minute, 25-question PISA-like maths test. 


At each school, students were divided into a control group and a treatment group. Students 
in the control group got nothing. Each student in the treatment group was given US$25 or 
¥90 upfront, with US$1 or ¥3.60 later taken away for each wrong answer.*° 


In the US, the treatment groups did significantly better than the control groups. In contrast, 
in Shanghai, the treatment groups did no better: 


257 


US Shanghai 


207 


10- 


Score (out of 25) 








School 1 School 1 School 1 School 2 School 2 School 1 School 2 School 3 School 4 
low regular honors’ regular honors 


|| Control es | Treatment 











40Mast Asians care about exams as much as Americans care about sports, and vice versa. 

41The standard PISA test is two hours. More time may be spent on other questionnaires and assessments. 

““' A high-performing private boarding school and a large public school with both low- and average- 

performing students” (p. 295). 

*8“(O|ne below-average-performing school, one school with performance that is just above average, and 

two schools with performance that is well above average” (p. 295). 

“4 All students present on the day of testing took part in the experiment” (p. 295). 

“Using The Economist’s Big Mac Index, the authors reckoned that ¥90 was the purchasing-power equiv- 
alent of US$25 (see their p. 295 and n. 11). 
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This suggests that robotic Shanghai students have been trained and conditioned to always 
try their hardest, whether or not there’s any financial incentive. In contrast, US students 
may not be trying particularly hard when the stakes are low (or zero as is the case with 
PISA), but wll try a little harder when there’s some financial incentive. 


Note also that 


Importantly, students learn about the incentive just before taking the test, so any 
impact on performance can only operate through increased effort on the test itself 
rather than through, for example, better preparation or more studying. 


Hence, any remaining performance gap between US and Shanghai students could very well 
be eliminated if US students were incentivised (through carrot and stick) to prepare and 
work half as hard as Shanghai students.*° 


In two separate studies, the authors arrive at similar conclusions: 


e “a country can improve its ranking by up to 15 places by encouraging its own students 


to take the exam seriously”;“7 


e “our measures of student effort explain between 32 and 38 percent of the variation in 


test scores across countries”.*® 


Altogether then, there is very little that Western educators can learn from East Asia. The 
only lesson is this: /f you want your students to do well on PISA-like tests, then 


e Train them from young to take every test seriously; and 
e Force them to work their butts off. 


This may mean destroyed childhoods and adolescences, plus the occasional academic sui- 
cide. But that, surely, is a small price to pay for topping the PISA charts. 





“©Two more factors that may explain East Asians’ stellar performance (more pure speculation on my part): 


1. In the US, it would take a very unusual teacher or student to devote even a second preparing for 
PISA. In contrast, at many East Asian schools, many teachers and students will likely have spent at 
least some time preparing for it. 


2. In the peculiar case of China, there will likely also be a “patriotism effect”: Students, teachers, and 
administrators may be conditioned and exhorted to think, “It is our patriotic duty to do our best 
in order to show the rest of the world that China is great and glorious! China is no longer the Sick 
Man of Asia!” (Cue Chinese heroes defeating Somali pirates to the tune of March of the Volunteers.) 
Chinese teachers and students will therefore try harder than even other East Asians. 


“Taking PISA Seriously: How Accurate Are Low-Stakes Exams?” (2021). 
48VWhen Students Don’t Care: Reexamining International Differences in Achievement and Student Effort” 
(2019). 
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Genuine Understanding 


Again, the goal of this textbook is to impart genuine understanding. I suspect the 
sincere pursuit of this goal will do more to promote GDP growth than any of Gahmen’s 
current educational policies. 


But quite aside from any such instrumental value, I believe that a genuine understanding of 
maths (and indeed any other material) is intrinsically valuable. (GDP growth is lovely, 
but despite what Gahmen would have you believe, it is not all that makes life worthwhile. ) 
And heck, learning can even be that three-letter F word banished from the Singapore 
education system—and apparently also from playgrounds and HDB void decks: 


= ” - ——. 
eHOPUAYING|OF CHESS AT 


aa eOMMON AREAS 


4 


mentor si 





These were actual signs posted in Singapore. They went viral and were removed in 


June 2013 (New Paper story) and March 2016 (Straits Times story), respectively. 


Now, what do I mean by “imparting genuine understanding”? 


Personal anecdote: As a JC student, I remember being deeply mystified by why the scalar 
product had such a simple algebraic definition and yet could at the same time also tell us 
about the cosine of the angle between the two vectors. I never figured it out. But this 
didn’t matter, because this was simply “yet another formula” that we learnt for the sole 
purpose of answering exam questions.?? 


I remember being confused about the difference between the sample mean, the mean of the 
sample mean, the variance of the sample mean, and the sample variance. But this confusion 
didn’t matter, because once again, all we needed to do to get an A was to mindlessly apply 
formulae and algorithms. Monkey see, monkey do. 





49T remember complaining about this to a classmate, who responded along these lines, “But that’s how 
we've always been taught maths what. It’s just a bunch of formulae.” Of course, my classmate was right. 
Today, the intellectually curious student can easily find the answer on the internet. But at that time 
(2001-02), the internet was not quite as developed and so one could not easily find answers online. 
The non-East Asian reader may be wondering why I didn’t just ask a teacher if I wanted to know. The 
reason is that in East Asia, the student is prohibited from asking questions. Now, this prohibition is 
not absolute (indeed, happily, it has been gradually softening at least in Singapore). Nonetheless, this 
prohibition sufficed to deter me from asking too many questions. See e.g. Chuah (2010) for more about 
how this cultural phenomenon works. 
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This textbook is thus partly in response to my unhappy and unsatisfactory experience as a 
cog in the Singapore educational system. In other words, this is the textbook I wish I had 
had when I was a JC student. 


In the Singapore JC approach, teachers work through one example after another, demon- 
strating to students that something “works”. The student is then expected to faithfully 
replicate such “workings” on the exams. If the student acquires any understanding of what 
she’s doing, then that’s a nice but not terribly important bonus—what’s far more important 
is that the student produces the “correct” answers on the exams. 


Such perverse priorities are reversed in this textbook. Our foremost goal will always be to 
explain and understand why something is true. To then also be able to ace the exams is a 
natural but unimportant side effect. 


Almost all results are proven. I try to supply the intuition for each result in the simplest 
possible terms. Many proofs are relegated to the appendices, but where a proof is especially 
simple and beautiful, I encourage the student to savour it by leaving it in the main text 
(often as a guided exercise). In the rare instances where proofs are entirely omitted from 
this book—usually because they are too advanced—I make sure to clearly state so, lest the 
student be confused over where a result comes from or whether it’s supposed to be obvious. 


This textbook follows the Singapore A-Level syllabus.°? And so, a good deal of mindless 
formulae is unavoidable. Even so, I try in this textbook to give the student a tiny glimpse 
of what maths really is—“the art of explanation”.®' I try to plant a thoughtcrime in the 
student’s mind: Maths is not merely another pain to be endured, but can at times be a 
joy. And so for example, this textbook explains 


e A bit of intuition behind differentiation, integration, and the Fundamental Theorems of 
Calculus. (To get an A, no understanding of these is necessary. Instead, one need merely 
know how to “do” differentiation and integration problems. ) 


¢ A bit of intuition behind why the Maclaurin series “works”. (To get an A, it suffices 
to mindlessly write out various Maclaurin series without having the slightest clue what 
they are or where they come from.) 


e¢ Why the Central Limit Theorem (CLT) is so amazing. (To get an A, one need merely 
treat the CLT as yet another mysterious mathematical “trick” whose sole purpose is to 
solve exam questions. It isn’t necessary to appreciate why it is so amazing, where it 
might possibly come, or what relevance it has to everyday life.) 


e Why it is terribly wrong to believe that “a high correlation coefficient means a good 
model”. (Yet this is exactly your A-Level examiners seem to believe—see Ch. 132.9.) 





°°A quixotic, longer-term goal of mine is to entirely change that too. As Lockhart says, “[T]hrow the 
stupid curriculum and textbooks out the window!” 
°'Lockhart (2002, 2009). 


xlviii, Contents www.EconsPhDTutor.com 


Why Even Type 1 Pragmatists Should Read This Textbook 


‘Two reasons: 


1. The A-Level exams now include more curveball or out-of-syllabus questions. 


Previously, the A-Level exam questions were always perfectly predictable. If you had no 
problem doing past-year exam questions, then you’d have no problem getting an A. 


But starting in 2017 (coinciding with the new and supposedly reduced 9758 syllabus), 
curveball questions now carry a weight of perhaps 10-20%. For example, in 2017, out of 
absolutely nowhere, students were suddenly asked to use something called D’Alembert’s 
ratio test and to explain whether a series converges (see Exercise 575). 


I can find no official, publicly available statement announcing this change (much less ex- 
plaining it). I have heard only that JC maths teachers were informed by MOE of this 
change ahead of time. My guess is that this is the MOE’s highly creative method of creat- 
ing creative students. 


In my humble opinion, this change is cow manure. Challenging students with curveball 
questions is a perfectly good idea. What isn’t a good idea is to pose such questions on the 
high-stakes A-Level exam, thereby exerting additional pressure on the already pressure- 
cooked Singapore student.°” 


But I will confess that selfishly, I welcome this change because it increases the value of this 
textbook. The student who carefully studies this textbook will be rewarded with a true 
and deep understanding of all the H2 Maths material and hence be fully prepared to bat 
away any curveball. 


Take for example N2017/1/6. This unfamiliar problem will likely have come as a shock to the 
Singapore monkey drilled a thousand times over to “do” computational problems involving 
3D geometry, without ever understanding what he was doing. In contrast, any student 
who bothered to read Part III of this textbook even once will always have understood what 
she’d been “doing” and will thus have breezed through this problem. 





°2Of course, the ancient East Asian education system has conditioned students, teachers, and administra- 
tors to believe that anything that isn’t on the exam is worthless. And so, it is difficult to persuade them 
to learn or teach any material that isn’t also on the exams. 
Difficult yes, impossible no. Overcoming this ancient mindset will require years of hard work, cultural 
change, and courage. Simply adding curveball questions to the A-Level exams certainly does not help 
and indeed merely serves to reinforce that mindset: “Oh you have to learn this too, not because it’s 
interesting or intrinsically valuable, but because it might show up on your exam.” 


xlix, Contents www.EconsPhDTutor.com 


2. If you’re intending to do more maths in the future (this includes not just 
maths, but also physics, economics, engineering, and many other subjects), 
then this textbook will actually save you time in the long run. 


Merely doing well in A-Level H2 Maths may give you the false illusion that you’ve actually 
learnt or understood the material. Down the road, this may cost you more time. 


Another personal anecdote: When I began my undergraduate studies, I was still the typical 
kiasu Singaporean monkey trained to believe that life was a competitive, Social Darwinist 
race. And so I skipped a whole bunch of lower level maths classes (Calc. I, Calc. I, 
Statistics, and Linear Algebra), thinking I had already covered all the material back in JC. 


On paper, I may indeed have covered all this material. But in practice, all I'd learnt in JC 
was monkey-see-monkey-do. [’d learnt enough to do well on the exams, but not enough to 
actually understand or use any of the material. 


It was only many years later, with the benefit of hindsight, that I began to see how much 
of a mistake I had made. Skipping those classes saved me time and put me “ahead of the 
race” in the short run. But in the long run, this actually cost me dearly. I would actually 
have saved more time by not skipping those seemingly elementary lower level classes!”* 


And it wasn’t just that I had to spend time relearning everything from scratch. I also had 
to spend time unlearning and repairing the damage caused by my JC maths education. 


This textbook thus offers the sort of A-Level maths education I wished I had received.™ 
You'll be spending two years on H2 Maths anyway. And so, instead of wasting these two 
years learning mindless recipes which (a) you'll forget a few months after your A-Level 
exam; and (b) will do you more harm in the long-run, why not spend these two years 
actually learning and understanding material that you'll appreciate, remember, and can 
actually make use of at any point in your life? 


And of course, in my completely humble and unbiased opinion, the best way to learn and 
understand H2 Maths is by studying this textbook. 


I conclude this Preface/Rant by expressing my hope that even if you the instructor or 
student do not use this textbook as your primary instructional or learning material, you 
will still find it perfectly useful as an authoritative and reliable reference. 


P.S. This textbook is far from perfect. To steal a certain neighbourhood school’s motto, 
the best is yet to be. I hope to keep improving this textbook, but I can only do so with your 
help. So if you have any feedback or spot any errors, please feel free to email 
me. (As you can tell, I am pretty merciless about criticising others. So please don’t be shy 
about pointing out the many foolish mistakes that are surely still lurking in this textbook.) 





°3And of course, I would’ve saved even more time by skipping the Singapore education system, but alas, 
that wasn’t an option. 

°4Here I intend absolutely no knock or diss on my JC Maths teachers. They and indeed most of my 
Singapore teachers were generally pretty good (especially when compared to the US). They did the best 
they could—within the stultifying confines of the Singapore educational system. My critique here applies 
to that system. As the hip-hop cliché goes, I don’t hate the player(s); I hate the game. 
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PISA 2018 Maths Results 







































































































































































































































































B-S-J-Z | 591 NZ | 494 Brunei | 430 
Singapore | 569 Portugal | 492 Kazakhstan | 423 
Macao | 558 Australia | 491 Moldova | 421 
Hong Kong | 551 Russia | 488 Baku | 420 
Taiwan | 531 Italy | 487 Thailand | 419 
Japan | 527 Slovakia | 486 Uruguay | 418 
Korea | 526 Spain | 483 Chile | 417 
Estonia | 523 Luxembourg | 483 Qatar | 414 
Netherlands | 519 Hungary | 481 Mexico | 409 
Poland | 516 Lithuania | 481 Bosnia & Herz. | 406 
Switzerland | 515 US | 478 Costa Rica | 402 
Canada | 512 Belarus | 472 Jordan | 400 
Denmark | 509 Malta | 472 Peru | 400 
Slovenia | 509 Croatia | 464 Georgia | 398 
Belgium | 508 Israel | 463 N. Macedonia | 394 
Finland | 507 Turkey | 454 Lebanon | 393 
Sweden | 502 Ukraine | 453 Colombia | 391 
UK | 502 Greece | 451 Brazil | 384 
Norway | 501 Cyprus | 451 Argentina | 379 
Ireland | 500 Serbia | 448 Indonesia | 379 
Germany | 500 Malaysia | 440 Saudi Arabia | 373 
Czechia | 499 Albania | 437 Morocco | 368 
Austria | 499 Bulgaria | 436 Kosovo | 366 
Latvia | 496 UAE | 435 Panama | 353 
France | 495 Romania | 430 Philippines | 353 
Iceland | 495 Montenegro | 430 Dominican R. | 325 











Notes: “B-S-J-Z” = “Beijing-Shanghai-Jiangsu-Zhejiang”; Baku is in Azerbaijan; I’ve abbreviated or oth- 
erwise slightly changed some country names. 


Source: “PISA 2018 Results, Volume I, What Students Know and Can Do” (2019, pp. 17-18, PDF). 





°° And yes, I consider Macao, Hong Kong, and Taiwan countries that are separate from China. I hope the 
PRC gahmen and people will forgive me for this egregious sin (please don’t kidnap me and force me to 
make a TV confession). 
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Excerpts from “The Ugly Models” by Martha Nussbaum (2010) 


What do educators in Singapore and China do? By their own internal accounts, they do a 
great deal of rote learning and “teaching to the test.” Even if our sole goal was to produce stu- 
dents who would contribute maximally to national economic growth—the primary, avowed 
goal of education in Singapore and China—we should reject their strategies, just as they 
themselves have rejected them. In recent years, both nations have conducted major educa- 
tional reforms, concluding that a successful economy requires nourishing analytical abilities, 
active problem-solving, and the imagination required for innovation. In other words, nei- 
ther country has adopted a broader conception of education’s goal, but both have realized 
that even that narrow goal of economic enrichment is not well served by a system focused 
on rote learning. ... 


Singapore ... reformed its education policy in 2003 and 2004, allegedly moving away from 
rote learning toward a more “child-centered” approach in which children are understood 
as “proactive agents.” Rejecting “repetitious exercises and worksheets,” the reformed cur- 
riculum conceives of teachers as “co-learners with their students, instead of providers of 
solutions.” It emphasizes both analytical ability and “aesthetics and creative expression, en- 
vironmental awareness .... and self and social awareness.” .... Singapore and China are 
trying to move toward open-ended progressive education that cultivates student creativity ... 


Observers .... conclude that the reforms have not really been implemented. Teacher pay 
is still linked to test scores, and thus the incentive structure to effectuate real change is 
lacking. In general, it’s a lot easier to move toward rote learning than to move away from 
it, since teaching of the sort Dewey and Tagore recommended requires resourcefulness and 
perception, and it is always easier to follow a formula. 


Moreover, the reforms are cabined by these authoritarian nations’ fear of true critical free- 
dom. In Singapore, nobody even attempts to use the new techniques when teaching about 
politics and contemporary problems. “Citizenship education” typically takes the form of 
analyzing a problem, proposing several possible solutions, and then demonstrating how the 
one chosen by government is the right one for Singapore. In universities, some instructors 
attempt a more genuinely open approach, but the government has a way of suing professors 
for libel if they criticize the government in class, and even a small number of high-profile 
cases chills debate. One professor of communications (who has since left Singapore) re- 
ported on a recent attempt to lead a discussion of the libel suits in her class: “TI can feel 
the fear in the room. .... You can cut it with a knife.” Nor are foreign visitors immune: 
NYU’s film school”® has been encouraged to set up a Singapore branch, but informed that 
films made in the program may not be shown outside the campus. ... 


Singapore and China are terrible models of education for any nation that aspires to remain 
a pluralistic democracy. They have not succeeded on their own business-oriented terms, and 
they have energetically suppressed imagination and analysis when it comes to the future of 
the nation and the tough choices that lie before tt. 





°°In 2003, Singapore launched its “Global Schoolhouse” initiative to “bring in good foreign institutions 
as partners and attract ‘large numbers of international students’” (2021). This initiative failed, taking 
down with it NYU Tisch Asia (RIP 2012), Johns Hopkins Singapore (2006), NYUQ@NUS (2013), Chicago 
Booth Asia eMBA (2013), MIT-SUTD (2017), and Yale-NUS College (2021). 
Perhaps ambitious (and greedy) administrators at US universities will now finally heed the words of 
long-time Harvard president Derek Bok (2013): “I had my own run-in with Lee Kuan Yew some years 
ago ... Nothing in that experience would tempt me to try to establish a Harvard College in Singapore.” 
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Part O. 
A Few Basics 





Revision in progress (November 2021). 


And hence messy at the moment. 
Appy polly loggies for any inconvenience caused. 
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The glory of [maths] is its complete irrelevance to our lives. That’s why it’s so 
fun! 


— Paul Lockhart (2002, 2009). 


I have never done anything ‘useful? No discovery of mine has made, or is likely 
to make, directly or indirectly, for good or ill, the least difference to the amenity 
of the world. 


— G.H. Hardy (1940). 
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1. Just To Be Clear 


In this textbook, we'll stick to these standard conventions: 

¢« Greater than means “strictly greater than” (>). So I won’t bother saying “strictly”, 
unless it’s something I want to emphasise. 

¢ Similarly, less than means “strictly less than” (<). 


¢ If I want to say greater than or equal to (>) or smaller than or equal to (<), [ll 
say exactly that. 


¢ Positive means “greater than zero” (> 0) and non-positive means “less than or equal 
to zero” (< 0). 


¢ Similarly, negative means “less than zero” (< 0) and non-negative means “greater 
than or equal to zero” (> 0). 


e Zero is neither positive nor negative. Instead, it is both non-negative and non- 
positive.” 


Names of some punctuation marks: 





Left Right A pair of 
Parenthesis ( Parenthesis _) Parentheses () 
Bracket [ Bracket | Brackets _ [] 
Brace { Brace } Braces {} 











e The symbol -.. and .. stand for the words because and therefore. 


¢ The punctuation mark “ means ditto or “the same as above/before”: 


Example 1. Tokyo is the capital of Japan. 


Beijing China. 


Lima Peru 





¢ The multiplication symbol - is (sometimes) preferred to x because there is (sometimes) 
the slight risk of confusing x with the letter 2. 





°’But in France, positif and négatif mean >0 and <0, so that 0 is both positif and négatif (Wiktionary). 
’There is actually another pair of brackets () called angle brackets. If we’re using angle brackets, then 
we'll want to be careful to distinguish them from [] by referring to the latter as square brackets. 
Happily, we won’t be using angle brackets at all in this textbook. And so, we’ll simply call [] brackets. 
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2. PSLE Review: Division 


Example 2. Consider 9 = 4: 
We call 
e 9 the dividend 


e 4 the divisor 


e g=2 the quotient—g is the largest integer such that 4q < 9 


e r=1 the remainder—-, is defined so that 9 = 4q + r or equivalently, 


r=9-4q¢=9-4x2=1. 


Example 3. Consider 17 + 3: 
We call 
17 the dividend 


3 the divisor 
q = 5 the quotient—g is the largest integer such that 3q < 17 


r = 2 the remainder—-, is defined so that 17 = 3g +r or equivalently, 


real(—s¢=17—3x5=2. 





The above two examples used an algorithm”? called Euclidean division: 


Definition 1. (Euclidean division) Given positive integers x and d,® let q be the 
largest integer that satisfies dg < x. Then let r = 7 - dq 20. 


In the equation x = dq+r, we call x the dividend, d the divisor, q the quotient, and r the 
remainder. 


And if r =0, then we say that x divided by d leaves no remainder and that d is a factor 
of (or divides) x. 





It is possible to prove that thus defined, the quotient and remainder are unique.°! 


Example 4. Find 95+ 4 using Euclidean division: 


1. Find the largest integer g such that 4q < 95: g = 23. 


2. Now compute the remainder: r = 95 — 4g = 95-4 x 23 = 3. 


95 ss! 
3. Conclude: — = 23-. 
onclude: 7 1 








°° Algorithm is just a fancy synonym for method. 

For simplicity, this definition deals only with the case where x,d > 0. For a more general definition that 
covers also the cases where x and/or d may be negative, see Definition 262 (Appendices). 

See Theorem 50 (Appendices). 
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Example 5. Find 18+ 6 using Euclidean division: 


1. Find the largest integer g such that 6q < 18: g=3. 
2. Now compute the remainder: r = 18-6q¢=18-6x3=0. 


3. Conclude: - =. 


Since r=0, we say that 18+ 6 leaves no remainder and that 6 is a factor of or 
divides 18. 
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2.1. Long Division 


Remember long division from primary school? It’s simply Euclidean division repeated at 
each decimal place: 


Example 6. Find 95 +4 using long division. Seven steps: 


1. First, draw two lines, one horizontal and one vertical.°? Then write the dividend 95 
“inside” and the divisor 4 on the “left”. 


Long division is repeated Euclidean division—specifically, we do Euclidean division at 
each decimal place (ones, tens, hundreds, etc., but starting from the biggest place): 


. Starting at the tens place, we ask, “What is the largest integer such that 40q, < 95?” 
Answer: g; = 2. So, we write 2 above the horizontal line, at the tens place. 


. Compute 40q, = 40 x 2 = 80 and write “80” below our dividend 95. Also, draw a 
horizontal line below “80”. 


. Compute the remainder 95 — 80 = 15 and write “15” below the line just drawn. 


Step 1. Step 2. Step 3. Step 4. Step 6. Step 7. 


D >) 9 23 23 
4 | 99 4| 95 4|95 4/95 4|95 4| 95 
80 80 80 80 


15 15 15 
12 {2 
3 


. Now move to the ones place. Ask, “What is the largest integer such that 4g) < 15?” 
Answer: go = 3. So, write 3 above the top horizontal line, at the ones place. 


. Compute 4q9 = 4x 3 = 12 and write “12” below our remainder “15”. Also, draw a 
horizontal line below “12”. 


. Compute the remainder 15 —- 12 = 3 and write “3” below the line just drawn. 


We’re done! The quotient is 23 and the remainder is 3: 


3 
Oe 
ie a 


We could actually keep going, repeating the same procedure for subsequent decimal 
places: the tenths, hundredths, etc. If we did this, we’d find that 


95 
Soe ee 
4 








®The vertical line is actually usually a curve. But it was too much trouble/work getting this typesetting 
to work and look nice. So, I’ve simply settled for a vertical line. 


6, Contents www.EconsPhDTutor.com 


Example 7. Find 87 +7 using long division: 


12 
ae 
70 

cia 
14 

= 


Exercise 1. Find 8057 +39 using long division. Identify the dividend, divisor, quotient, 
and remainder. (Answer on p. 1747.) 
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2.2. The Cardinal Sin of Dividing By Zero 


Dividing by zero is a common mistake. It’s easy to avoid this mistake when the divisor 
is obviously a big fat zero. It’s harder when the divisor is an unknown constant or variable 
that may or may not be zero. 


Example 9. Solve x(a -1) = (2a4-2)(a-1). (That is, find the values of x for which 
the equation is true. We call such values of x the solutions or roots to the equation.) 


Here’s a wrong answer: “Divide both sides by x-1 to get x = 24-2. So r=2.” X 


Here’s a correct answer that considers two possible cases: 


Case 1. If «-1=0, then the equation is true. So, x = 1 is a possible solution. 


Case 2. If «-1+#0, then we can divide both sides by x - 1 to get x = 2x - 2. 
So, x = 2 is another possible solution. 


Conclusion. The two possible solutions are x = 1 and x = 2. 





Moral of the story. Dividing by zero may cause us to lose perfectly valid solutions. So, 
always make sure the divisor is non-zero. If you’re not sure whether it equals zero, then 
break up your analysis into two cases, as was done in the above example: 


Case 1. The quantity equals zero (and see what happens in this case). 


Case 2. The quantity is non-zero (in which case go ahead and divide). 


Exercise 2. What’s wrong with the following seven-step “proof” that 1 = 0? 


. Let x and y be positive numbers such that x = y. 

. Square both sides: x? = y’. 

. Rearrange: x7 — y” = 0. 

. Factorise: (x - y) (a+ y) =0. 

. Divide both sides by 7 - y to get x+y =0. 

. Since x = y, plug y = x into the above equation to get 2x = 0. 

. Divide both sides by 2z to get 1 =0. (Answer on p. 1747.) 


Exercise 3. Identify any error(s) below.” (Answer on p. 1747.) 


Example ce 


42% 35% 
x—2Z — 


: divide both numerators by x, and you will have 


ae ; therefore 42 = Psa he 
35X—70; Ai cieeqami= et se ear that is, 7x— 126 


==-—70; therefore 7x == 126— 70, that is, 7x == 56 ; andes 6: 


; therefore 42 x—~ 126 = 








°’This was taken from an 18th-century textbook (The Elements of Algebra, 1740, p. 103), written by one 
Nicholas Saunderson (1682-1739). Saunderson was the fourth Lucasian Professor of Mathematics (at 
the University of Cambridge)—this is a fancy title that’s been held by Isaac Newton (1642-1727) and 
Stephen Hawking (1942-2018). 
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2.3. Is One Divided by Zero Infinity? 


By the way, let’s take this opportunity to clear up a related and popular misconception. 
You’ve probably heard someone saying that 


1 
This is wrong. Instead, 0 eS: J 


Any number divided by zero is undefined.™ 


Undefined is the mathematician’s way of saying 


You haven't told me what you’re talking about. 
So what you’re saying is meaningless. 


And so, the following five expressions are all undefined: 


1 50 -17.1 c -o 
Cr 0 * Oe OF ig 











®4Tn the context of limits, the expression 0 +0 is sometimes also called an indeterminate form, but this 
isn’t something H2 Maths students need care about. 
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3. Logic 


Big surprise—you’ve secretly been using logic your whole life. 


Logic isn’t explicitly on your H2 Maths syllabus.® But spending an hour or two on logic 
pays huge dividends—you’ll learn to reason better, both in maths and in everyday life. 


This chapter is thus a brief and gentle introduction to logic. Here we merely present some 
of the most basic and useful results from logic. (If you truly can’t be bothered, please at 
least check out the one-page summary of this chapter on p. 41.) 


First, try this appetiser.°° 


Example 10. (Wason Four-Card Puzzle) In a special deck of cards, each card has a 
letter on one side and a number on the other. You are shown these four cards: 


Shanmugam the Liar now comes along and makes this claim: 
“Tf a card has a vowel on one side, then it has an even number on the other side.” 


You suspect that Shanmugam is wrong. To prove him wrong, which of the above four 
cards should you turn over? (The goal is to turn over as few cards as possible.) 





The above puzzle baffles most who are untrained in logical thinking—most turn over A 
(correct) and 28 (wrong). Right now, that probably includes you. But by p. 26 of this 
textbook, you'll have had some training in logic and thus be able to solve this puzzle easily. 


By the way, psychologists®* have found that people find it easier to solve an entirely equiv- 
alent but less abstract version of the above puzzle: 





Tf it were up to me, the H2 Maths syllabus would devote at least a little time to logic. Instead, that 
time is spent on learning to mindlessly compute such objects as the volume of the revolution of a curve 
around the y-axis. Which is a doggie trick that (1) students will forget two weeks after the final A-Level 
exam; and (2) is completely useless unless you’re planning to be an engineer or physicist, in which case it 
is still completely useless since down the road, you’ll be learning it again (and probably more properly). 

This is a slightly reworded version of Wason (1966, pp. 145-146). 

®7 Answer: A and 11. We’ll explain this on p. 26. 

®8Griges and Cox (1982). 
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Example 11. (Wason Four-Card Puzzle, Alcohol Version) In a special deck of 
cards, each card has a person’s age on one side and the drink she’s having on the other. 
You are shown these four cards: 


ABSOULT 4 
(=) Lb) ED 


Shanmugam the Liar now comes along and makes this claim: 
“Tf a person is having alcohol, then she must be at least 18.” 


You suspect that Shanmugam is wrong. To prove him wrong, which of the above four 
cards should you turn over? (The goal is to turn over as few cards as possible.)°? 


(Absolut Vodka is alcoholic; Milo isn’t.) 








69 Answer: Absolut Vodka and 11. 
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3.1. True, False, and Indeterminate Statements 


Example 12. Let A, B, C, and D be these statements: 


A: “Germany is in Europe.” 
B “Germany is in Asia.” 
Cr: ‘lel = 2) 

D ee aa. 


Statements A and C are true, while B and D are false. 


(We'll use Y and X as shorthand for true and false.) 


Example 13. Let W/, N, and O be these statements: 


eS 0! 
is ale alle 


O: “zx is a positive number.” 


Note that the truth values of M, N, and O depend on the value of x. That is, whether 
each statement is true or false depends on the value of x. 


So, without further information, each of these three statements can neither be said to be 
true nor said to be false. Instead, we say that each is indeterminate. 


But if we’re told that 


e x =0, then all three statements are true. 
e x =0.5, then statements M and O are true, while N is false. 


e x = -1, then all three statements are false. 
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3.2. The Conjunction AND and the Disjunction OR 


Example 14. As before, let A, B, C’, and D be these statements: 


A: “Germany is in Europe.” 4 
B: “Germany is in Asia.” 

G: ‘41-27 "A 
DD; alee AC x 


Using the logical connective AND (called the conjunction), we can form these state- 
ments (which we also call conjunctions): 


AAND B: “Germany is in Europe AND Germany is in Asia.” X 
AANDC: “Germany is in Europe AND 1+1 = 2.” "A 
BAND D: “Germany is in Asia AND 1+1=3.” x 


e« The conjunction A AND B is false because B is false. 
e« The conjunction A AND C is true because both A and C are true. 
¢ The conjunction B AND D is false because B is false. (Indeed, D is also false.) 


Using the logical connective OR (called the disjunction) we can form these statements 
(which we also call disjunctions): 


AOR B: “Germany is in Europe OR Germany is in Asia.” X 
AORC: “Germany is in Europe OR 1+ 1 = 2.” "A 
BORD: “Germany is in Asia OR 1+1=3.” x 


e The disjunction AOR B is true because A is true. 
¢ The disjunction A OR C is true because A is true. (Indeed, C is also true.) 
e The disjunction B OR D is false because both B and D are false. 


Definition 2. Let P and Q be statements. The conjunction of P and Q is the statement 
denoted P AND Q and defined as 


¢ True if both P and Q are true; and 


e False if at least one of P or Q is false. 


The disjunction of P and Q is the statement denoted P OR Q and defined as 


¢ True if at least one of P or Q is true; and 
¢ False if both P and Q are false. 
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3.3. The Negation NOT 
Informally, a statement’s negation is the “opposite” or contradictory statement. Formally, 


Definition 3. Let P be astatement. The negation of P is the statement denoted NOT-P 
and defined as 


true if P is false and false if P is true. 


Example 15. As before, let A, B, C, and D be these statements: 


A: “Germany is in Europe.” 4 
B: “Germany is in Asia.” 

C: oa ee ea "A 
Bs “+1 =37 x 


The negations of statements A, B, C’, and D are simply these statements: 


NOT-A: “Germany is not in Europe.” X 
NOT-B: “Germany is not in Asia.” ¥ 
NOT-C: a ee ee x 
NOT-D: “lel 32 "A 


e Since A and C are true, their negations NOT-A and NOT-C must be false. 
e Since B and D are false, their negations NOT-B and NOT-D must be true. 


The negation of the negation simply brings us back to the original statement: 


NOT-NOT-A: “Germany is in Europe.” 4 
NOT-NOT-B: “Germany is in Asia.” X 
NOT-NOT-C: 2 le oe ag "A 
NOT-NOT-D: “T+ 1=37 x 


Exercise 5. Let EF: “It’s raining”, F: “The grass is wet”, G: “I’m sleeping”, and H: 
“My eyes are shut”. 
Write down NOT-E, NOT-F’, NOT-G, and NOT-H. (Answer on p. 1748.) 
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3.4. Equivalence <=> 


Example 16. Let x be a real number and M, N, and O be these statements: 


Meee > Ue 
pa ice 


O: “zx is a positive number.” 


Observe that whatever x is, M and O always have the same truth value—i.e. whenever 
M is true, O is also true; and whenever M is false, O is also false. So, we say that M 
and O are equivalent and write 


M << OO. 


We will also say that “P is equivalent to Q” or “P if and only if Q”.” 


In contrast, if x = 0.5, then M is true while N is false. Since M and N do not always 
have the same truth value, we say that / and N are not equivalent and write 


M & N. 


We will also say that “P is not equivalent to Q”. 





So, if we can find even one single instance where the statements P and @ have different 
truth values, then P <> Q. If we can’t, then P = > Q. 


Definition 4. Let P and Q be statements. Suppose P and Q always have the same truth 
value (i.e. whenever P is true, Q is also true; and whenever P is false, Q is also false). 
Then we say that P and Q are equivalent and write 

PSO) 
We will also say that “P is equivalent to Q” or “P if and only if Q”. 


Example 17. Let x be a real number and a and £ be these statements: 


OQ: Ea 3 and Bo e+e 225" 


If a is true, then (@ is true. And if a is false, then ( is also false. Since a and (§ always 
have the same truth value, we say that a and ( are equivalent and write a <=> £6. 








Tf and only if is sometimes shortened to iff. 
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Example 18. Let A, B, C, and D be these statements: 


“Germany is in Europe.” 


A 

B: “Germany is in Asia.” 
CG: ‘1 a2? 

D oe eg 


Both statements A and C' are always true. So, A <=> C. 


Both statements B and D are always false. So, B <> D. 


Also, A <= B,A <= D,C <> B,andC <> D. 
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3.5. De Morgan’s Laws: Negating the Conjunction and Disjunction 


The negation of P AND Q is denoted NOT-(P AND Q). 


Fact 1. Suppose P and Q are statements. Then 


NOT-(PANDQ) <= NOT-PORNOT-Q. 








Proof. See p. 1546 (Appendices). 











Example 19. Let A: “Germany is in Europe”; B: “Germany is in Asia”; C: “1+1 = 2”, 
and Ds “1+ le 3”. 


Consider these three statements: 


1. AANDB: “Germany is in Europe AND Germany is in Asia.” X 
2. CAND D: “1+1=2 AND 1+1=3.” x 
3. AANDC: “Germany is in Europe AND 1+1 = 2.” "A 


Now consider these three statements’ negations: 
1. NOT-(A AND B) is true. There are two ways to see this: 


e Since A AND B is false, its negation NOT- (A AND B) must be true. 
e By Fact 1, NOT-(A AND B) is equivalent to (NOT-A OR NOT-B): “Germany 


is not in Europe OR Germany is not in Asia”. Which is true because NOT-B: 
“Germany is not in Asia” is true. 


2. NOT-(C AND D) is true. Two ways to see this: 


¢ Since C AND D is false, its negation NOT-(C AND D) must be true. 
e By Fact 1, NOT-(C AND D) is equivalent to (NOT-C OR NOT-D): “1+1 #2 
Ol lel s Whichis trie beeaise NOU -—D. Ieal= a” is true, 
3. NOT-(A AND C) is false. Two ways to see this: 


¢ Since A AND C is true, its negation NOT- (A AND C) must be false. 

e By Fact 1, NOT-(A AND C) is equivalent to (NOT-A OR NOT-C): “Germany 
is not in Europe OR 1+1# 2”. Which is false because both NOT-A: “Germany is 
not in Europe” and NOT-C: “1+ 1 # 2” are false. 








"Tn arithmetic, we have the PEMDAS order of operations convention: Parentheses, Exponentiation, 
Multiplication, Division, Addition, Subtraction. It turns out that in logic, the (usual) convention is 
Parentheses, NOT, AND, OR, =>, <=>. (But this isn’t something A-Level Maths students need 
know, so where there is any risk of doubt, I’ll use parentheses. ) 
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Exercise 7. Continue with the last example. Is the negation of each statement below 
true? (Answer on p. 1748.) 


(a) BANDC. (b) AAND D. (c) BAND D. 





The negation of P OR Q is denoted NOT- (POR Q). 
Fact 2. Suppose P and Q are statements. Then 


NOT-(PORQ) <— NOT-PANDNOT-Q. 








Proof. See p. 1547 (Appendices). 


Example 20. As before, let A: “Germany is in Europe”; B: “Germany is in Asia”; C: 
[l= ands Be lao”. 


Consider these three statements: 











1. AORB: “Germany is in Europe OR Germany is in Asia.” 4 
ZAC OR, "+ l=2 OR. 1 +1230 v 
3. AORC: “Germany is in Europe OR 1+1 = 2.” "A 


Now consider these three statements’ negations: 
1. NOT-(A OR B) is false. There are two ways to see this: 


¢ Since A OR B is true, its negation NOT- (A OR B) must be false. 
e By Fact 2, NOT-(A OR B) is equivalent to (NOT-A AND NOT-B): “Germany 


is not in Europe AND Germany is not in Asia”. Which is false because NOT-A: 
“Germany is not in Europe” is false. 


2. NOT-(C OR D) is false. Two ways to see this: 


e Since COR D is true, its negation NOT-(C OR D) must be false. 
e By Fact 2, NOT-(C OR D) is equivalent to (NOT-C AND NOT-D): “1+1 #2 
AND 14+1+#3”. Which is false because NOT-C: “1+1+# 2” is false. 
3. NOT-(A ORC) is false. Two ways to see this: 


¢ Since A ORC is true, its negation NOT-(A ORC) must be false. 

¢ By Fact 2, NOT-(A ORC) is equivalent to (NOT-A AND NOT-C): “Germany 
is not in Europe AND 1+1 #2”. Which is false because NOT-A: “Germany is not 
in Europe” is false. Indeed, NOT-C: “1+ 1 # 2” is also false. 
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3.6. The Implication P => Q 
The statement “P = > Q” is read aloud as “If P, then Q” or “P implies Q”. 


Example 21. Let E: “It’s raining” and F’: “The grass is wet”. 


Then the implication E => F is this statement: 

“Tf it’s raining, then the grass is wet.” 
Or equivalently, “That it’s raining implies that the grass is wet.” 
Which is true.” 


Example 22. Let G: “I’m sleeping” and H: “My eyes are shut”. 


Then the implication G = > H is this statement: 

“Tf I’m sleeping, then my eyes are shut.” 
Or equivalently, “That I’m sleeping implies that my eyes are shut.” 
Which is true.” 


Example 23. Let M: “x>0O” and N: “a> 1”. 


Then the implication M = > N is this statement: 
"hia SU inenms S21," 
Or equivalently, “That x > 0 implies that x > 1.” 


Which is false (counterexample: x = 0.5). 


Example 24. Let a: “x = 3.” and 7: “x? = 9.” 


Then the implication y = > a is this statement: 
“If ¢? = 9, then x = 3” 


Or equivalently, “That x? = 9 implies that x = 3.” 





Which is false (counterexample: x = —3). 


This all seems simple enough. However, you may find the formal definition of P => Qa 
little strange and unintuitive: 





™Provided the grass is outdoors. 
For most healthy human beings. 
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Definition 5. Given statements P and Q, the implication P = > Q is defined as 


NOT-P ORQ. 


That is, P = > Q is the statement defined as 


¢ True if P is false OR Q is true; and 
e False if P is true AND Q is false. 


Given the implication P = > Q, we call P the hypothesis, premise, or antecedent; and 
Q the conclusion or consequent. 





What’s most confusing about the above definition is this: From a false hypothesis, any 
conclusion may be drawn! 


That is, If P is false, then P = > Q is always true! 


Example 25. Consider these three statements: 


1. “If goldfish can walk on land, then pigs can fly.” 
2. “If there are 17 washing machines on Mars, then I am a billionaire.” 
3. “If x>0 and x <0, then V2 =77.” 


Pigs cannot fly, I am not a billionaire, and /2 # 77. It may thus seem that all three 
statements must be false. 


But strangely enough, all three are true! To see why, use the above Definition to rewrite 
the three statements as 


1. “Goldfish cannot walk on land OR pigs can fly.” Which is true, because “Goldfish 
cannot walk on land” is true. 


. “There aren’t 17 washing machines on Mars OR I am a billionaire.” Which is true, 
because “There aren’t 17 washing machines on Mars” is true. 


. “NOT-(z > 0 AND « < 0) OR V2 = 77” Which is true, because “NOT-(z > 0 AND 
x <0)” is true—it is impossible that x is both more than AND less than 0. 


The examples here illustrate that from a false hypothesis, any conclusion may be drawn! 


Exercise 9. Is each statement true? (Answer on p. 1749.) 


(a) “If Tin Pei Ling (TPL) is a genius, then the Nazis won World War II (WW2).” 
(b) “If TPL is a genius, then the Allies won WW2.” 


(c) “If 7 is rational, then I am the king of the world.” 


(d) “If 7 is rational, then Lee Hsien Loong is Lee Kuan Yew’s son.” 
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3.7. Other Ways to Express P = > Q (Optional) 


Eskimos supposedly have 50 different words for snow, presumably because snow is so im- 
portant and ubiquitous in their lives.“ 


Similarly, because the implication P = = @Q is so important and ubiquitous in both 
mathematics/logic and everyday life, we have many equivalent ways to express it: 


Example 26. Let E: “It is raining” and F: “The grass is wet”. Then all of the following 
statements are exactly equivalent: 


Maths/Logic Everyday English 


ae That it’s raining implies that the grass is wet. 
fe 








ie It’s raining only if the grass is we 





If B, then F’. If it’s raining, then the grass is wet. 





It Ey yk If it’s raining, the grass is wet. 





F if EB. The grass is wet if it’s raining. 





F when E. The grass is wet when (or whenever) it’s raining. 





F follows from E. That the grass is wet follows from the fact that it’s raining. 





FE is sufficient for F’. That it’s raining is sufficient for the grass to be wet. 








F is necessary for E. It is necessary that the grass is wet, for it to be raining. 





But don’t worry. The above is just FYI. 


In the main text of this textbook, we’ll avoid using the terms “only if”, “sufficient”, and 
“necessary”. Instead, we’ll stick to using only these three (equivalent) statements: 


°P = Q, “P implies Q”, and “If P, then Q”. 








™4See this Washington Post story: “There really are 50 Eskimo words for ‘snow’”. 
It’s far from obvious, but implies is logically equivalent to only if. And thus, the symbol ==> can be 
read aloud not only as implies, but also as only if. 
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3.8. The Negation NOT-(P = > Q) 


Example 27. Let J: “x is German” and J: “x is European”. 


Consider the implication [ => J: 
“If x is German, then x is European.” 


Which of the following correctly negates 1 = > J? That is, which of the following 
statements is NOT-(J => J)? 

(a) “If z is German, then x is not European.” 

(b) “If x is not German, then x is European.” 

(c) “a is German and not European.” 


(d) “ax is European and not German.” 


This is tricky and you should take as long as you need to think about it, before reading 
the answer/explanation on the next page. The point of this exercise is to demonstrate to 
yourself that it isn’t obvious what the negation of an implication is. (Or if it’s obvious, 
it'll demonstrate that you’re pretty smart.) 


(As I’ve repeatedly stressed, do not do the intellectually lazy thing of skipping ahead. 
Give it at least three minutes of honest effort before going to the next page.) 


Or dan. 


Whatevs. 


If you dun care, I oso dun care. 





(Example continues on the next page ...) 
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It will be easy to negate any implication with this result: 


Fact 3. Suppose P and Q are statements. Then 





NOT-(P = @) (P AND NOT-Q). 











Proof. See Exercise 11. 





Exercise 11. Prove the above Fact.” (Answer on p. 1749.) 


(... Example continued from the previous page.) 
Continue to let J: “x is German” and J: “x is European”. 
The implication J = > J is, “If x is German, then x is European.” 


So, its negation, NOT- (J ==> J), is by Fact 3 equivalent to 
I AND NOT-J: “x is German AND z is not European”. 
Exercise 12. Let K: “x is donzer” and L: “x is kiki”. Consider this implication: 
K => Lit x is donzer, then @ is kiki” 


Which statement below negates the above implication? That is, which is 
NOT-(kK ==> L)? 


(a) “If x is kiki, then x is not donzer.” 

(b) “a is kiki and not donzer.” 

(c) “If x is donzer, then x is not kiki.” 

(d) “x is donzer and not kiki.” (Answer on p. 1749.) 








Hint: Look at the definition of P = > Q (Definition 5). What is its negation? 
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We can now solve the Wason Four-Card Puzzle (reproduced): 


Example 10. (Wason Four-Card Puzzle) In a special deck of cards, each card has a 
letter on one side and a number on the other. You are shown these four cards: 


Shanmugam the Liar now comes along and makes this claim: 


“Tf a card has a vowel on one side, then it has an even number on the other side.” 


You suspect that Shanmugam is wrong. To prove him wrong, which of the above four 
cards should you turn over? (The goal is to turn over as few cards as possible.) 


Answer: Turn over A and 11. 


Explanation I. By Fact 3, the negation of P = > Q is P AND NOT-Q. So, the 
negation of Shanmugam’s claim is 


“A card has a vowel on one side AND an odd number on the other side.” 


Hence, turn over 
e Any vowel to see if it has an odd number on the other side (this would prove Shan- 
mugam wrong); and 


e Any odd number to see if it has vowel on the other side (“). 


In case you weren’t convinced, here’s Explanation II, which doesn’t directly use Fact 
3. We can call this the brute-force case-by-case method: 


1. An odd number behind A would prove Shanmugam wrong. So, we should turn over 
A. 


2. An odd number behind M would not prove Shanmugam wrong. Nor would an even 
number. So, we needn’t turn over M. 


3. A vowel behind 11 would prove Shanmugam wrong. So, we should turn over 11. 


4. A vowel behind 28 would not prove Shanmugam wrong. Nor would a consonant. So, 
we needn't turn over 28. 
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3.9. The Converse Q => P 


Informally, converse = “flip”. Formally, 


Definition 6. Given the implication P = > Q, its converse is the statement Q = > P. 


Example 28. Let E: “It’s raining” and F’: “The grass is wet”. ‘The implication FE => F 
is 


“Tf it’s raining, then the grass is wet.” 


The converse of the implication EF = > F is the implication F = > E: 


“Tf the grass is wet, then it’s raining.” 


Or equivalently, “That the grass is wet implies that it’s raining.” 


Note that F = = E is false. One way to show a statement is false is to provide a 
counterexample. Here we want to provide a counterexample where F is true (the grass 
is wet) but F is false (it isn’t raining). We can easily think of such counterexamples: 

1. The rain just stopped. 

2. Someone is watering the grass. 


3. A dog is peeing on the grass. 





4. You are peeing on the grass. 


In the above example, an implication E = > F is true but its converse F = > E is false. 

We now give an example where an implication is true and its converse is also true: 
Example 29. Let A: “Germany is in Europe” and ; B: “Germany is in Asia”; C: 
"+ lea2", and) “a= 3" 


The implication A = > C: “If Germany is in Europe, then 1+1 = 2” is true. Its converse 


C => A: “If 1+1=2, then Germany is in Europe” is also true. 


The implication B —> D: “If Germany is in Asia, then 1+1 = 3” is true. (Why?)” Its 
converse D => B: “If 1+1=3, then Germany is in Asia” is also true. (Why?)® 





The above two examples show that a true implication could have a converse that’s either 
true or false. This is stated as Fact 4(a) below. 


Now, what about a false implication? Perhaps surprisingly, a false implication’s converse 
must be true. This is stated as Fact 4(b) below. Example: 





™The hypothesis B is false and so the implication B => D is automatically true. 
The hypothesis D is false and so the implication D => B is automatically true. 
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Example 30. Let A: “Germany is in Europe” and ; B: “Germany is in Asia”; C: 
“ee le 2” and D141 23". 


The implication A = > D: “If Germany is in Europe, then 1+ 1 = 3” is false. But its 
converse D => A: “If 1+1=3, then Germany is in Europe” is true. (Why?)” 


The implication C = > B: “If 1+1 = 2, then Germany is in Asia” is false. But its 


converse B ==> C: “If Germany is in Asia, then 1+1=3” is true. (Why?) 


Fact 4. Let P and Q be statements. 


(a) If P = > Q is true, then Q ==> P can be true or false. 
(b) If P => Q is false, then Q = > P must be true. 





Proof. By definition, P = > Q is equivalent to NOT-P OR Q; similarly, Q =— P is 
equivalent to NOT-Q OR P. 


(a) Suppose P => Q is true. Then either NOT-P or Q is true—equivalently, P is false 
or Q is true. Let’s look at the four possible cases: 

1. P is false and Q is true, in which case Q = > P is false; 

2. P is false and Q is false, in which case Q = > P is true; 

3. Q is true and P is false, in which case Q = > P is false; 


4. Q is true and P is true, in which case Q = > P is true. 


So indeed, Q = > P can be true or false. 


(b) Suppose P = > Q is false. Then both NOT-P and Q are false—equivalently, P is 
true and Q is false. So, Q = > P is true. 

















The hypothesis D is false and so the implication D => A is automatically true. 
8°The hypothesis B is false and so the implication B => C is automatically true. 
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Exercise 13. Let G: “I’m sleeping”; H: “My eyes are shut”; M: “x >0”; N: “x > 1”; 


a: “x = 3”; and y: “x? = 9”. For each statement below, explain whether it’s true, write 


down its converse, and explain whether this converse is true. (Answer on p. 1750.) 


(a) -G =H. -(b) 1] N. “e) 4 a. 


Exercise 14. In Exercise 9, we showed that each statement below is true. Now, write 
down its converse and explain whether this converse is true. (Answer on p. 1750.) 
(a) “If Tin Pei Ling (TPL) is a genius, then the Nazis won World War II (WW2).” 

(b) “If TPL is a genius, then the Allies won WW2.” 

(c) “If 7 is rational, then I am the king of the world.” 


(d) “If 7 is rational, then Lee Hsien Loong is Lee Kuan Yew’s son.” 


Exercise 15. Let A: “Germany is in Europe”, B: “Germany is in Asia”, C: “1+1 = 2”, 
and D: “1+1=3”. For each statement below, explain whether it’s true, write down its 
converse, and explain whether this converse is true. (Answer on p. 1750.) 


(a) ASB. tb) SC. Ke) 2S Xd) CC ==: 


Exercise 16. Fill in each blank below with (i) must be true; (ii) must be false; or (iii) 
could be true or false. (Answer on p. 1751.) 


(a) If P is true, then P —> Q ado —- 2 
(b) If P is false, then P => Q and QO == P 
(oc) Tt PO ie tre, then O == FP 
(d) If P => @ is false, then Q => P 





29, Contents www.EconsPhDTutor.com 


3.10. Affirming the Consequent (or The Fallacy of the Converse) 


Affirming the consequent or the fallacy of the converse is a common error that 
people make in everyday life: 


Example 31. 1. “If it’s raining, then the grass is wet.” 


2. We observe, “The grass is wet.” 
981 


3. We incorrectly conclude, “Therefore, it’s raining. 


That the grass is wet does not imply that it’s raining. The grass may be wet because (a) 
it just stopped raining; (b) someone is watering it; (c) a dog is peeing on it; (d) you are 
peeing on it; or a billion other reasons. 





Formally, the fallacy takes this form: 
1. st Pp —> QQ.” 


2. “Q” 
3. “Therefore, P.” 


Example 32. 1. “If John has been doing chemotherapy, then John is bald.” 
2. We observe, “John is bald.” 
3. We incorrectly conclude, “Therefore, John is undergoing chemotherapy.” 


That John is bald does not imply he’s has been doing chemotherapy. John may be 
bald because (a) he’s suffering from male-pattern baldness; (b) he likes to keep his scalp 
clean-shaven; or a billion other reasons. 


Example 33. 1. “If Mary drinks a lot of Coke, then Mary is fat.” 
2. We observe, “Mary is fat.” 
3. We incorrectly conclude, “Therefore, Mary drinks a lot of Coke.” 


That Mary is fat does not imply she drinks a lot of Coke. Mary may be fat because (a) 
she drinks a lot of Pepsi; (b) eats a lot of junk food; (c) suffers from low metabolism; or 
a billion other reasons. 





When spelt out so explicitly, affirming the consequent or the fallacy of the converse 
seems rather silly. But unfortunately, people make this error all the time. Hopefully you'll 
now be able to avoid it. 


Exercise 17. Is the following chain of reasoning valid? 


1. “If Warren Buffett owns Google, then Warren Buffett is rich.” 


2. “Warren Buffett is rich.” 
3. “Therefore, Warren Buffett owns Google.” (Answer on p. 1751.) 








81By the way, such a chain of reasoning (whether valid or invalid) is traditionally called a syllogism. A 
syllogism has two or more statements called premises, followed by a conclusion. 
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3.11. The Contrapositive NOT-Q => NOT-P 
Informally, contrapositive = “Flip and negate both”. Formally, 


Definition 7. Let P and Q be statements. Given the implication P = > Q, its contra- 
positive is the statement NOT-Q => NOT-P. 


Example 34. Let J: “x is German”; and J: “x is European”. 


Consider the implication / = > J and its contrapositive NOT-J = > NOT-I: 


I = J: “If x is German, then x is European.” 
NOT-J = > NOT-I: “If x is not European, then x is not German.” 


Both J = > J and its contrapositive NOT-J ==» NOT-I are true. 


Next, consider the converse J = > I and its contrapositive NOT-I ==» NOT-J: 


J ==> I: “If x is European, then x is German.” 
NOT-I = > NOT-J: “If x is not German, then x is not European.” 


Both J = => I and its contrapositive NOT-J = > NOT-J are false. 


Example 35. Let EF: “It’s raining” and F’: “The grass is wet”. 
Consider the implication EF = > F and its contrapositive NOT-F = > NOT-E: 


E => F-: “Tf it’s raining, then the grass is wet.” 
NOT-F ==> NOT-E: “Tf the grass is not wet, then it’s not raining.” 


Both E = > F and its contrapositive NOT-F = > NOT-E are true. 


Next, consider the converse F = > E and its contrapositive NOT-E = > NOT-F: 


F => E: “Tf the grass is wet, then it’s raining.” 
NOT-E => NOT-F: “Tf it’s not raining, then the grass is not wet.” 


Both E = > F and its contrapositive NOT-F = > NOT-E are false. 





As the above examples suggest, every implication is equivalent to its contrapositive: 


Fact 5. Suppose P and Q are statements. Then 


(PO) (NOT-Q => NOT-P). 





Proof. By Definition 5, P => Q is NOT-PORQ. And NOT-Q ==> NOT-P is 
QORNOT-P. Hence, P —> Q and NOT-Q => NOT-P are equivalent.*? 

















®This proof implicitly assumes that the logical connective OR is commutative. 
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Fact 5 is especially useful on those occasions when it’s hard to prove an implication but 
easy to prove its contrapositive:** 


Example 36. It’s not obvious how we can prove this implication: 
If ct -a? +2741, then a #1. 
But proving its contrapositive is easy: 


If x =1, then ct - 2° +247 =1. 





3 








+27 =1. 


Proof. Simply plug in x = 1 to verify that «*- x 





Example 37. It’s not obvious how we can prove this implication: 
If x? is even, then x is even. 


But proving its contrapositive is easy: 


If x is odd, then 2? is odd. 





Proof. Let x = 2k +1 where k is some integer. Then 2x? = 4k? + 4k +1, which is odd. 














But don’t worry. In A-Level Maths, you won’t be required to write any proofs; the above 
is just FYI and to illustrate why the contrapositive is useful. 


Exercise 18. The statement “If x is German, then x is European” is true. Which 
statement below is its contrapositive? For each statement, explain whether it must be 
true, must be false, or could be either true or false. (Answer on p. 1751.) 
(a) “If x is European, then x is German.” 

(b) “If x is not German, then x is not European.” 

(c) “If «is not German, then x is European.” 


(d) “If x is not European, then x is not German.” 


(e) “If a is not European, then x is German.” 








83These examples are from &. 
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3.12. (P == QANDQ = P) —= (P = Q) 


Example 38. As before, let 


e J: “x is German.” 

e J: “x is European.” 
Me Sy 
Nee ool 


O: “x is a positive number.” 


To show that two statements are equivalent, we can use Definition 4. So for example, 


e M <=> OC, because both statements always have the same truth value. 
e I < J (counterexample: if x is Emmanuel Macron, then J is false while J is true). 
« M < N (counterexample: if x = 0.5, then M is true while N is false). 





Alternatively, we can use this fact: 


Fact 6. Suppose P and Q are statements. Then 


(P = QANDQ = P) 








Proof. See p. 1547 (Appendices). 











So, to show that P <— > Q, we can show that both P => Q and Q@ = => P are true. 
And to show that P < Q, we can show that either P => Q or Q = > P is false. 


Example 39. The implication M => O (“if « > 0, then x is a positive number”) is 
true. 


Also, the implication O = > M (“if x is a positive number, then x > 0”) is true. 
So, by Fact 6, M <=> O. 


Example 40. The implication J = > I (“if x is European, then x is German”) is false. 
po, by Fact 6, / <= J. 





Example 41. The implication N => M (“if x >1, then x > 0”) is false. 
So, by Fact 6, M << N. 
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3.13. The Four Categorical Statements and Their Negations 


In mathematics and logic, the word some means at least one. 


Here are the four categorical statements:** 


All Yes® “All S are P” (or “Every S is P”). 

All No “No S is P” (or “Every S is NOT-P”). 
Some Yes “Some S is P.” 
Some No “Some S is NOT-P.” 


Pe eS 


In each, we call S the subject and P the predicate. 


Example 42. We give six examples of each categorical statement: 


All Yes (“All S are P”) 


“All Koreans are Asian.” (Or, “Every Korean is Asian.”) 








“All Germans are European.” (Or, “Every German is European.”) 





“All animals are dogs.” (Or, “Every animal is a dog.”) 





“All mammals are bats.” (Or, “Every mammal is a bat.”) 
“All Koreans eat dogs.” (Or, “Every Korean eats dogs.”) 


“All Germans eat bats.” (Or, “Every German eats bats.”) 








All No (“No § is P”) 


“No Korean is Asian.” (Or, “Every Korean is not Asian.”) 








“No German is European.” (Or, “Every German is not European.”) 





“No animal is a dog.” (Or, “Every animal is not a dog.”) 





“No mammal is a bat.” (Or, “Every mammal is not a bat.”) 





“No Korean eats dogs.” (Or, “Every Korean does not eat dogs.”) 








“No German eats bats.” (Or, “Every German does not eat bats.”) 


The six subjects used are “Korean”, “German”, “animal”, “mammal”, “Korean”, and 
“German”. 


4 


The six predicates used are “Asian”, “European”, “a dog”, “a bat”, “eats dogs” (or “a 
’ ’ ’ ’ 
dog-eater”), and “eats bats” (or “a, bat-eater”). 





(Example continues on the next page ...) 





®4See Definition 263 (Appendices) for the formal definitions. 
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(... Example continued from the previous page.) 


Some Yes (“Some S is P”) 


“Some Korean is Asian.” (Or, “At least one Korean is Asian.”) 








“Some German is European.” (Or, “At least one German is European.”) 





” 


“Some animal is a dog.” (Or, “At least one animal is a dog.”) 








” 


“Some Korean eats dogs.” (Or, “At least one Korean eats dogs.”) 





( 
( 
“Some mammal is a bat.” (Or, “At least one mammal is a bat.”) 
( 
( 


“Some German eats bats.” (Or, “At least one German eats bats.”) 


Some No (“Some S is NOT-P”) 


“Some Korean is not Asian.” (Or, “At least one Korean is not Asian.”) 








“Some German is not European.” (Or, “At least one German is not European.”) 





“Some animals is not a dog.” (Or, “At least one animal is not a dog.”) 





“Some mammal is not a bat.” (Or, “At least one mammal is not a bat.”) 





“Some Korean does not eat dogs.” (Or, “At least one Korean does not eat dogs.”) 








“Some German does not eat bats.” (Or, “At least one German does not eat bats.”) 


Exercise 21. For each given pair of subject S and predicate P, write down the corre- 
sponding All Yes, All No, Some Yes, and Some No statements. (Answer on p. 1752.) 


S ie 
(a) | Donzer Kiki 
(b) | Donzer | Cancer 
(c) | Bachelor | Married 
(d) Bachelor | Smoke 




















Exercise 22. Explain whether each statement is true. (Answer on p. 1752.) 


(a) Corresponding All Yes and All No statements are always negations of each other. 





(b) Corresponding Some Yes and Some No statements are always negations of each other. 
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The last exercise showed that All Yes isn’t the negation of All No. Similarly, Some Yes 
isn’t the negation of Some No. So, what then is the correct negation of each categorical 
statement? 


Example 43. Consider the All Yes statement, “All Koreans eat dogs.” 


To show it’s false, we need merely show that there is at least one Korean who doesn’t eat 
dogs. So, its negation is 


“Some (i.e. at least one) Korean does not eat dogs.” 


Which is a Some No statement. 


Example 44. Consider the All No statement, “No Korean eats dogs,” (or, “All Koreans 
do not eat dogs.”) 


To prove it’s false, we need merely show that there is at least one Korean who eats dogs. 
So, its negation is 


“Some (i.e. at least one) Korean eats dogs”. 


Which is a Some Yes statement. 





In general, 


Fact 7. (a) The negation of an All Yes statement is the corresponding Some No state- 
ment. 


(b) The negation of an All No statement is the corresponding Some Yes statement. 








Proof. See p. 1549 (Appendices). 
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Example 45. The same examples as before, with their negations: 


All Yes (“All S are P”) 


“All Koreans are Asian.” 


Negation: Some No (“Some S is NOT-P”) 





“Some Korean is not Asian.” 





“All Germans are European.” “Some German is not European.” 





“All animals are dogs.” “Some animal is not a dog.” 





“All mammals are bats.” “Some mammal is not a bat.” 





“All Koreans eat dogs.” “Some Korean does not eat dogs.” 





“All Germans eat bats.” “Some German does not eat bats.” 


All No (“No § is P”) 


Negation: Some Yes (“Some S is P”) 
“No Korean is Asian.” 


“Some Korean is Asian.” 








“No German is European.” “Some German is European.” 





“No animal is a dog.” “Some animal is a dog.” 





“No mammal is a bat.” “Some mammal is a bat.” 





“No Korean eats dogs.” “Some Korean eats dogs.” 








“No German eats bats.” 





“Some German eats bats.” 
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Exercise 23. For each statements below, identify whether it’s an All Yes, All No, Some 
Yes, or Some No statement. Also, write down its negation. (Answer on p. 1752.) 


Statement 

(a) “All donzers are kiki.” 
(b) “No donzer is kiki.” 

(c) “Some donzer is kiki.” 
(d) “Some donzer is not kiki.” 
(e) “All bachelors are married.” 
(f) “No bachelor is married.” 


(g) “Some bachelor is married.” 


























(h) “Some bachelor is not married.” 





(i) “All donzers cause cancer.” 





(j) “No donzer causes cancer.” 





(k) “Some donzer causes cancer.” 





(1) | “Some donzer does not cause cancer.” 
(1m) “All bachelors smoke.” 


(n) “No bachelor smokes.” 











(o) “Some bachelor smokes.” 








(p) “Some bachelor does not smoke.” 


Exercise 24. While trying to excuse the less-than-perfect play of a basketball player, a 
commentator remarks, (Answer on p. 1753.) 


“Everybody is not LeBron James.” 


(a) Rewrite this statement into the form of a categorical statement. Identify the type of 
categorical statement, the subject, and the predicate. 


(b) Write down this statement’s negation. 


(c) Is the commentator’s statement is true? If false, what should he have said instead? 
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Exercise 25. Suppose this statement is true: “Every breadfruit is either donzer or kiki.” 
Based solely on this given statement, explain whether each statement below must be true, 
must be false, or could be true or false. (Answer on p. 1753.) 
(a) “Every breadfruit is both donzer and kiki.” 
(b) “Every breadfruit is donzer.” 
(c) “Every breadfruit is kiki.” 
(d) “Some breadfruit is donzer.” 
(e) “Some breadfruit is kiki.” 
(f) “No breadfruit is donzer.” 
(g) “No breadfruit is kiki.” 
(h) “Every non-breadfruit is neither donzer nor kiki.” 
(i) “Every non-breadfruit is not donzer.” 
(j) “Every non-breadfruit is not kiki.” 
(k) “Some non-breadfruit is not donzer.” 
“Some non-breadfruit is not kiki.” 
(m) “No non-breadfruit is donzer.” 
(n) “No non-breadfruit is kiki.” 


Exercise 26. Negate each statement. (Answer on p. 1753.) 


(a) “Every prime number is odd.” 
(b) “Every prime number greater than 2 is odd.” 


(c) “At least one rational number is greater than 7.” 


(d) “No number is both negative and imaginary.” 


(e) “Every number is both positive and irrational.” 
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3.14. Chapter Summary 


e« The conjunction P AND Q is true if both P and Q are true. 
e The disjunction P OR Q is true if either P or Q is true. 
« The negation NOT-—P is true if P is false and false if P is true. 
¢ P and Q are equivalent (written P <=> Q) if both always have the same truth value. 
e De Morgan’s Laws: 
— NOT-(PANDQ) <=> (NOT-P OR NOT-Q); 
— NOT-(PORQ) <= (NOT-P AND NOT-Q). 
e The implication P = > Q is 


— Defined as NOT-P OR Q; 

— Equivalent to its contrapositive NOT-Q = > NOT-P; 

— Negated by P AND NOT-Q; 

— Not generally equivalent to its converse Q = > P; 
* If P => Q is true, then Q = > P could be true or false. 
* If P = > Q is false, then Q = > P must be true. 


e« The four categorical statements and their negations: 


Statement Negation 
All Yes: “All S are P” | Some No: “Some S is NOT-P.”” 
All No: “No S is P” Some Yes: “Some S is P.” 











We call S' the subject and P the predicate. 
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4. Sets 


The set is the basic building block of mathematics. Informally, a set is a “container” or 
“box” whose contents we call its elements (or members). 


Example 46. Let A= {1,3,5} and B = {100, 200}. 


The set A The set B 


The set A contains three elements—namely, the numbers 1, 3, and 5. Informally, it is a 
“box” containing the numbers 1, 3, and 5. 


The set B contains three elements—namely, the numbers 100 and 200. Informally, it is 
a “box” containing the numbers 100 and 200. 





Note that when we talk about a set, we refer to both the box and the things inside it. 


Mathematical punctuation: 


¢ Braces {}—are used to denote the “container”. 


e A comma means “and” and is used to separate the elements within a set. 


Exercise 27. Write down C, the set of the first 7 positive integers.(Answer on p. 1754.) 


Exercise 28. Write down D, the set of even prime numbers. (Answer on p. 1754.) 
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4.1. The Elements of a Set Can Be Pretty Much Anything 
Note that for the A-Levels and hence also in this textbook, the elements of a set will almost 
always be numbers. However, in general, they needn’t be numbers; they can be pretty much 
any objects whatsoever:*° 


Example 47. Let V be the set of the four largest cities in the US. Then 


V = {New York City, Los Angeles, Chicago, Houston}. 


Example 48. Let L be the set of suits in the game of bridge. Then 


DL ={4,9,%, 4}. 


Example 49. Let E = {3,7’, The Clementi Mall, Love, the colour green}. 


The Clementi Mall 


y 


The set EF 


The set E contains exactly five elements: two numbers (3 and 7”); a shopping centre 
(The Clementi Mall); an abstract concept called love (denoted in the figure above by a 
red heart); and even the colour green (denoted by a green rectangle). 


Example 50. Let S be the set of Singapore citizens. Then S contains about 3.5M 
elements,®” three of which are Lee Hsien Loong, Ho Ching, and Chee Soon Juan. 


Example 51. Let U be the set of United Nations (UN) member states. Then U contains 
exactly 193 elements,®* three of which are Afghanistan, Singapore, and Zimbabwe. 


Exercise 29. Write down X, the set of Singapore Prime Ministers (both past and 
present). (Answer on p. 1754.) 








8° Actually, there are some restrictions on what can go into a set, but these technicalities are beyond the 
scope of the A-Levels. 

87 According to SingStat, the number of Singapore citizens in 2017 was about 3,439,200. 

88This UN webpage states that the most recent and 193rd state to join the UN was South Sudan in 2011. 
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A set can even contain other sets: 


Example 52. Let F' be the set that contains the sets A = {1,3,5} and B = {100,200}. 
That is, let F = {A,B} = {{1,3,53, {100, 2003}. 


The set A The set B 


100 
200 


The set F The set G 


Now consider instead this set: 
G = {1,3,5, 100,200}. 


The sets F and G look very similar. So, is G the same set as F’? 
Nope. The set F' contains exactly two elements, namely the sets A and B. 


In contrast, G contains exactly five elements, namely the numbers 1, 3, 5, 100, and 200. 
And so, F and G are not the same. 


You can think of F’ as a box that itself contains two boxes—namely, A and B, each of 
which contain some numbers. In contrast, G is a box that contains no boxes; instead, it 
simply contains five numbers. 


The following exercises continue with the above example: 
Exercise 30. Let H be the set whose elements are F and G. (Answer on p. 1754.) 


(a) How many elements does H have? 
(b) Write down H without using the letters A, B, F, or G. 


Exercise 31. Let J be the set whose elements are A, B, and G. (Answer on p. 1754.) 


(a) How many elements does J have? 
(b) Write down J without using the letters A, B, F, or G. 


(c) Compare the sets H (from the previous Exercise) and J. Are they the same? 
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4.2. In «and Not In ¢ 


Mathematical punctuation: 


e € means “is in” (or “is an element of”); 


e ¢ means “is not in” (or “is not an element of”). 
Example 53. Let J = {1,2,3,4,5,6,7}. Then 
leJ, 2eJ, 3€J, 4eJ, S€/, Ged, Ted. 


You can read these statements aloud as “1 is in J” (or “1 is an element of J”), “2 is in 
J” (or “2 is an element of J”), “3 is in J” (or “3 is an element of J”), etc. 


We can also write 1,2,3 € J (read aloud as “1, 2, and 3 are in J” or “1, 2, and 3 are 
elements of J”). 


Also, 8¢ J,9¢J,10¢J, etc. (read aloud as “8 is not in J” or “8 is not an element of J”, 
“9 is not in J” or “9 is not an element of J”, “10 is not in J” or “10 is not an element of 
J”, etc.). 


We can also write 8,9,10 ¢ J (read aloud as “8, 9, and 10 are not in J” or “8, 9, and 10 
are not elements of J”). 


Example 54. Let K = {Cow, Chicken}. Then 


¢« Cowe K (“Cow is in K” or “Cow is an element of Kk”). 
e Chicken e kK (“Chicken is in K’” or “Chicken is an element of kK”). 


¢ Cow, Chicken ¢ K (“Cow and Chicken are in K” or “Cow and Chicken are elements 
ork”), 


e Cat ¢ K (“Cat is not in K” or “Cat is not an element of kK”). 


Exercise 32. Fill in the blanks with either ¢ or ¢. (Answer on p. 1754) 
(a) Los Angeles The set of the four largest cities in the US. 


(b) Tharman The set of Singapore Prime Ministers (past and present). 
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4.3. The Order of the Elements Doesn’t Matter 





One implication of the above definition®® is that the order in which we write out the 
elements of a set does not matter: 


Example 55. Let A= {2,4,6} and B = {6,2,4}. 


Then A = B because every element that is in A is also in B and every element that is in 


B is also in A. 


The set A The set B 
In tact, 42.4.6) =42,6,4) =14. 2.6) =14.6,2)=16,2,4)=16.4, 2). 


Example 56. Let C= {Cow, Chicken} and D = {Chicken, Cow}. 


Then C' = D because every element that is in C’ is also in D and every element that is in 


D is also in C. 


The set C The set D 


Exercise 33. Are the two given sets equal? (Answer on p. 1754.) 


(a) {1,2,3} and {3,2,1}. (b) {{1},2,3} and {{3}, 2,1}. 








89 Actually, in set theory, this is not a definition, but an axiom (known as the Axiom of Extensionality). 
But here for simplicity, V’ll just call it a definition. 
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4.4. n(S) Is the Number of Elements in the Set S 


Let S be a set. Then the number of elements in S®° is denoted by 


Example 57. n({2,4,6}) =3. 


Example 58. n({Cow, Chicken}) = 2. 


n(s). 


Exercise 34. Let X be the set of Singapore Prime Ministers (past and present). Then 
what is n(X)? (Answer on p. 1754.) 





4.5. The Ellipsis “...” Means Continue in the Obvious Fashion 


More mathematical punctuation—the ellipsis “...” means “continue in the obvious fashion”. 


Example 59. L is the set of all odd positive integers smaller than 100. So in set notation, 
we can write D = {1,3,5,7,9,11,...,99}. 


Example 60. / is the set of all negative integers greater than —100. So in set notation, 
we can write M = {-99, -98, -97,...,-2,-1}. 





What is “obvious” to you may not be obvious to your reader. So only use the ellipsis when 
you're confident it will be obvious to your reader! And as I did with the sets above, never 
be shy to write a few more of the set’s elements (doing so costs you nothing except maybe 
a few more seconds and some ink). 


Exercise 35. In the above examples, what are n(Z) andn(M)? (Answer on p. 1754.) 


Exercise 36. Let N be the set of even integers greater than 100 but smaller than 1, 000. 
Write down N in set notation. (Answer on p. 1754.) 








Or the cardinality of S. 
°1Some also use card A. See ISO 80000-2:2009, Item No. 2-5.5. 


47, Contents www.EconsPhDTutor.com 


4.6. Repeated Elements Don’t Count 


Another implication of Definition 8 is that repeated elements don’t count (they’re 
simply ignored): 


Example 61. Let A= {2,4,6} and B = {2,2,4, 6}. 


Then A = B because every element that is in A is also in B and every element that is in 


B is also in A. 


The set A The set B 


In fact, {2,4,6} = {2,2,4,6} = {4,2,6,2,2,6, 4,2}. 
Moreover, n({2,4,6)) =n({2,2,4,60}) =n (44, 2,6,2,2,6,4,2}) =3. 


Example 62. {Cow, Chicken} = {Cow, Cow, Chicken} = {Chicken, Cow, Chicken}. 


Moreover, 


n({Cow, Chicken}) =n({Cow, Cow, Chicken}) = n({Chicken, Cow, Chicken}) = 2. 


Exercise 37. Let W = {Apple, Apple, Apple, Banana, Banana, Apple}. Find n(W). 
Also, rewrite W more simply. (Answer on p. 1754.) 





Exercise 38. Let C' be the set of even prime numbers. Find n(C).(Answer on p. 1754.) 
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4.7. KR Is the Set of Real Numbers 


Like most students entering college, mathematicians of the midnineteenth century 
thought they understood real numbers. In fact, the real number line turned out to 
be much subtler and more complicated than they imagined. 


— David M. Bressoud (2008). 


Few mathematical structures have undergone as many revisions or have been pre- 
sented in as many guises as the real numbers. Every generation re-examines the 
reals in the light of its values and mathematical objectives. 


— Faltin, Metropolis, Ross, and Rota (1975). 


So far, we’ve encountered only finite sets, i.e. sets with finitely many elements. 


In this and the next two subchapters, we introduce several infinite sets, i.e. sets with 
infinitely many elements. 


First, we have the set of real numbers (or simply reals): 


Definition 9. The set of real numbers is denoted R. 





Example 63. 16, —-1.87, 7» 3.14159, and /2 » 1.41421 are all real numbers. 


Now, by the way, what exactly is a real number? This sounds like a “dumb” question, but 
is actually a profound one that was satisfactorily resolved only from the late 19th century. 
Indeed, this question is a little beyond the scope of the A Levels. 


And so for the A Levels, we’ll simply pretend—as we did in secondary school—that “every- 
one knows” what real numbers are (even though, as the quotes above suggest, they actually 
don’t). We shall not attempt to define or construct the real numbers. 
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4.8. Z Is the Set of Integers 


Die ganzen Zahlen hat der liebe Gott gemacht, alles andere ist Menschenwerk. 


God made the integers, the rest is the work of man. 
— Leopold Kronecker (1886).% 


Next, Z is the set of integers: 


Definition 10. The set of integers, denoted Z, is 





ie Re Pe ed Ol ee ee 
Z is for Zahl, German for number. 


Example 64. 16 is an integer, while -1.87, 71, and J/2 are non-integers. 





Note that with an infinite set, we cannot explicitly list out all its elements. And so, when 
writing out an infinite set, we’ll sometimes find it helpful to use the ellipsis. 


When writing out Z above, we used two ellipses. The first ellipsis says we continue “left- 
wards” in the “obvious” fashion, with —4, —5, -6, etc. The second says we continue “right- 
wards” in the “obvious” fashion, with 4, 5, 6, etc. 


Exercise 39. 7 is the set of all prime numbers. With the aid of an ellipsis, write down 
H in set notation. (Answer on p. 1754.) 


Definition 11. An even integer is any element of the set {...,-6,—-4,-2,0,2,4,6,...}. 
An odd integer is any element of the set {...,-5,-3,-1,1,3,5,...}. 





It’s not on your syllabus, but a natural number is simply any positive integer:”? 


Definition 12. The set of natural numbers, denoted N, is 


Neti a) 








° According to Heinrich Weber (in his 1893 obituary for Kronecker), this quoted remark was made by 
Kronecker at an 1886 lecture to the Berliner Naturforscher-Versammlung. 
There’s actually some debate as to whether N should include 0. 4 
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4.9. QIs the Set of Rational Numbers 


Definition 13. A rational number (or simply rational) is any real number that can be 
expressed as the ratio of two integers. 


Any other real number is called an irrational number (or simply irrational). 


Definition 14. The set of rational numbers is denoted Q. 


Q is for quotient.™4 





Example 65. 16 € Q because we can express 16 as the ratio of two integers (e.g. 16/1). 


—1.87 € Q because we can express —1.87 as the ratio of two integers (e.g. —187/100). 


Example 66. V2, ¢Q. In words: “V2 and 7 are not elements of the set of rational 
numbers.” Or more simply: “V2 and 7 are irrational.” 


(Note though that this is far from obvious. It takes a little work to prove that V2 is 
irrational and even more work to prove that 7 is irrational.) 





As you probably already know, a number is rational if and only if its digits eventually 


recur: = 


Example 67. 1/3 = 0.33333--- = 0.3 is rational and sure enough, it has the recurring digit 
3. We will use the overbar to denote recurring digit(s).°° 


W/7 = 0.142857142857142857--- = 0.142857 is rational and sure enough, it has the recurring 
digits 142857. 


Similarly, 16 = 16.000--- = 16.0 and 1.87 = 1.87000:-- = 1.87 are rational and have the 
recurring digit 0. (Of course, when the recurring digit is 0, we usually don’t bother 
writing it.) 


Example 68. V2 x 1.4142135623... and m* 3.1415926535... are irrational. And sure 
enough, their digits never recur. 


(But again, this is far from obvious and takes some work to prove.) 








Also quoziente in Italian, Quotient in German, and quotient in French. 
See Proposition 25 (Appendices). 
*Some writers also use an overdot instead: 1/3 = 0.33333---= 0.3. But we won't. 
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4.10. A Venn Diagram of the Real Numbers 


R= The set of all real numbers All reals are either rational or 
irrational (and never both). 


All rationals are either integers 
Irrationals or non-integers (and never both). 


Five remarks: 


4. R, Z, and Q are infinite sets 

(i.e. they contain infinitely 
Rational many elements). 

non-integers We may thus write n(R) = 


n(Z) =n(Q) =. 


5. Infinity (oo) and negative 
Rationals Q infinity (—oo) are not num- 
bers.” 


852.99 





As with real numbers, we won't try to formally define what either co or —0o is.°® We'll just 
repeat and emphasise the key point here: 


Infinity is not a number. 


3. The areas in the above Venn diagram may incorrectly suggest that there are “as many” 
rationals as irrationals. But this is false—it turns out (and this is beyond A-Levels) 
there’s a precise sense in which there are “more” irrationals than rationals.” 


4. This textbook follows your A-Level syllabus and exams in using blackboard bold font 
(R, Z, Q ). Note though that some other writers use bold font (R, Z, and Q). 


5. We can handwrite R, Z, and Q as an R, Z, and Q with an extra vertical line on the left, 
an extra diagonal line “inside”, an extra vertical line “inside”: 


\ ZW 


7 Actually, some writers in some contexts find it convenient to call co and —co numbers—for example, they 
may call co and —-co extended real numbers. But this textbook will keep it simple and insist that 
neither oo nor —oo is a number. Fa 

*8Indeed, there’s no need to define either oo or —0o at all. There isn’t even any need to think of either as 
some definite object or “thing”. Instead, either is usually simply used as shorthand for expressing some 
long-winded statement. For example, we can think of the statement “n (Q) = co” as equivalent to “the 
number of elements in Q is unbounded from above”. 

°° And perhaps surprisingly, there are “as many” integers as rational non-integers. 
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4.11. More Notation: *, -, and o 


To create a new set that contains only the positive elements of the old set, append 
superscript plus sign (*) to the name of a set: 


© 


1. Z* = {1,2,3,...} is the set of all positive integers. 

2. Q is the set of all positive rational numbers. 

3. R* is the set of all positive real numbers. 

To create a new set that contains only the negative elements of the old set, append a 
superscript minus sign (~) to the name of a set: 

1. Z = {-1,-2,-3,...} is the set of all negative integers. 

2. Q is the set of all negative rational numbers. 

3. R” is the set of all negative real numbers. 

To add the number 0 to a set, append a subscript zero (9) to its name: 
Zo = {0,1,2,3,...} is the set of all non-negative integers. 

. Zo = {0,-1, -2,-3,...} is the set of all non-positive integers. 

Qj is the set of all non-negative rational numbers. 

Q) is the set of all non-positive rational numbers. 


Rg is the set of all non-negative real numbers. 


oe St a Se Pe? 


. Ro is the set of all non-positive real numbers. 


Exercise 40. Which of the sets introduced above are finite? (Answer on p. 1754.) 
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4.12. The Empty Set 2 


A set can contain any number of elements. Indeed, it can even contain 
zero elements: 


Definition 15. The empty set {} is the set that contains zero elements 
(and is often also denoted @). 
The empty set 
Informally, the empty set {} = @ is the “container” with nothing inside. ft or @. 
Hence the name. 


Example 69. In 2016, the set of all Singapore Ministers who are younger than 30 is {} 
or @. This means there is no Singapore Minister who is younger than 30. 


Example 70. The set of all even prime numbers greater than 2 is {} or @. This means 
there is no even prime number that is greater than 2. 





Example 71. The set of numbers that are greater than 4 and smaller than 4 is {} or @. 
This means there is no number that is simultaneously greater than 4 and smaller than 4. 





Example 72. The set {@} is not the same as the set ©. 

{@} is a set containing a single element, namely the empty set. 
Informally, {@} is a box containing an empty box—it is not empty. 
In contrast, @ is the empty set. 

Informally, it is simply an empty box. 


We can also rewrite the two sets as 


(y= 1a and © 2 — 1). 


Now it is perhaps clearer that {{}} # {}. Voy =r 


Example 73. The set {@,3,{@}} is the set contain- 
ing exactly three elements, namely the empty set, the 
number 3, and a set containing the empty set. 


Take care to note that this set does not contain four 
elements. 


{2,3,{@}} 





Exercise 41. Tricky: Let S = {{{}},2,{@},{}}. What is n(S)? (Answer on p. 1754.) 
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4.13. Intervals 


Using left parenthesis (, right parenthesis ), left bracket [, and right bracket |, we 
can write intervals of real numbers: 


Example 74. Here are nine intervals: 








A= (1,3) 
pene) 
C=(1,3] 
Deis) 
B= (1,00) 
F = Iya) 
G' = (-oo, 1) > 
H = (-c0, 1] > 


I= (-«,co) —__ >, 
































(a) The interval A=(1,3) is the set of real numbers that are > 1 and <3. 
(b) The interval B = [1,3] > land <3. 
(c) The interval C = (1,3] >land <3. 
(d) The interval D =[1,3) > land <3. 
(e) The interval FE = (1, 00) aly 
(f) The interval F =[1, 00) alle 
(g) The interval G = (-00, 1) <1 
(h) The interval H = (—oo, 1] Sl 
(i) The interval I = (-00, 00) denotes the set of all real numbers. That is, (—00, 00) = R. 


Given below are the left and right endpoints of the above nine intervals: 


A|B D 
Left endpoint | 1 | 1 1 
Right endpoint | 3 | 3 3 
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Definition 16. Let a and b be real numbers with b >a. An interval is any one of these 
nine sets: 


(a) The interval (a,b) is the set of real numbers that are >a and < b. 
(b) The interval [a,}] >aand <b. 
(c) The interval (a, | >a and < b. 
(d) The interval [a, b) >a and < b. 
(e) The interval (a, 00) eye 
(f) The interval [a, 00) 20, 


(g) The interval (-0o, a) <a 
(h) The interval (-0co, a] <a. 


(i) The interval (—0o, co) is the set of all real numbers. That is, (-—0oo, oo) = R. 


Given below are the left and right endpoints of the above nine intervals: 


b|cid 





Left endpoint 
Right endpoint 



































By the way, we call 


¢ (a,b) an open interval. 
¢ [a,b] a closed interval. 


¢ (a,b] a half-closed interval, a half-open interval, a left-half-open interval, or a 
right-half-closed interval. 


¢ [a,b) a half-closed interval, a half-open interval, a left-half-closed interval, or a 
right-half-open interval. 


Exercise 42. het Ae [11], B=(1.1), C = (1,1), Dp =(11), ond 2 = (1.1.01). Find 
the number of elements in each of these five sets. Express each set in another way. It 
turns out that each of the sets A, B, C, and D is called a degenerate interval, while E 
is called a non-degenerate interval. Can you write down some possible definitions of the 
two terms degenerate interval and non-degenerate interval? (Answer on p. 1755.) 


Exercise 43. Express R, R*, Rj, R°, and Rp in interval notation. (Answer on p. 1755.) 








100One good argument in favour of reverse bracket notation is that it avoids confusing the open interval 
(a,b) with the ordered pair (a,b) (we'll learn more about ordered pairs in Ch. 7). However, by and 
large, the reverse bracket notation remains uncommon, except in continental Europe and especially 
France (where it was introduced by the Bourbaki group). ~ 
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4.14. Subset Of ¢ 


Definition 17. Let A and B be sets. If every element of A is an element of B, then we 
call A a subset of B and write 


AG £. 


Otherwise, we say that A is not a subset of B and write A¢ B. 


Example 75. Let M = {1,2}, N = {1,2,3}, and O = {1,2,4,5}, and P = {3,2,1}. Then 


M is a subset of N, O, and P MeN MeO and WeP 
N is a subset of P, but not of MorO NCP,butN¢M andN¢€O 
O is not a subset of M/, N, or P O€M,O€GN, andO€P 
P is a subset of N, but not of MorO PCN, but P€M and P¢O 





Observe that N ¢ P and PCN. Indeed, the sets N = P. We have the following fact: 


Fact 8. Two sets are equal ==> They are subsets of each other. 


Proof. Two sets A and B are equal 
<= Every element in A is in B and every element in B is in A (Definition 8) 


<= Aisa subset of B and B is a subset of A (Definition 17). 














Exercise 44. Consider Z, Q, and R. (Answer on p. 1755.) 


(a) Is Z a subset of Q or R? 
(b) Is Q a subset of Z or R? 
(c) Is R a subset of Z or Q? 


Exercise 45. Explain whether this statement is true: (Answer on p. 1755.) 


“The set of current Singapore Prime Minister(s) is a subset of the set of current 
Singapore Minister(s).” 


Exercise 46. Let A and B be sets. Explain whether each of the following statements is 
generally true. (If false, give a counterexample.) (Answer on p. 1755.) 


(a) AGB == A=B. (d) A=B = > BCA. 
(b) BeA = A=B. (e) A=B <> ACB. 
(c) A=B S AGB. (tf) A=B = Bea. 
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4.15. Proper Subset Of c 


Definition 18. Let A and B be sets. If Ac B but A # B, then we call A a proper subset 
of B and write 


vce 2. 


Otherwise, we say that A is not a proper subset of B and write A ¢ B. 


Example 76. Let M = {1,2}, N = {1, 2,3}, and O = {1,2,4,5}, and P= {3,2,1}. Then 


M is a proper subset of N, O, and P MeN MeO and Woe? 
N is not a proper subset of M,O,or PP N€M,N¢O,N¢P 
O is not a proper subset of M, N,orP O¢M,O¢N,O¢P 
P is not a proper subset of M, N,orO P¢M,P¢€N,P¢O 


Observe that N = P so that by Definition 18, N¢ P and P¢ N. 


Exercise 47. Let S be the set of all squares and R be the set of all rectangles. Is Sc R? 
(Answer on p. 1756.) 


Exercise 48. Does AC B imply that Ac B? (Answer on p. 1756.) 
Exercise 49. Does Ac B imply that A ¢ B? (Answer on p. 1756.) 


Exercise 50. Explain whether this statement is true: “If A is a subset of B, then A is 
either a proper subset of or is equal to B.” (Answer on p. 1756.) 
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Remark 13. If A is a (proper) subset of B, then we are very much tempted to say that 
A is “(strictly) smaller than” B—or equivalently, that B is “(strictly) larger than” A. 
Informally and intuitively, this is perhaps permissible. However, we shall 


avoid speaking of one set as being “(strictly) smaller 
or larger than” (or “the same size as”) another. 


The reason is the “(proper) subset of” relation is merely one way by which we can compare 
the “sizes” of sets. There are at least two other ways, called cardinality and measure. 


Example: Suppose A = [0,1] and B = [0,2]. Then A is a proper subset of B, so that in 

this sense, we may consider A to be “strictly smaller than” B. However, it turns out that 

e The cardinality of A is the same as that of B. So, in the sense of cardinality, A is 
“the same size as” B. 

e The measure of A is strictly smaller than that of B. So, in the sense of measure, A 
is “strictly smaller than” B. 


Happily, for A-Level Maths, you needn’t know anything about the concepts of cardinal- 
ity and measure. This remark is merely to explain to you why we avoid speaking of one 
set being “(strictly) smaller or larger than” (or “the same size as”) another. 


Example 77. Again, let M = {1,2} and N = {1,2,3}. So, McN. 
While tempting, we shall avoid saying that M is (strictly) smaller than N. 
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4.16. Union U 


Definition 19. Let A and B be sets. The union of A and B, denoted Au B, is the set 
of elements that are in A or B. 


The set A The set B 





AUB is the yellow region. 
Tips °U for Union, 
Example 78. Let T = {1,2}, U = {3,4}, and V = {1,2,3}. Then 


TuU ={1,2,3,4}, TUV = {12,3}, VUV = {1,2,3,4}, TUU UV = (1,2, 3, 4}. 


Exercise 51. Rewrite each set more simply: (Answer on p. 1756.) 


(a) [1,2] u [2,3] (b) (—co,-3) u [-16,7) (c) {0}uZ* 


Exercise 52. What is the union of each pair of sets?(Answer on p. 1756.) 


(a) The set of squares S' and the set of rectangles R. 
(b) The set of rationals and the set of irrationals. 
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4.17. Intersection n 


The set A The set B 





An B is the yellow region. 


Definition 20. Let A and B be sets. The intersection of A and B, denoted An B, is 
the set of elements that are in A and B. Two sets intersect if their intersection contains 
at least one element. 





Equivalently, two sets A and B intersect if their intersection is non-empty, i.e. 


AnB##@. 


Definition 21. Two sets are mutually exclusive or disjoint if they do not intersect (i.e. 
their intersection is empty). 


Example 79. Let T = {1,2}, U = {3,4}, and V = {1,2,3}. Then 
PHUV=Eo, Pav =71 200 inv =43), Fava =~, 


The intersection of T’ and U is empty. Hence, T and U are mutually exclusive or disjoint. 


The intersection of the three sets T’, U, and V is also empty. However, the three sets 7’, 
U, and V are not mutually exclusive or disjoint: 


Definition 22. Three or more sets are mutually exclusive or disjoint if every two of them 
are mutually exclusive or disjoint. 
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Example 80. Let T = {1,2}, U = {3,4}, and W = {5,6}. Then 


(Meee On eae. 


Note though that 2 (the above equation) does not suffice for declaring that T, U, and W 
are mutually exclusive. 


Instead and as per Definition 22, 7, U, and W are mutually exclusive or disjoint because 
every two of them are mutually exclusive or disjoint: 


P= 2. aw —2. ay =, 
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4.18. Set Minus 


The set minus (sometimes also called set difference) sign \ is very convenient. Sadly, it 
does not appear in the A-Level syllabus. Nonetheless, it’s worth a quick mention and I'll 
sometimes use it in this textbook. 





The set A The set B 





A\ B is the yellow region. 
Example 81. Let T = {1,2}, U = {3,4}, and V = {1,2,3}. Then 


Toe, TV =@, OSV a4). 





Exercise 55. In the above example, what are V\ T and V\ U? (Answer on p. 1756.) 


Examples of why the set minus notation is sometimes very convenient and helps us avoid 
ugly monstrosities: 


Example 82. Consider the set of all real numbers except 1. 
Without the set minus sign, we’d write this as (—00, 1) U(1, c). 


With the set minus sign, we can more simply write R \ {1}. 


Example 83. Consider the set of all real numbers that aren’t integers. 


Without the set minus sign, we’d write this as ---U (-3, -2) u (-2,-1) U(-1,0) U (0,1) u 
du 3) 


With the set minus sign, we can more simply write R \ Z. 
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4.19. The Universal Set & 


The universal set, denoted & (that’s a squiggly EF), is the set of “all” elements.'°! Note 
though that what we mean by “all” elements depends on the context: 


Example 84. In the context of a roll of a six-sided die, the universal set might be the 
set of all possible outcomes: 


& = {1,2,3,4,5,6}. 


Example 85. In the context of a spin of a European-style roulette wheel, the universal 
set might be is the set of all possible outcomes: 


& = {0,1,2,3,...,36}. 


Note that in American-style roulette, there is a 38th possible outcome—double zero 00. 
And so in the American context, the universal set might instead be: 


G= (00012 3a e) 
(It looks like Marina Bay Sands does American-style roulette.) 


Example 86. In the context of a game of chess, the universal set might be the set of all 
possible outcomes for White: 


& = {Win, Lose, Draw}. 


Exercise 56. For each context, write down the universal set and its number of elements. 


(a) 4D. 
(b) Sex (as on a Singapore NRIC). 
(c) Singapore’s official language(s). 


(d) Singapore’s national language(s). (Answer on p. 1756.) 








101This is not a precise definition. Indeed, there can be no single precise definition of the universal set 
since what this is depends on the context. 
Note also that the “set of all objects” leads to a contradiction (see e.g. Russell’s paradox) and thus does 
not exist. 
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4.20. The Set Complement A’ 


The set A 


Definition 24. Let A bea set. The set complement 


of A, denoted A’, is the set of all elements that are 
not in A. 





Equivalently: A’=&\ A. That is, A’ is the universal 
set & minus those elements in A. 


In the figure (right), A’ is the yellow region. 


To repeat, what the universal set & is (i.e. what we 
mean by “all” elements) depends on the context: 





Example 87. Let A= {2,3}. If the relevant context is the roll of a die, then 
C= id, 3, 40,0) and Ala 11,2, 3.45 5.6s\ A = 1456! 
But if instead the relevant context is a European-style roulette wheel spin, then 
@ =10 1 2,4.6,36} and Al 240,152,586} \ A= 10, 1,405.6, . 20,80}. 
And if instead the relevant context is the positive integers smaller than 10, then 
C42, oO) and A =41257..,90) VA S11, 4, 8,6, 7, 8,9}. 
Example 88. Let B = {2,4,6,...}. If the relevant context is the positive integers, then 
6 =Z," and = EPA DAO Botan) 4 lo) tke 


That is, B’ is simply the set of positive odd numbers. 


But if instead the relevant context is the set of all integers, then 


E=2, and Pat KAO eane) = 2), ON ooo, (aes 


That is, B’ is the set of positive odd numbers and all non-positive integers. 


Example 89. Let C = R* = (0,00). If the relevant context is all real numbers, then 
E=R and C“=R\C =. 


That is, C’ is simply the set of non-positive reals. 
But if instead the relevant context is the set of reals greater than or equal to -1, then 
& =[-1, 00) and C’ =[-loo) \ C= [-1,0]. 


That is, C’ is the set of reals between -1 and 0 (inclusive). 


65, Contents www.EconsPhDTutor.com 








66, Contents www.EconsPhDTutor.com 


4.21. De Morgan’s Laws 


It turns out there’s a deep connection between logic and set theory. In particular, 


The intersection N corresponds to the logical connective AND (the conjunction). 
The union U corresponds to the logical connective OR (the disjunction). 
Earlier, for logic, we had De Morgan’s Laws (Facts 1 and 2) 


(a) The negation of the conjunction P AND Q is the disjunction NOT-P OR NOT-Q. 
(b) The negation of the disjunction P OR Q is the conjunction NOT-P AND NOT-Q. 


Correspondingly, for set theory, we also have these De Morgan’s Laws: 


(a) The complement of the intersection Pn Q is the union P’uQ". 
(b) The complement of the union P UQ is the intersection P’n Q’. 


The set P The set Q The set P The set Q 





(PQ) = P’uUQ' is in yellow. (PUQ) = P’nQ' is in yellow. 


For future reference, let’s jot this down as a formal result: 


Fact 9. Suppose P and Q are sets. Then 


(a) (PnQ)'=P’uQ’ and (b) (PUQ)'=P'nQ’. 








Proof. See p. 1553 (Appendices). 
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Example 90. Suppose that every animal is (i) either tall or short; and (ii) either fat or 
lean. So for example, the horse is tall and lean, the elephant is tall and fat, the pig is 
short and fat, and the mouse is short and lean. 


Let T' be the set of tall animals and F' be the set of fat animals. 


Consider 7'n F’, the set of animals that are tall AND fat. Its complement is the set of 
animals that are short OR lean (yellow region below): 


Ciay ep we kee ae 


The set T The set F 


Consider T’u F’, the set of animals that are tall OR fat. Its complement is the set of 
animals that are short AND lean (yellow region below): 


(NOT 2) ee aay ce 


The set T’ The set F 
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4.22. Set-Builder Notation (or Set Comprehension) 


Previously, we simply wrote out a set using the method of set enumeration. That is, we 
simply enumerated (i.e. listed out) all their elements: 


Example 91. The set of Singapore PMs (both past and present) is 





S = {Lee Kuan Yew, Goh Chok Tong, Lee Hsien Loong}. 


Where a set has too many elements to list out, we can use the ellipsis “...”. But this too 
counts as the method of set enumeration: 


Example 92. The set of integers greater than 100 is 


T = {101, 102, 103, 104,...}. 





We now introduce a second method of writing out a set, called set-builder notation or 
set comprehension: 


Example 93. The set of Singapore PMs (both past and present) is 
S' = {x:ax has ever been the Singapore PM}. 


66,99 


In set-builder notation, the mathematical punctuation mark colon means such that. 
Following the colon is the property or criterion that 2 must satisfy in order to be an 
element of the set. Hence, S is 

“the set of all objects x such that x has ever been the Singapore PM”. 
Note that the letter or symbol x is simply a dummy or placeholder variable. We 
could have replaced x with any other letter or indeed any other symbol and the set would 
have been unchanged. For example, we could’ve replaced x with the letter y: 


S = {y:y has ever been the Singapore PM}. 


Or even a smiley face ©: 


S ={©: © has ever been the Singapore PM}. 
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Example 94. Let T = {x:x2€Z,x>100}. (Recall that the comma “,” means AND.) 
Then T' is 


the set of all objects x such that x is an integer AND x > 100. 


Observe that this time, following the colon are two properties or criteria that x must 
satisfy in order to be an element of the set. 


We could also have written T as “the set of all integers x such that «x > 100”: 


P=tpeZe a> N00. 


Example 95. Let S be the set of ASEAN members. 


Using set enumeration, we'd write 


S = {Brunei, Cambodia, Indonesia, Laos, Malaysia, Myanmar, 


the Philippines, Singapore, Thailand, Vietnam}. 
Using set-builder notation or set comprehension, we’d write 
S ={x:ax is a member of ASEAN}. 
This is “the set of all objects x such that x is a member of ASEAN”. 


Example 96. Let Familee be the set of individuals who have ever been members of 
Singapore’s Royal or First Family. Consider 


A = {x:a has ever been the Singapore PM, x ¢ Familee}. 


In words, A is “the set of all objects x such that x has ever been the Singapore PM AND 
x has never been a member of the Familee”. And so, 


A = {Goh Chok Tong}. 


Example 97. Let B = {x: a is a member of ASEAN, x has fewer than 5,000 islands}. 


In words, B is “the set of all objects x such that x is a member of ASEAN AND z has 
fewer than 5,000 islands”. So, using set enumeration, we’d write 


B = {Brunei, Cambodia, Laos, Malaysia, Myanmar, Singapore, Thailand, Vietnam}. 
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We’ve used set-builder notation to describe sets of prime ministers or countries. But more 
commonly, we'll use it to describe sets of numbers: 


Example 98. Let A= 12 pee lee o}. 
In words, A is “the set of all objects x such that x? - 1 =0”. 


Hence, A= {-1,1}. 
Example 99. Let B= 12 :g*7-1=0,2> o}. 
In words, B is “the set of all objects x such that x?-1=0 and x>0”. 


Hence, Baa. 


We could also have written, B= 12 eR’ 7°15 o}. 


This says that B is “the set of all positive real numbers x such that x? - 1 = 0”. 


Example 100. Consider C = {r€Z:3.5<a< 5.5}. 
In words, C' is “the set of all integers x such that 3.5 < x < 5.5. 


Hence, C= {45 
Example 101. The set of positive reals may be written 
R* =(0,00) ={eeRia>0}. 


In words, this is “the set of all reals x such that x is greater than 0”. 


Similarly, the set of non-negative reals may be written as 
R= (0, oj=i2 eR a SO}. 


In words, this is “the set of all reals x such that x is greater than or equal to 0”. 


Similarly, we may write 


O =72O: 2-0}, Le Pele Sh. 
Q = {xe Q: x20}, 0 ={2eZ:x >}. 





Exercise 60. Write out, in words, what Q*, Qj, Z*, and Zj are. (Answer on p. 1756.) 
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Example 102. Let S be the set of positive even numbers. Then we may write 
= 424 Oe =e ea zhee |. 


In words, S is “the set of all objects x such that x equals 2k AND k is a positive integer”. 
Notice we’ve introduced a second dummy variable k to help us describe the set. 


Again, both symbols x and k could’ve been replaced by any other symbols and we’d still 
have the same set. For example, 


= 124,007.) =\0 pp -29.¢6 7 | 
={«:*=24,¢¢€Z*} 
={©:©=2m,m¢Z*}. 


(Of course, letters like x, k, p, and q are customary and hence preferable to weird symbols 
like shapes and faces.) 


Here’s another way to write down S: 
© =4104,60, 6 =r eo |. 


In words, S' is “the set of all elements x7 such that x divided by 2 is a positive integer”. 


And here’s another: 
S =42,4,6, 8,04) 12k ke Z }. 


In words, S' is “the set of all elements 2k such that k is a positive integer”. 


Exercise 61. Let a <b. Rewrite each set in set-builder notation. (Answer on p. 1757.) 


(a) RF (b) Q@ (c) Z (d) Ro 
(e) Q (f) Zo (g) (4,6) (h) [a,0] 
(i) (a,)] (j) (a,b) (k) (-00,-3) U (5, 00) 

(DT) (es; (2,3) U3, cs) (m) (-00,3)n (0,7) 

(n) The set of negative even numbers 

(o) The set of positive odd numbers 

(p) The set of negative odd numbers. 

(q) (7,40, 7 10m,..2) 

(vr) {=-2n, 1, 4, 7x, 10m,... }. 
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4.23. Chapter Summary 


e Informally, a set is a container or box whose objects we call its elements. 
Whenever we talk about a set, we refer to both the container and the objects inside. 
e € means in and ¢ means not in. 
e The ellipsis “...” means continue in the obvious fashion. 
e The order of the elements doesn’t matter and repeated elements don’t count: 
fA Did} 4 Bebe ee byt oe lily dalek cays 
¢ R, Z, and Qare the sets of reals, integers, and rationals.'°? 


¢ R*, R’, Rj, and Rp are the sets of positive reals, negative reals, non-negative reals, and 
non-positive reals. Similarly, we have 


Q*, OY, Qj, and Q;; and Z*, Z, Zi, and Zp. 


« The empty set {}=@ is the set with zero elements. 


e Interval notation. Let a< b. Then 


“— 


excluded) and 6 (excluded). 
included) and b (included). 
excluded) and 6 (included). 
included) and 6 (included). 


1. (a,b) is the set of reals between a 
2. [a,b] is the set of reals between a 
3. (a,b] is the set of reals between a 


GS LS Fe 


4. [a,b) is the set of reals between a 


e Aisa subset of B—denoted Ac B—if every element in A is also in B. 

e Aisa proper subset of B—denoted Ac B—if AC Band A B. 

e AUB, the union of A and B, is the set of elements that are in A OR B. 

e ANB, the intersection of A and B, is the set of elements that are in A AND B. 

« A\ B (or A-B), A set minus B, is the set of elements that are in A AND not in B. 
¢ The universal set & is the set of “all” elements (where “all” depends on the context). 
¢« The set complement A’ is the set of elements that are not in A. 

¢ De Morgan’s Laws: (PnQ)’= P’UQ’ and (PUQ)'= Pn. 


¢ Set-builder notation (or set comprehension). The set of objects satisfying the 
property P is denoted {x: P(x)}. 


Exercise 62. Write each set more simply. (Answer on p. 1757.) 


(a) R\R*. (b) Rv (QuZ). 


(c) [1,6] \ ((3,5) 9 (1,4)). (dy) 405.9 1e....ne46 8) 
(eo) 425 8 ies ae ea (f) (0,5]([1,8] 9 [5,9))” 








102Not on your syllabus: N is the set of natural numbers. 
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5. O-Level Review 


5.1. Some Mathematical Vocabulary 


Example 103. Consider the equation 1 + 4 =2+3. 


LHS expression RHS expression 


1 + 5 
1 1 1 


Term Term Term Term 


The expression on the equation’s left-hand-side (LHS) is 1 +4 and contains the two 
terms | and 4. 


The expression on the right-hand-side (RHS) is 2+3 and contains the two terms 2 
and 3. 





Informally, the difference between an equation and an expression is this: 


An equation contains an equals sign; an expression does not. 


Example 104. Consider the equation y = 5x +6. 


Variable Constant term 
= oe + | 6 
y x ~ 
Coefficient Variable 


This equation contains two variables: x and y. 
The coefficient on x is 5. The coefficient on y is 1. 


The constant term or more simply constant is 6. (The constant term is simply any 
real number!” that does not involve any variables.) 


Example 105. Consider the inequality 8x? + 47 +3>y-2. 


Constant term 


7A NX 
Sr ea dees ee) 2 
xN x 


{ 
Coefficient Variable 
This inequality contains two variables: x and y. 


The coefficients on x? and x are 8 and 4. The coefficient on y is 1. 


The constant terms on the LHS is 3. The constant terms on the RHS -2. 








103\Vhen we study complex numbers in Part IV, constants will also include complex numbers. 
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5.2. The Absolute Value or Modulus Function 


Definition 25. The absolute value (or modulus) function, denoted |-|, is defined for all 
xé€R by 


ior 72 (), 


for x <0. 





Example 106. |5| = 5, |-5] = 5, |0| = 0. 


Each of the following three equations is not generally true. 




















6 6 
Example 107. | = EB because |— 


2 ae = 
Example 108. >| = By because |— 


Ea 2 
Example 109. | = a because |— 





What’s true is this: 








Fact 10. [fa,beR with b#0, then 7 = tt 
a 0 (O| _ [al 
Proof. If a = 0, then |—] = |0| =0=—=— =—. 
b Jo] | |d| 








Now suppose a ¢ 0. 


Case 1. aand b have the same signs. 
Then either |a| /|b| = a/b or |a|/|b] = (—a) / (-b) = a/b. Moreover, a/b > 0 so that 
|a/b| = a/b. 

Case 2. a and b have opposite signs. 


Then either |a|/|b| = (-a)/b or |a|/\b| = a/(—b). Moreover, a/b < 0 so that 
|a/b| = -a/b. 














If the above proof doesn’t convince you, hopefully the following examples do: 


6| |6 6 
Example 110. A = — because | =|3|\=< and 


[2| 2 
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Example 111. F|- 5 Bp Ever al Ss 


Example 112. | 5-4 


Example 113. | sli 





We also have this similar result: 


Fact 11. Jfa,b€R, then |abj = |a} |b. 


Proof. If a,b > 0, then |ab| = ab = |a| |b]. Y If a,b < 0, then |ab| = ab = jal |b). Y If a>0 and 
b <0, then |ab| = -ab = |a||b|. Y If a< 0 and b> 0, then |ab| = -ab = |al |b). ¥ 














Example 114. |6 x 2| = |6||2| because |6 x 2| = [12] = 12 and |6||2| = 6 x 2 = 12. 
Example 115. |-6 x 2| = |-6||2| because |-6 x 2| = |-12] = 12 and |-6]|2| = 6 x 2 = 12. 


Example 116. |6 x (—2)| = |6||-2| because |6 x (-2)| = |-12] = 12 and |6||-2] = 6 x 2 = 12. 


Example 117. |(-6) (-2)| = |-6||-2| because |(-6) (—2)| = |12] = 12 and |-6] |-2| =6 x 2 = 
12, 
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5.3. The Factorial n! 





And so, O! = 1, 
TES del 
a ae aa eae 
a= 2 ea = 12x 5, 
Meo x4 = 1e2 xs <4, 
bla dle b= 1x 2*3x4 x5, 
6! =5!x6=1x2x3x---x6, 


You may be wondering, “Why is 0! defined as 1?” It turns out this is the definition that 
causes us the least overall inconvenience.!”? 





104This latter equivalent definition is an example of a recursive definition. 
10For example, with the Maclaurin series (Ch. 102) and combinatorics (Part VI). 
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5.4. Exponents 


Definition 27. Suppose b #0 and x € Z. Then b to the power of x, denoted 6b”, is the 
number defined by 


pial’ 


Given the number b’, we call b its base and x its exponent. 


Example 118. 2! =2, 2?=2-2=4, 2?=2-2-2=8, 2*=2-2-2-2=16, 
2 = 1048576. 


20 times 


1 
Example 119. 27! = 5 0.5, 2°? = 


ee ee 


Q4=-.-.---=— = (0.0625, 
222 216 





1 


= = 0.000 000 953 674 316 406 25. 
2 1048576 








ey i 
E le 120. = 
xampie (5) 5, ( 


(+) =3 1 1 1 
2 
——_— $s’ 





( 
2 
1 


Example 122. 2° = 1, (; 





Note well that Definition 27 covers only the case where the base 6 is non-zero and the 
exponent x is an integer. 


We now cover also the case where the base b is 0 and the exponent x is any real number: 
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Definition 28. Suppose x ¢ R. Then 0 to the power of x, denoted 0”, is the number 
defined by 


toner > (i 
fora =), 


Undefined, fon ar <0) 


Example 123. 0° = 0, 07 =0, 0’ =0. 
0-3, 0-7, and 07! are undefined. 


0° = 1. There’s actually nothing “obvious” or “natural” about defining 0° = 1. Similar to 
0! = 1, this is merely the definition that will cause us the least inconvenience.1° 





Next, we’ll define roots. But to do so, we’ll need to make use of this result: 


Theorem 1. Suppose b > 0 and x is a non-zero integer. Then there exists a unique 
positive real number a such that 








Proof. Omitted.!°" 











Example 124. Let 6 = 18.73 and x = 39. Then Theorem | says that there exists a > 0 
such that 


a = 18 72 


Definition 29. Let b > 0 and x be a non-zero integer. By Theorem 1, there exists a 
unique positive real number a such that 


a 


We call a the xth root of 6 and write a = Vo. 


We'll also say a is b to the power of 1/x and write a = ue 

















106One convenience this definition affords is this: For all b ¢ R, we simply have b° = 1. 





Some writers argue that we should simply leave 0° undefined. Ss 


But I think 0° = 1 is the definition that will cause us the least inconvenience in A-Level Maths. 4 
107See e.g. Rudin (Principles of Mathematical Analysis, 1976, p. 10, Theorem 1.21). Note that our 
Theorem 1 also allows for the possibility that the exponent x is a negative integer. 
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Example 125. By Definition 29, 


42 = W/4=V/4=2, 





; 1\3 
82/822 (; S 
= V8 2 8 a” 


1 \3 ibe 
16224716 =2, (=) =).06254= ae = (00625 = 0.5 = a 


16 
104857620 = ¥/1048576 =2. 


1 
1048 576 


= */0.000000 953 674 316 406 25 = 0.5 = . 


: ; u 20 
—S_ =a(h 4 1 A D) a 
Grea 0.000 000 953 674 316 406 2520 





Definition 29 requires that b> 0. If b< 0, then the definition doesn’t apply: 
Example 126. There is no positive number (or indeed any real number) a such that 
a 


So, for now, we'll leave -1'/? = V/-1 undefined. 
(Spoiler: Later on in Part IV (Complex Numbers), we will define i= -1!/? = V-1.) 





Definition 29 also requires that x #0. If x =0, then Vb or bo is simply undefined. 
For some special cases, we have these customary notation and terminology: 
e If x =1, we won’t write a= Vb. Instead, we'll simply write a = b. 


e If x = 2, we won’t write a = Vb. Instead, we'll simply write a = Vb. Moreover, we'll 
usually call it the square root of b. 


e Similarly, we'll usually call a = Vb the cube root of b. 


We next extend our definition of b” to the case where b > 0 and z is any rational number: 


Definition 30. Let b> 0 and x € Q. Suppose x = m/n for some integers m and n # 0. 
Then b to the power of x is denoted b* and is the number defined as 
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Example 127. By Definition 30, 


Bye | 20S 1a 


i ; 1383 
QE | 2) (007 ee Ro Tae 
= (1.259...) = 5.039... 
Proposition 1. (Laws of Exponents) Suppose a,b>0 andx,y¢R. Then 
1 


(a) bY = 0", (b) B=. (C) = oP, (d) (b*)¥=b. — (e) (ab)® = ab". 





Our partial proof below covers only the case where x,y € Z* (exponents are positive inte- 
gers). (This partial proof can easily be extended to cover also the case where x and/or y 
are non-positive integers. ) 


For a more complete proof covering also the case where x and y are any real numbers, see 
p. 1726 (Appendices). 


Proof (partial). Suppose x,y € Z*. Below, = will denote the use of Definition 27. 





(a) b°bY 2b-b----- Debs lesen a) a, b = bt. 
eS Oe ees SN 
x times y times x+y times 
- 1 
(b) Since x > 0, we have |-2z| = x and hence b* = eal 
(c) If x>y so that x-y>0, then 
y times x times 
—_— FF 
‘ b- bees 6b, bebe ee 6b bt 
be Y=b-b-.--:-: b=b-b--:-: b = = 
a . be beeee b-b----- 6b bY 
x-y times xz—y times een aut aut 
y times y times 


If instead x < y so that x -y <0 and |x -y| = y-<, then 


x times x times 
ye-y 2 1 1 _ 1 08 tess b  b-b eee bb 
ple-yl = a ee ee Bo bebidas bby 
-—_—~ -—~ ——_ 
|c—y|=y-x times y-« times x times y times 
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y times 














Oo ee ee eer 
(d) (b7)¥ = (b-b- +. b)Y = (bebe b) Rive (Deb ee: b) =D-De b = bt, 
x times x times x times ty, VINES 
(e) (ab) = (ab) (ab)...(ab) =a-a--:: axb-beees b= a", 
x times a times x times 


Exercise 63. Let x €¢ R. Simplify each expression. (Answer on p. 1758.) 


54a ‘ Q51-« Qx+2 eye 23x 
 , b 2—_——__.—_—_ 
52a+1 aos Re 7. 52a ( ) v2 Yaa 


(a) 
Exercise 64. Let b>0 and z,y¢€ Z*. Explain whether each equation is generally true. 
(a) BO) = bY, (by ey So. (Answer on p. 1758.) 


Remark 21. We have defined b” to cover several cases. 


However, we still have not defined b” in these two cases: 

1. The base 6 is positive and the exponent 2 is irrational. 

2. The base 6 is negative and the exponent x is a non-integer. 
Unfortunately, defining these is somewhat beyond the scope of H2 Maths. 


Yet at the same time, in H2 Maths, we will often deal with the above two cases of b”, 
which is a little discomforting. 


One comfort is that with Case 1, Proposition 1 (the Laws of Exponents) still holds. 


With Case 2, those Laws of Exponents don’t always hold and we should be a little more 
careful. Nonetheless, for H2 Maths, there is probably little danger in simply assuming 
that these Laws always hold. 
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5.5. The Square Root Refers to the Positive Square Root 
By Definition 29, 


The square root refers to the positive square root. 


Example 128. It is true that the equation b? = 25 has two solutions, namely, 
b=5 b=-5. 


However, each of the following four statements is false: 


952 = 25. X 953 =-5. X /25 = +5. X VI = =5 ek 


By Definition 29, 2542 or 25 isa positive number. So, 
25u? = ./95 = 5. 
To talk about the negative square root —5, (simply) stick a minus sign in front: 


=251? = —/25 =-5 





In general, given any b>0, the square root of b is the positive number denoted 


Vb > 0. 


The negative square root of b is the negative number has a minus sign in front: 
-Vb<0. 


This seems like a trivial and “obvious” point. You may find it strange that I have devoted 
this subchapter to this point. But overlooking this point is a common source of error: 


Exercise 65. True or false: “Vx? = x for all x € R” (Answer on p. 1758.) 
We 


Exercise 66. True or false: “For all  ¢ Rx {0}, — 1.” (Answer on p. 1758.) 
Te 


Exercise 67. True or false: “For all x ¢ R\ {0}, a 1.” (Answer on p. 1758.) 
a 
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Remark 22. Because of the “ambiguity” just discussed, some writers refer to V25 as the 
principal square root of the number 25. More generally, given any non-negative number 
x, they call the positive number \/z the principal square root of x. 


We shall however have no use for the term principal in this textbook. The reason is that 
there is in fact no ambiguity at all—we have in this textbook clearly defined the positive 
number 25 to be the square root of 25. There is thus no need to refer to 25 as the 
principal square root of 25. 


Remark 23. Definition 29 is to some extent an arbitrary convention, similar to whether 
we drive on the left or right side. Singapore drives on the left side of the road (because 
Singapore was part of the British Empire). But Singapore could just as easily drive on 
the right side of the road (which is what nearly all of the world except the former British 
Empire does). Except it doesn’t and it would be quite unwise to try to go against the 
accepted convention. 


Here similarly, Definition 29 says that the square root (of a positive number 0) is itself 
a positive number. But mathematicians could just as easily have chosen the opposite 
convention. Except they didn’t and it would be quite unwise to try to go against the 
accepted convention. 


It’s not really important which convention a community chooses. What’s important is 
that everyone in that community knows what the convention is and agrees to use it, hence 
reducing confusion and accidents (whether in maths or on the road). 
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In Exercise 65, we gave this false statement: 
For all eR, Vx? =. x 


The correct statement should instead be this: 


Fact 12. For all x ¢R, Vr? = |z\. 


Proof. If x >0, then V2? = x = |z|. If x <0, then V2? = -2 = |z\. 














Here’s another result we’ll find useful: Given any x # 0, the expression 


a x 
x In| 


gives us its sign. That is, 





Prooy, MW e> 0, then) a) Saye = 1 and |p)/eei/ (le e)=1ileL 
If x <0, then 2/ |x| = x/ (-x) =-1 and |z| /x = 1/ (\z| /x) = 1/(-1) = -1. 














Example 129. In Exercise 66, we gave this false statement: 


Ve Ve, 


Using the above two facts, the correct statement should instead be this: 


For all x € R \ {0}, 


D f 0 
For all weRv {0}, ¥™ -l_ or > U, 
x 


for x < 0. 


Example 130. Similarly, in Exercise 67, we gave this false statement: 


= ve, 
ea eas 


Using the above two facts, the correct statement should instead be this: 


For all x € R \ {0}, 


for x > 0, 


Us 
For all xe R\ {0}, —==— = 
Vx? for x <0. 
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5.6. Rationalising the Denominator with a Surd 


The term surd usually refers to a square root \/-. 


! ror 1 v2_ V2 
Yo 7 


At Bo we used what Ill call the Times One Trick (TOT). It is of course always 








Example 131. 


legitimate to multiply any quantity by 1, though one might wonder why we’d ever want 
to do that. The above is one example of when the TOT comes in handy. 





Given a denominator with a surd, we can often rationalise it by using the TOT, the conju- 
gate, and the identity (a +b) (a —b) = a? — b?: 


1 or 1 1-/2. 1-Vv2 _i-v2_1-vV2_ 4 
=——= 


ie 1+V2 1-V2 p-(ap 1-2 ie 


Example 132. 





We take this opportunity to talk about the + and ¥ notation: 


1 2 2- 
Example 133. We have ——— Ter 2% va =2-¥V3. 


ae 53 08) 23 
[ror Aye 
2/3 2/3 243 = rag 72+ V8. 


Using plus-minus + and minus-plus ¥ notation, we can combine the two statements * 
and ) into this single statement: 


1 1 24/3 27/3 _ 
Tvl e Ge oe 








And, 





The minus-plus notation ¥ indicates the “opposite” of +. So here, 


e If + is +, then + is —- and we have *. And 





e If + is —, then + is + and we have ). 
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108The LHS is the quadratic formula in familiar form. The RHS is also the quadratic formula, but in an 
alternative and less familiar form. 
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5.7. Logarithms 


Informally, logarithms are simply the inverse of exponents. A bit more formally, 


Definition 32. Let x €« R and b,n > 0 with 6 #1. If b” =n, then we call x the base-b 
logarithm of n and write 


x = log,n. 








Example 134. 2?=8 <=> 3=log,8. 





You should find these Laws of Logarithms familiar: 


Proposition 2. (Laws of Logarithms) Let x¢R and a,b,c>0 with b#1. Then 
(a)  log,1=0 
(b) log, b=1 
(ec), “los, b7 = 
(doa 


(e) clog,a= log, a° 


1 
(f)  log,—=-log,a (Logarithm of Reciprocal) 
a 


(g) log, (ac) = log, a + log, c (Sum of Logarithms) 


(h) log, _ log, a — log, c (Difference of Logarithms) 
C 
forc#1 (Change of Base) 


1 
(j) logyse= b loge: fora#l 





Proof. In this proof, —=> denotes the use of Definition 32. 


(a) bo =1 <=> log,1=0. 
(b) bt =b <=> log, d= 1. 
(c) b° =b" <=> log, b" =a. 
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(d) Let y=log,a <=> DY =a <— > HS =a, 
(e) First use Proposition 1(d): 
pores (ples 2)" ge ts clog, a = log, a. 


1 e 
(f) log, - = log, aot (e) _ logy a. 
(g) First use Proposition 1(a): 


pos. atlog,c _ p!8o aplosy ¢ (qd) ac rane log, (ac) = log, a +P log, C. 





1 1 i 
(h) log, = log, («. -} ®) log, a + log, — 2 log, a — log, -. 
é C C C 
(i) Let y = log,a or bY =a. Let z =log.a or c* =a. 
l 
Hence, b’ = c* or z = log, b’ = ylog.b or log.a = log, alog.b or log,a = ae 
O8e 





(@) log,c @ log,e 1 
jog ae ob , 








(k) log,» C 





Exercise 70. Show that each expression equals 2. (Answer on p. 1760.) 


it 
(a) log, 32 + log, a (b) logs 45 — logy 25 (c) log, 768 — log, V3 


Remark 25. Some (including your TI84) write logxz to mean the base-10 logarithm of 
x. Still others write log x to mean the natural logarithm of x. (Note that at this point, 
we still haven’t discussed what the natural logarithm function is. We will do so only 
in Ch. 28.) 


We have thus a rather confused situation with different writers using different notation. 


In this textbook, we'll stick strictly to this notation on your H2 Maths syllabus (p. 18): 


log , x logarithm to the base a of x 


Inx natural logarithm of x 


Ig x logarithm of x to base 10 


We will only ever write log, x and even then fairly rarely. We will never write log x. 
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5.8. Polynomials 


Definition 33. Let co,c,...,¢n be constants, with c, #0. The expression 


OD Pen 7 te or eer ty 


is called an nth-degree polynomial (in one variable x). We also call 


¢ Each cx’ the ith-degree term (or more simply the ith term); 


¢ Each c; the ith coefficient on x' (or the ith-degree coefficient, or the ith coefficient); 


¢ The Oth coefficient co the constant term (or, more simply, the constant). 
A (nth-degree) polynomial equation (in one variable x) is any equation 


2 


Cb +O ae Ct ce, = 0) 


or any equation that can be rewritten in the above form. 





As usual, in the above definition, x is merely a dummy variable that can be replaced 
with any other symbol, like y, z, ©, or y&. 


Example 137. The expression 7x — 3 is a lst-degree or linear polynomial. 


The equation 7x - 3 = 0 is a 1st-degree polynomial (or linear) equation. 


lst-degree term Oth-degree term 
7 


IS 

am cm: a 
(a —3 
x 


——, 


1st-d ficient  ™ 
shige Bh shat a Oth-degree coefficient 


Term | Coefficients 
Oth-degree | -3 =3 
lst-degree | 7x 7 














The Ist-degree term is 72. 
The lst-degree coefficient is 7. 


The Oth-degree term, the constant term, the constant, and the Oth-degree coef- 
ficient is —3. 
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Example 138. The expression 3x? + 47 —5 is a 2nd-degree or quadratic polynomial. 


The equation 327 + 42 — 5 = 0 is a 2nd-degree polynomial (or quadratic) equation. 


lst-degree term 
2nd-degree term ~— t 7 Oth-degree term 
a — 


a’ 
2B Teele = 
2nd-degree coefficient { “ Oth-degree coefficient 
lst-degree coefficient 


Term | Coefficient 
Oth-degree | -5 
lst-degree | 47 

















Qnd-degree | 32” 


Example 139. The expression —5x® + 2x + 9 is a 3rd-degree or cubic polynomial. 
The equation —5x? + 2x +9 = 0 is a 3rd-degree polynomial (or cubic) equation. 


When a particular coefficient is 0, we usually don’t bother writing out that term (as is 
the case here with the 2nd-degree term). 


2nd-degree term 


3rd-degree term ft a ist-degree term 
Comm a oe 
3rd-degree coefficient — —5a° + 0x2? +2x2+ 9 


4 
2nd-degree coefficient { Oth-degree term & coefficient 
1st-degree coefficient 


Term | Coefficient 
Oth-degree 9 
lst-degree | 27 











2nd-degree | 02? 





3rd-degree  —5x® 











You get the idea. We also have 4th-, 5th-, 6th-, ... degree (or quartic, quintic, sextic, 
...) polynomials and equations. 


Example 140. Technically, the expression 7 could be regarded as a Oth-degree poly- 
nomial, because 7 = 7z°. Indeed, in certain contexts, it may be convenient to refer to 
the expression 7 as such. 





But more commonly, we’ll simply call the expression 7 a constant. 
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This textbook will look mostly only at polynomials in one variable. So unless otherwise 
stated, when we say polynomial, we’ll always mean a polynomial in one variable. 





But in case you were wondering, an example of a polynomial in two variables is Ar + Bry + Cy. 
In general, the degree of each term in a polynomial is the sum of the exponents on the variables. And 
the polynomial’s degree is simply the highest such degree. So here, the term Az has degree 1, Bry has 
degree 2, and C'y has degree 1. So this polynomial is of degree 2. 

Another example of a polynomial in two variables is Ax? + Bry + Cy? + Dx + Ey + F. Despite looking 
more complicated than the previous example, this polynomial also has degree 2 because the greatest 
sum of exponents on any term is again 2. 

And by the way, as Ch. 142.20 (Appendices) discusses, the conic section is, in general, described by 
this 2nd-degree polynomial equation in two variables: 


Ax? + Bry + Cy? + Da + Ey+ F =0. 
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6. O-Level Review: Inequalities 


Ch. 41 (Solving Inequalities) will go through inequalities in greater depth. This chapter 
simply reviews some basics you should find at least vaguely familiar: 


6.1. The Sign of the Product of Two Real Numbers 


2 
Z 
ie) 
5 
a 
o 
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za 
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= 
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et 
A 
et 
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Example 141. If a=2>0 and b=3>0, then ab=6> 0. 


iw) 
—_ 
aan) 
Q 
oo 
A 
i=) 
ot 
= 
oO 
=) 
g 
om 
V 
=) 


) ) 


Example 142. If a=-2<0 and b=-3<0, then ab=6>0. 


3. If a and b have opposite signs (i.e., a>0 AND b<0 or a<0 AND b> 0), then ab <0. 


Example 143. If a=2>0 and b=-3<0, then ab=-6 <0. 


Example 144. If a=-2<0 and b=3>0, then ab=-6< 0. 


4. And of course, if either a = 0 or b =0, then ab =0. 


Example 145. If a=0 and b=3>0, then ab=0. 


Example 146. If a =-2 <0 and b=0, then ab=0. 


Exercise 71. Given some x € R, what are the sign of 7z and -7x? (Answer on p. 1760.) 
More generally, 


Fact 14. Let a1, a9,...,dn € R. Suppose P =aja)...dn. Then 


(a) P=0 <=> At least one of aj, ao, ..., and Gp is zero. 


(b) P>O <=> An even number of a1, ag, ..., and a, are negative (and the rest positive). 


(c) P<0 <=> An odd number of ay, ag, ..., and dy, are negative (and the rest positive). 








Proof. See p. 1612 (Appendices). 











Example 147. XXX 


Exercise 72. XXX (Answer on p. 94.) 
A72. 
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6.2. Multiplying an Inequality by an Unknown Number 


Given the equation a = b, we can multiply by any number «x to get 
ax = bx. J 


That is, an equality is preserved when we multiply it by any number. 


In contrast, an inequality is not always preserved when we multiply it by a number: 


Example 148. Consider the inequality 2 > 1. 


(a) It is preserved if we multiply it by a positive number like 3: 


Dea 1s or 06> 3. 


(b) It is reversed (or flipped) if we multiply it by a negative number like —3: 


2 x (-3) < 1 x (-3) or —6 < -3. 
(c) It becomes an equality if we multiply it by zero: 


2xQ=1x0 0=0. 





The above seems obvious. But a common mistake is to multiply an inequality by some 
unknown number x and expect it to be preserved: 
Example 149. Let x ¢ R. Beng reasons, “We know that 8 > 5. Therefore 8x > 5.” 
Beng’s reasoning is wrong: 
(a) If 2 >0, then yea, Beng happens to be correct and 82 > 5a. 
(b) If x <0, then 8x < 5a and he’s wrong. 
(c) If 2 =0, then 8x = 5x (= 0) and he’s again wrong. 





In general, 


Fact 15. Let a,b,x¢€R. Suppose a> b. 


(ayy 2s Oy ihen-an > ba. 
(b) i <0) thenan < bz. 


(Gye = 0, hen ar = b="): 


Analogous results also hold if we replace a>b with a> b. 





Proof. Since a > b, we have a- 6 > 0. 

(a) If x > 0, then by Fact ??(b), (a—b)x>0 or ax - bx >0 or az > ba. 
(b) If x <0, then by Fact ??(c), (a-b)x <0 or ax - bx <0 or az < ba. 
(c) If z =0, then by Fact ??(a), az = 0 and br =0. 


The proof of the last sentence is similar and omitted. 
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So, in general, we may have to break our analysis into two (or three) cases, depending on 
whether x is positive, negative, or zero: 


Example 150. Suppose a, b,x ¢« R with x #0 and 
1 
) 


What can we say about a and b? 


1 
Beng reasons, “Multiply > by x to conclude that a > b.” 


As usual, Beng is wrong. We must break our analysis down into two cases: 


i 
(a) If « > 0, then yea, he happens to be correct—we can indeed multiply > by x to 
conclude that a > b. 


1 
(b) But if « <0, then multiplying > by x reverses the inequality, so that a < b. 


Correct reasoning: 
If x > 0, then a > b. But if x <0, then a< b. 
By the way, why did we specify that x + 0?11° 
Exercise 73. Solve each inequality. (Answers on p. 1760.) 


22+1 2s 


fee) rey 


(a) = * 0 (b) =7>0 @ = >0. (d) 








NOTE x = 0, then a/x and b/x would involve the cardinal sin of dividing by zero. 
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6.3. The Sign of the Reciprocal of A Real Number 
Given a € R, what is the sign of ~? 


Lit oO, fen Si) 
a 
1a | 
Example 151. If a=2>0, then — = 3 2, 
a 


2: ace Mien ea. 
a 


iy 1 
Example 152. If a =-2< 0, then — = mas < 0. 
a 





And of course, 
3. If a=0, then : is undefined. 
a 


The following result summarises the above observations: 


Fact 16. [fa#0, then a has the same sign as 1/a. 


1 i 
Proof. Since a- — =1>0, by Fact ??, a and — must have the same sign. 
a a 














Equivalently, a>0 <= -—>0 and a<0 — -<0. 
a a 
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6.4. Does Taking Reciprocals Always Preserve an Inequality? 
No: 


Fact 17. Suppose a>b (with a,b #0). 


deed! 
(a) Ifa and b have the same sign, then — < =. (Reciprocation reverses order) 
a 


b 


| apa | 
(b) Ifa and b have opposite signs, then — > 5 (Reciprocation preserves order) 
a 





Proof. (a) Since a> b, b-a £0. 
1 
Since a and b have the same sign, ab > 0 and so by Fact 16, ab >i; 
a 


b- b i oe 

By é and Fact 15(a), ne 20: Rearranging, — < = Equivalently, — < -. 
ab ab ab a Db 

(b) Since a > b and a and b have opposite signs, it must be that a >0 and b< 0.141 So, by 
Fact 16, 1/a>0, 1/b< 0. Hence, 1/a > 1/b. 














Example 153. XXX 
Example 154. XXX 


Exercise 74. XXX (Answer on p. 98.) 
ATA. 





Tt cannot be that a <0 and b> 0. 
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6.5. Does Taking Square Roots Always Preserve an Inequality? 


Fact 18. Let a,b>0. If a>b, then a> Vb. 


Proof. First, \/a, Vb > 0. So, (Ja) =a and Vb=b. 
Hence, if /a < Vb, then a = vay < (vo) = b. 


Thus, by the contrapositive of the last implication, we have the claim. 














Next, if a <0 or b<0, then Va or Vo is simply undefined, so that the statement /a > Vb 
is meaningless. !” 


Altogether, the answer to the above question is, “Yes, taking square roots always preserves 
an inequality.” 


6.6. Does Squaring Always Preserve an Inequality? 


Example 157. 5 > —5 but 5° = (-5)° 
To properly answer the titular question, we first need some preliminaries: 


Fact 19. Suppose ae R. Then 


(a) a#0 a” > 0; and 


(b) a=0 





Example 158. 2+0 <> 2?>0 


Example 159. -240 <=> (-2)°>0 


Proof. (a) By Fact 14(b). 
(b) (=) Ifa=0, then a? =0x0=0. 
(<=) If a +0, then by (a), a? > 0 and in particular a? + 0. 

















12ven allowing for complex numbers, if either c or d is not real, then the statement c > d still remains 
meaningless (Ch. 79.3). 
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Proof. If a> 0, then |a| = a and so Jal” = a’. 





If a <0, then |al = -a and go |al’ = (-a)* = a?. 











The titular question is somewhat tricky and requires distinguishing between several cases: 





Proof. Below, * denotes the use of Fact 20. 

(a) Suppose a < 0. Then by S, b<0. 

Also, |a| = -a rae [0| 

So, a? = al? < |b)? 282. 

(b) Suppose a >0 and 6>0. Then a = |al > |b| = b. So, a? = lal” > |b)” = 0”. 
(c) Suppose a > 0 and b<0. 

(c)(i) Suppose a > |b]. Then a? > ||” 2 8. 

(c) (ii) Suppose a < |b|. Then a? < |b = 07. 

(c) (iii) Suppose a = |b]. Then a? = bf = b?. 
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6.7. Does Exponentiation Always Preserve an Inequality? 


0: its 


Z 


Example 160. 5 >3 but 5? < 3 (1/25 < 1/9) 
Example 161. 5 >3 but 5° = 3° (1 = 1) 
Example 162. —2 > -7 but (-2)° < (-7)* (4< 49) 


Example 163. 3 > -5 but 3? < (-5)* (9 < 25) 


ip) 


o, “if a>b, then a* > b*” is not always true. Nonetheless, it is true if a,b, x > 0: 


Fact 22. Let x>0 andb2>0. Ifa> bd, then a” > 6’. 


Example 164. 5 >3 and 5? > 3? (25 >9) 


Example 165. 5 >3 and 5” S of. 


Again, in the main text, we haven’t actually defined b” in the case where « is irrational. 
1 
Nonetheless, > is true, with 156.99 x 5” > 37 x 31.544. 





Example 166. 5 > 0 and 5‘ > 0* (625 > 0) 
To prove Fact 22, we’ll make use of these “obvious” results: 


Fact 23. Let x > 0. 
(a) Ifb>0, then b* >0. 


(byt; b> I thenb? > 1. 

















Proof. See p. 1726 (Appendices). 
We can now prove Fact 22: 


Proof. Since a,x >0, by Fact 22(a), a” > 0. 
If 6= 0, then 6° =0* =0, so that a” > 6”. 














If b> 0, then b” > 0, so that = 2 (+) 5 dt 2 and 5 use Proposition 1 and Fact 22). 





3By the way, the previous three subchapters simply looked at special cases of exponentiation: Given b”, 
e Taking reciprocals is equivalent to setting x = -1; 
e Taking square roots is equivalent to setting x = 1/2; 


e Squaring is equivalent to setting x = 2. 
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Part I. 
Functions and Graphs 





Revision in progress (November 2021). 


And hence messy at the moment. 
Appy polly loggies for any inconvenience caused. 
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[A] large part of mathematics which became useful developed with absolutely no 
desire to be useful, and in a situation where nobody could possibly know in what 
area it would become useful; and there were no general indications that it ever 
would be so. By and large it is uniformly true in mathematics that there is a time 
lapse between a mathematical discovery and the moment when it is useful; and that 
this lapse of time can be anything from thirty to a hundred years, in some cases 
even more ... 


This is true for all of science. Successes were largely due to forgetting completely 
about what one ultimately wanted, or whether one wanted anything ultimately; in 
refusing to investigate things which profit, and in relying solely on guidance by 
criteria of intellectual elegance; it was by following this rule that one actually got 
ahead in the long run, much better than any strictly utilitarian course would have 
permitted. 


— John von Neumann (1954). 
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7. Graphs 


7.1. Ordered Pairs 


Recall that with sets, the order of the elements doesn’t matter: 


Example 167. {Cow, Chicken} = {Chicken, Cow}. 
Example 168. {-5,4} = {4,-5}. 


We now introduce a new mathematical object called an ordered pair (a,b). Like the 
sets {Cow, Chicken} and {-5,4}, you can think of an ordered pair as a container with two 
objects. 


But unlike sets, with ordered pairs, the order matters (hence the name).'“ 


Definition 34. Given the ordered pair (a,b), we call a its first or x-coordinate and 6 its 


second or y-coordinate. 





Two ordered pairs are equal if and only if both their z- and y-coordinates are equal. 


Fact 24. Suppose a, b, x, and y are objects; and (a,b) and (x,y) are ordered pairs. Then 





(a, b)= (4,4) — a=2 AND Gay. 





Proof. See p. 1554 in the Appendices. 











Example 169. Consider these two ordered pairs: (Cow, Chicken) 
(Chicken, Cow). 


The ordered pair (Cow, Chicken) has x-coordinate Cow and y-coordinate Chicken. 
The ordered pair (Chicken, Cow) has x-coordinate Chicken and y-coordinate Cow. 
Since these two ordered pairs have different x- and y-coordinates, they are not equal: 


(Cow, Chicken) # (Chicken, Cow). 


This is in contrast to what we saw above with sets, where: 


{Cow, Chicken} = {Chicken, Cow}. 


To distinguish an ordered pair from a set with two elements, we use parentheses (instead 
of braces). Be very clear that the ordered pair (a, b) is a completely different mathematical 
object from the set {a,b}: 

(Cow, Chicken) # {Cow, Chicken}; 


(Chicken, Cow) # {Chicken, Cow}. 








‘M4¥For the formal definition of an ordered pair, see p. 1554 in the Appendices. 
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Example 170. Consider these two ordered pairs: 


(-5,4) and (4,-5). 


The ordered pair (—5,4) has x-coordinate —5 and y-coordinate 4. 
In contrast, the ordered pair (4,-5) has x-coordinate 4 and y-coordinate —5. 


Since these two ordered pairs have different x«- and y-coordinates, they are not equal: 
(-5, 4) # (4, -5) 


This is in contrast to what we saw above with sets, where {-5,4} = {4,-5}. 


Again, be very clear that the ordered pair (a,b) and the set {a,b} are two completely 
different mathematical objects: 


(-5,4) # {-5, 4} and (4,-5) # {4,-5}. 





It is possible that in an ordered pair (a,b), we have a = b: 


Example 171. Here are four ordered pairs that are distinct (or not equal): 
(Cow, Chicken), (Chicken, Cow), (Cow, Cow), and (Chicken, Chicken). 


The ordered pair (Cow, Cow) has x-coordinate Cow and y-coordinate Cow. 


The ordered pair (Chicken, Chicken) has x-coordinate Chicken and y-coordinate Chicken. 


Example 172. Here are four distinct ordered pairs: 


Can meal (a) 


The ordered pair (1,1) has x-coordinate 1 and y-coordinate 1. 





The ordered pair (-5,-5) has x-coordinate —5 and y-coordinate —5. 
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‘The OED editor Peter Gilliver spent nine months working on the word run (New York Times, 2011). 
Previously, set had the most different meanings, at 430 (Guinness Book of World Records). 
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7.2. The Cartesian Plane 


Rather than ordered pairs of cows and chickens, we’ll usually be concerned with ordered 
pairs of real numbers: 


Definition 35. A point is any ordered pair of real numbers. 


Example 173. A = (-5,4), B= (1,1), and C = (2,-3) are points 
D = (Cow, Chicken) and E = (3, Chicken) are not. 


Definition 36. The cartesian plane is the set of all points: 


1(%,y) 2a, y eR}. 


Fun Fact 


The cartesian plane’! is named after René Descartes (1596-1650), who’s also the same 


dude who came up with “Cogito ergo sum” (“I think, therefore I am”). Ww 





Definition 37. The origin is the point O = (0,0). 





6There’s some disagreement over whether to capitalise cartesian here—see e.g. [mathoverflow, 
Indeed, in your H2 Maths syllabus, it used to be capitalised on p. 19 but not on pp. 7-8! (My guess is 
that while pp. 1-15 were written by the local Singapore authorities, pp. 16-20 were simply copy-pasted 
from some standard Cambridge notation template.) 
[2020 update: They’ve now corrected this in the latest 2021 syllabus. Now cartesian on p. 19 is also 
not capitalised.| 
My personal preference is to capitalise cartesian, but it seems that the A-Level exams do not do so. I 
shall therefore follow the sacred A-Level exams by not capitalising cartesian. 
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Example 174. The cartesian plane below is centred on the origin O = (0,0) (a-coordinate 
0 and y-coordinate 0) and stretches out to oo in both the z- and y-directions. 


Three points are depicted: 
e A=(-3,2)has x-coordinate -3 and y-coordinate 2; 
¢« B=(1,1) has z-coordinate 1 and y-coordinate 1; and 


e C =(1.5,-1) has x-coordinate 1.5 and y-coordinate -1. 


Ay 








When depicting the cartesian plane, it is customary (and helpful) to draw the z-axis (or 
horizontal axis) and y-axis (or vertical axis). 


The point at which these two axes intersect is the origin O = (0,0). 


Remark 27. For now, we'll be concerned only with the cartesian plane, which is a two- 
dimensional space.!’ 


And so, for now, whenever we say point, it should be clear that we’re talking about an 
ordered pair of real numbers. 


Note though that more generally, it is also to talk points in a one-dimensional space and 

a three-dimensional space: 

¢ A point (in one-dimensional space) is simply any real number. 

e As we'll learn in Part III (Vectors), a point (in three-dimensional space) is any ordered 
triple of real numbers. 


By the way and just so you know, a point (in any-dimensional space) is itself a zero- 
dimensional object. 




















7Formally, we may define a two-dimensional space to be any subset of the cartesian plane R?. Similarly, 
a one-dimensional space is any subset of the real number line R and a three-dimensional space is any 
subset of R*. See Definition 271 (Appendices). 
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7.3. A Graph is Any Set of Points 
You’re probably used to thinking of a graph (or a curve) as a “drawing”. But formally, 
Definition 38. A graph (or curve) is any set of points. 
So equivalently, a graph is any subset of the cartesian plane. 


Example 175. The set G = {(-3, 2), (1,1), (1.5,-1)} contains three points. And so by 
definition, G is also a graph. 


We've defined graph as a noun (it is a set of points). But at the slight risk of confusion, 
we'll also use graph as the verb meaning to draw a graph. So, we can either say, “The 
graph G is drawn below,” or, “G is graphed below”. 





Example 176. The set H = {(-3, 2) , (1.5, -1)} contains two points. And so by definition, 
FH is also a graph. 


(-3, 2) 


The graph 
H = {(-3,2),(1.5,-1)} 


contains two points. 
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Example 177. The set J = {(1.5,-11)} contains one point. And so by definition, J is 
also a graph. 


The graph 
1) 


contains one point. 
| 








If a set contains at least one element that isn’t a point, then it isn’t a graph: 


Example 178. Consider J = {(-5,4), Love}. 


The set J contains two elements—the point (—5,4) and the abstract concept called Love. 
Since J contains at least one element that isn’t a point (ie. an ordered pair of real 
numbers), J is not a graph. 


Example 179. Consider Kk = {(-5,4),(1,1),1}. 


The set K contains three elements—the points (-5,4) and (1,1), and the number 1. 
Since K contains at least one element that isn’t a point (ie. an ordered pair of real 
numbers), K is not a graph.''® 


Example 180. The set R contains at least one element that isn’t a point (for example, 
5). Indeed, R contains no points at all and infinitely many elements that are not points. 
Thus, R is not a graph. 


(The same is true of Q and Z.) 





You may be used to thinking of a graph as a “drawing”. But you should now think of a 
graph as being simply a set of points. A “drawing” of a graph is not the graph itself, but 
merely a visual aid." 





118 As explained in Remark 27, a real number can also be regarded as a point in one-dimensional space. 
However, as stated earlier, for now, whenever we say point, we mean a point in two-dimensional space. 
That is, a point is strictly an ordered pair of real numbers. And so, following this usage, the number 1 
here is not a point. 

19 Albeit. a tremendously helpful one. Indeed, analytic or cartesian geometry was one of the ma- 
jor milestones in the history of mathematics. The idea of combining algebra and geometry is today 
“obvious” even to the secondary school student, but wasn’t always so to mathematicians. 
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7.4. The Graph of An Equation 


In the previous subchapter, we looked at graphs that were simply sets of random, isolated 
points. 


But going forward, we’ll be looking mostly at graphs of equations (and shortly, also of 
functions): 


Definition 39. The graph of an equation is the set of points for which the equation is 
true. 


Example 181. Let G be the graph of the equation y = 2 +2. Then 


G 2444) yeas 2). 


In words, G is the set of points (x,y) for which the equation y = 7 +2 is true. And so, G 
contains 


(-1,1), because 1 = -1+ 2; and (1,3), because 3 = 1+ 2. 


We can say, “Below we’ve graphed the equation y = x + 2” or more simply, “Below is 
graphed y = x + 2,” or, “Below is graphed G”. 


3 ca 


The graph of the equation 
y = 2+ 2 is the set 
G={(z,y)i y= + 2}, 
which contains 


(-1,1) and (1,3). 


(=4,1) 








We'll also say that the equation y = x +2 describes a line—specifically, the line with 
gradient 1 and which passes through the point (0,2). 





(We'll learn more about lines and gradients in Ch. 8.) 
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Example 182. Let H be the graph of the equation y = x7. Then 


HAG) es |. 


In words, H is the set of points (x,y) for which the equation y = x? is true. And so, H 
contains 


(1,1), because 1 = 17; and (2,4), because 4 = 2?. 





We'll also say that the equation y = 2? describes a parabola. 
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7.5. Graphing with the TI84 


Our first examples of using the TI84 (see “Use of Graphing Calculators”, p. xxxvii): 


Example 183. Graph the equation y = 2”. 


1. Press to turn on your calculator. 

2. Press to bring up the Y= editor. 

3. Press R@SMRN to enter “X”; then to enter the squared “?” symbol. 
4. Now press and the calculator will graph y = 2”. 


Step 1. Step 2. Step 3. Step 4. 


(Screenshots are of the end of each Step.) 











Example 184. XXX 
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7.6. The Graph of An Equation with Constraints 


Example 185. Let G, be the graph of the equation y = x + 2 with the constraint x > 3. 
Equivalently, let G; be the graph of y=x2+2, 7>3. Then 


Gi={(yy) yaa +2, 23). 


In words, G'; is the set of points (,y) for which “y = 7+2 AND x > 3” is true. And so, 
G, contains 


e (5,7), because 7=5+2 AND 523. 
e (3,5), because 5=3+2 AND 323. 


In contrast, G; does not contain (1,3), because 1 } 3. 


Gi = {(a,y):y=2+2,0 23} 








3 


Above we labelled our graph as G; = {(2,y):y=2+2, x>3}. But going forward, we’ll 
be a little lazy/sloppy and simply label it as y = x + 2, x > 3 (as done below), with the 
understanding that this is the graph that satisfies the equation and constraint contained 
in that label. Nonetheless, you should always remember: 


A graph is a set of points. 
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Example 186. Let G2 be the graph of y=x7+2, 7 >3. Then 


Go ={(z,y): y=2+2, c> 3}. 


The set G2 is exactly the same as G but with one difference—the constraint (inequality) 
is now strict, so that this time, G2 does not contain (3,5). 
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Example 187. Let H; be the graph of y = x”, x < 2. Then 
Ay = {(a,y)iy= a", ws 2}. 


In words, Hj is the set of points (x,y) for which “y = 2? AND x < 2” is true. And so, My 
contains 

¢ (1,1), because 17=1 AND 1<2. 

¢ (2,4), because 2? = 4 AND 2 <2. 


In contrast, H, does not contain (3,9), because 3 ¢ 2. 











Example 188. Let H» be the graph of y = x”, x <2. Then 


HS (Gaye yan 2 <2) 


The set Ho is exactly the same as H, but with one difference—the constraint (inequality) 
is now strict, so that this time, Hj does not contain (2,4). 
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1 
Example 189. Let I, be the graph of 77+ y? = 1, x > na Then 
1 
h={(o,y)ia +P=1, o>-3h. 


1 
In words, J, is the set of points (x,y) for which “x? + y? = 1 AND x > a is true. And 


so, J; contains 


1,0), because 17+ 0? = 1 AND 1 >-0.5. 
2 2 
oO), be cause (= + (=) =1 AND 


2 Z 


(2 ys | 
-(- é because (- 5) + (| =1 AND a > aS 
Cay 


Q 2 
; , because (- 5) + _v3 =1 AND ce > eae 
2 v4 2 2 
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1 
Example 190. Let Jz be the graph of 77+ y? = 1, x > = Then 


il 
p= {(c,y) i? +P =1, o>-3}. 


The set Jy is exactly the same as J, but with one difference—the constraint (inequality) 


i ¥3 1 fe 
is now strict, so that this time, J» does not contain (-5 3) or (-5 -¥). 


\ 
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More generally, a graph can also be of multiple equations with multiple constraints: 
Example 191. Let J be the graph of 


for x < 0, 


; for x > 0. 


In words, J is the set of points for which either “y = -r AND zx < 0” or “y = 2? AND 
x > 0” is true. 


J is the graph of 


ior 2 = 0 


tor 2 > 0. 








We can actually write down J using set-builder notation and the union operator: 


T= NG) = 0) yea ee 0, 


But this is cumbersome and difficult to read. And so, we’ll usually simply specify J as 
was done above. 
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Example 192. Let K be the graph of 


for #2, 


for a2. 


In words, K is the set of points for which either “y = a2 AND x ¢ 2” or “y=0 AND a = 2” 
is true. 


K is the graph of 


Le Oey 22 


y= QO, - for &= 2, 








Exercise 76. Graph each of the following. (Answers on p. 1763.) 


(a) y=e", -l<a<2. 
(b) y=3r4+2, -1l<x<2. 
(@) yor i=l ee 2. 


Exercise 77. Graph each of the following. (Answer on p. 1764.) 


x+1, for x < 0, 
(a) y= 
xl, tor >). 


x+1, for % < 0, 
(b) y= 
x-1, for @ >), 
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7.7. Intercepts and Roots 


In secondary school, you learnt about x-intercepts, y-intercepts, and solutions (or 
roots). We now give their formal definitions: 


Definition 40. Let G be a graph. 


¢ If the point (0,6) is in G, then we call (0,0) a vertical or y-intercept of G. 
¢ If the point (a,0) is in G, then we call (a,0) a horizontal or x-intercept of G. 


If, moreover, G is the graph of an equation, then we also call a a solution or root of 
that equation. 


Example 193. The graph of the equation y = x + 2 has horizontal or x-intercept (-2,0) 
and vertical or y-intercept (0,2). 


Or more simply, “The equation y = ++2 has horizontal or x-intercept (-2,0) and vertical 
or y-intercept (0, 2).” 


We also say that —2 is a solution (or root) of the equation y = x + 2. 
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Example 194. Consider the equation y = 27. The point (0,0) is both its (only) x- 
intercept and its (only) y-intercept. Also, the equation has solution (or root) 0. 


Y 











Example 195. The equation xz? + y? = 1 has two z-intercepts (-1,0) and (1,0), two 
y-intercepts (0,-1) and (0,1), and two solutions (or roots) -1 and 1. 


zN 


SiG ay 
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Example 196. The equation y = x” - 1 has two z-intercepts (-1,0) and (1,0), 
y-intercept —1, and two solutions (or roots) -1 and 1. 
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8. Lines 


You already know what a line is, informally and intuitively. Here’s a formal definition:'”° 


Definition 41. A line is the graph of any equation 


ax + by+c=0, 





where a,b,c € R and at least one of a or b is non-zero. 
You may find this definition a little puzzling—didn’t we always simply write lines as 
y=dxt+e? 


It turns out that if b #0, then ax +by+c=0 and y = dz +e are equivalent, as we now prove: 


a Cc 
ax +by+c=0 —> by = -ax-c —> y= -7u 7. 
—— “WY 

d e 


Writing a line as y=dx+e has two advantages—it immediately tells us that its 
gradient”! is d and y-intercept is (0,e). 


But writing a line as y = dz +e also has one big disadvantage—it can’t describe the case 
where b= 0, i.e. vertical lines: 


Example 197. The line x + y+1=0 can also be written as y = —x - 1. 


In contrast, the vertical line x - 1 = 0 cannot be written in the form y = dz +e. 


The line x +y+1=0 can 


also be written as y = —a - 1. The vertical line 


x-1=0 cannot be 
written in the 
form y =dxr+e. 











Here’s what we’ll do in this textbook: If we know for sure that a line isn’t vertical, then 
we'll write it in the form y = dz + e, because of the two advantages. Otherwise, we'll write 
it as ax+by+c=0. 





201 efinition 41 covers only lines in 2D space. In Part IV (Vectors), we'll learn a more general definition 
(Definition 143) that covers also lines in higher-dimensional spaces and supersedes Definition 41. 
21We’ll formally define the gradient of a line in Ch. 8.3. 
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8.1. Horizontal, Vertical, and Oblique Lines 


Definition 42. A line is 


(a) Horizontal if any two points (in that line) have the same y-coordinate; 


(b) Vertical if any two points have the same z-coordinate; and 


(c) Oblique (or slanted) if it is neither horizontal nor vertical. 


Example 198. The line y = -1 is horizontal, the line x = 2 is vertical, and the line 
y = x+1 is oblique. 








Fact 25. Suppose ax + by+c=0 describes a line. Then 


(a) The line is horizontal —> a=0. 
(b) The line is vertical <— > b=0. 








Proof. See p. 1557 in the Appendices. 











Example 199. The line y = —-1 is horizontal because the coefficient on x is zero. 


The line x = 2 is vertical because the coefficient on y is zero. 





The line y = x + 1 is oblique because the coefficients on x and y are both non-zero. 
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8.2. Another Equation of the Line 


Fact 26. The unique line that contains both of the distinct points (21, y1) and (a2, y2) is 





(2 - #1) (y—1) = (2-1) (2 - 21). 





Proof. See p. 1558 in the Appendices. 











Example 200. The line containing the points (1,2) and (-1,3) is 
(-1-1) (y-2) = (8-2) (e-1) Eee 


A 








The line containing the points (2,0) and (4,5) is 


(4-2) (y-0) = (5-0) (4-2) 2y = 5a - 10 





If x; #2 and yj; # yo, then we can rewrite the equation in Fact 26 as 


Y-Y * LX 
Y2-Y1 ta-VXy 





This latter equation is perhaps more familiar and easier to remember. But of course, it’s 
illegal if x, = x2 or y, = y2, which is why we sometimes prefer the equation in Fact 26. 
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We reuse the last example: 


Example 201. The line containing the points (1,2) and (-1,3) is 


1 il 5 
y-2=-5(r-1) Dee 





Exercise 79. Find the line that contains each pair of points. 


(a) (4,5) and (7,9) (b) (1,2) and (-1,-3) (Answer on p. 1765.) 





128, Contents www.EconsPhDTutor.com 


8.3. The Gradient of a Line 


Informally, the gradient (or slope) of a line is 
e “Rise over Run”; 


e Ay/Azx or “Change in y over change in x” (A is the upper-case Greek letter delta); 
e The answer to this question: 


“If we move 1 unit to the right while remaining along the line, by how many units will 
we move up?” 


Example 202. The gradient of the line y = 27 +1 is 


Ay 2 


Ac 1 
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Example 203. The gradient of the line y = —x - 1 is 


My 1. 1 
ae i 











In general, to find a line’s gradient, pick two distinct points on the line (71, y,) and (22, yo), 
then compute 


Ri A - 
Gradient = aye a 
Run Ag 2-21 


Figure to be 


inserted here. 





One caveat is that if the line is vertical, then 7, = x2 so that our last expression has 
denominator 0 and is thus undefined. And so, if a line is vertical, we’ll simply say that its 
gradient is undefined. 
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Let’s jot the above down as a formal definition: 


Definition 43. Suppose a non-vertical line contains the distinct points (21,y,) and 
(x2, y2). Then its gradient is the number defined as 


Yx— Ui 
LQ — LY 


If a line is vertical, then its gradient is simply left undefined. 


Fact 27. Suppose ax + by+c=0 is a non-vertical line. Then its gradient is —a/b. 





Proof. Since the line ax + by + c = 0 is non-vertical, 6 # 0 (Fact 25). 


Case 1. If a = 0, then the line contains the points (0. 5) and (1 5), so that by Definition 
43, its gradient is 


| 
S10 
| 
——™"~ 
| 
S10 
NY 
g 


Case 2. If a # 0, then the line contains the points (0.-£) and (1 -“**), so that by 
Definition 43, its gradient is 














Exercise 80. XXX (Answer on p. 131.) 
A80. 
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8.4. Yet Another Equation of the Line 


Fact 28. The line that contains the point (a1,y,) and has gradient m is 





y-yi=m(x-21). 


Proof. Let the line be ax + by +c 20. 
The line’s gradient is m z —a/b. Plugging 2 into + yields y + ; 3 ma)” 


fe C 
Also, the line contains (x1, y:) — plugging this into 3 yields y; + 5 7 MG OF > 4 maz - Yi- 














Plug = into 2 to get y+ mz, -y, = Mz or y-y, = M(x- 21). 


Example 206. The line that 
contains the point (-1,2) and 
has gradient 3 is!” % 


y-2=3[x-(-1)], 
or, y =30 +5. 
The line that contains the 
point (7,5) and has gradient 
—2 is ae 


C= (20), 


oe 





Exercise 81. Find the line with the given point and gradient. (Answer on p. 1765.) 
(a) Point (4,5) and gradient 3 (b) Point (1,2) and gradient -2 








122Since the line’s gradient is defined, b + 0. 
!23Tt looks like Wolfram Alpha understands slope but not gradient. 
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8.5. Parallel and Perpendicular Lines 


Definition 44 (informal). Two lines are parallel if their gradients are equal. 
If two lines / and m are parallel, we’ll write / || m; if not, we’ll write J | m. 


Definition 45 (informal). Two lines are perpendicular if 


(a) Their gradients are negative reciprocals of each other; or 


(b) One line is vertical while the other is horizontal. 


If two lines / and m are perpendicular, then we will also write | 1 m. 





If two lines / and m are perpendicular, we’ll write | 1 m; if they aren’t, we’ll write 1 [ m. 


In Part IV (Vectors), we’ll give Definition 150, which is a more general definition of when 
two lines are parallel or perpendicular and which will supersede the above two definitions. 


Example 207. XXX 


Example 208. XXX 


Exercise 82. XXX (Answer on p. 133.) 
A82. 


133, Contents www.EconsPhDTutor.com 


8.6. Lines vs Line Segments vs Rays 


Example 209. Let A and B be points. 


The line AB contains both A and B. It has infinite 
length, extending “forever” in both directions. 


B 
A age In contrast, the line segment AB has finite length—it contains 


A, B, and every point between, but no other point. 


With lines and line segments, the order in which we write A and B doesn’t matter. The 
line AB is the same as the line BA. And the line segment AB is the same as the line 
segment BA. But with rays, the order in which we write A and B does matter. 


The ray AB is the “half-infinite line” that 
starts at A, passes through B, and also con- B 
tains every point “beyond” B. A 


Btn contrast, the ray BA is the “half-infinite line” that starts at 
B, passes through A, and also contains every point “beyond” A. 


A 


The rays AB and BA are distinct. Both have infinite length. 


It makes no sense to speak of the length of a line or a ray (because these have infinite 
length). We can however speak of a line segment’s (finite) length.'4 








24¥For the formal definitions of a line, a line segment, and a ray, see Definition 273 in the Appendices. 

125See Item No. 2-8.4. The 40-something-page PDF costs a mind-blowing 158 Swiss Francs or about 
S$214 at the ISO store. As always, you may or may not be able to find free versions of this document 
elsewhere on the interwebz. 

!26Tndeed, they don’t seem to be very careful about distinguishing between lines and line segments. 
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9. Distance 


9.1. The Distance between Two Points on the Real Number Line 


On the real number line, the distance between two points (or real numbers) is simply the 
magnitude of their difference. Formally, 


Definition 46. Let A and B be two real numbers. The distance between A and B, 
denoted |AB]|, is the number defined by 


|AB| =|B- Al. 
Example 210. The distance between A = 3 and B= 5 is 
|AB| = |B - A] = |5 - 3] = [2] = 2. 
Example 211. The distance between C' = 7 and D = -2 is 


ICD| = |D - C| = |-2-7| =|-9| =9. 





Of course, the distance between A and B is the same as the distance between B and A: 


|AB| = |B - A|=|A- B] =|BA. 


Exercise 83. Let A=5, B=-7, and C=1. What are |AB|, |AC|, and |BC|?(Answer on 


p. 1766.) 
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9.2. The Distance between Two Points on the Cartesian Plane 


Definition 47. In a right triangle, we call the side facing the right angle the triangle’s 





hypotenuse and the other two sides its legs. 


Let A = (a1, a2) and B = (b,, b2) be two points on the cartesian plane. How should we define 
the distance between A and B? 


To motivate this definition, let’s review Pythagoras’ Theorem: 


Theorem 2. (Pythagoras’ Theorem) Suppose a right triangle has legs of lengths a 
and b, and hypotenuse of length c. Then 


Ge bbe ac 





Proof. Construct the triangle ABC, where the vertices A, B, and C are opposite the sides 
of lengths a, b, and c, respectively. 


Let D be the point on AB that is the base of the perpendicular from the point C: 


B 





—d 
The triangles ABC’ and C'BD are similar. Hence, w= or ate (c-d). 
C a 


b d 
The triangles ABC and ACD are similar. Hence, - = b or b? 2 ed. 
C 





Now, = + 2 yields a2 + b? =c(c-d)+cd=c(c-d+d) =. 











So, if we draw A = (a,,a2) and B = (b,,b2) as two vertices of a right triangle, then by 


Pythagoras’ Theorem, the length of the hypotenuse AB should simply be \/ (a; - by + (a2 - by)”: 


A = (a1, a2) 






(ay = by) + (ag = by)” 


ag — bg 


B 


(b1, be) ay, — by Ge (a1, b2) 
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The foregoing discussion suggests this formal definition of distance: 


Definition 48. Let A = (a),a2) and B = (bi, b2) be points in the cartesian plane. The 
distance between A and B, denoted |AB], is the number defined by 


ABS) (al =0) Hana bn). 


Also, we define the length of the line segment AB to be |AB|. 


Example 212. The distance between A = (3,6) and B = (5,-2) is 


|AB] = \/ (3-5)? + [6 - (-2)]” = V2? + 8? = V68. 


Example 213. The distance between C = (1,7) and D = (0,-2) is 


ICD] =\/ (1-0) + [7 - (-2)]? = V12 + 92 = V82. 


Of course, the distance between A and B is the same as the distance between B and A: 





|AB| = (a; —b1)” + (a2 — be)’ = (by — a1)" + (bo — a2 * = |BAl. 


(We say that the distance or length operator |-| is commutative, just as addition and 
multiplication are: a+ b=b+a and ab = ba.) 


Exercise 84. Let A = (5,-1), B = (-7,0), and C = (1,2). What are |AB|, |AC|, and 
IBC? (Answer on p. 1766.) 
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9.3. Closeness (or Nearness) 


Definition 49. Let A, B, and C be points. 

If |AB| <|AC], then we say that B is closer (or nearer) to A than C. 
If |AB| =|AC], then we say that B is as close (or near) to A as C. 
If |AB| > |AC|, then we say that B is further from A than C. 


Let S be a set of points. We say that D ¢S is the point in S that’s closest to A if 
|AD| <|AE| for every point Fe S. 


Example 214. Let A = (3,6), B = (5,-2), and C = (1,7). We have 


|AB| = \/ (3-5)? + [6 - (-2)]° = V2? + 8? = 68, 
|AC| = (3-1)? + (6-7) = V2? + 12 = V5, 
IBC| =\/ (5-1)° + (2-7) = V42 4 92 = V117. 


Since |AC| < |AB|, C is closer to A than B. Equivalently, B is further from A than C. 
Since |AB| = |BA| < |BC|, A is closer to B than C. Equivalently, C is further from B 


Exercise 85. Let A =(5,-1), B= (-7,0), and C = (1,2). Fill in the blanks: 


(a) Ais B than C. 
(b) C is A than B. (Answer on p. 1766.) 


Exercise 86. Find the point on the line x = 3 that is closest to the point (5,6). (Answer 
on p. 1766.) 
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10. Circles 


The unit circle centred on the origin (the word unit means the circle has radius 1) is 
graphed below. 


This is the set of points (x,y) whose distance from the origin (0,0) is 1, i-e. 


V (a - 0)? + (y-0)? = 1. 


Or equivalently and more elegantly, 





That is, the unit circle centred on the origin 7s this set: 


G=i@aen ty = 1}. 


Figure to be 
inserted here. 


2 
Example 216. The point ( ] is on the unit circle because it satisfies the 


equation x? + y” = 1: 





A.(2 


Example 217. The point (0.1,-v0.99) is on the unit circle because it satisfies the 


equation x? + y” = 1: 


0.12 4 (- v0.99) - 0.01 +0.99 = 1. 
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Next, the circle of radius r centred on (a,b) is graphed below. 


This is the set of points (x,y) whose distance from the point (a,b) is r, i.e. 
(x-a)’ +(y-b) =r. 
Or equivalently, 
(x- a)’ +(y-b) =r. 


The above discussion motivates this definition: 


Definition 51. Let (a,b) be a point in the cartesian plane and r > 0. The circle of radius 
r centred on (a,b) is the graph of 


(z-a) +(y—b) =r. 


The circle’s radius is the number r, its centre is the point (a,b), and its diameter that is 
the number 2r. 





Equivalently, the circle of radius r centred on (a,b) is this set: 


G ={(x,y):(a@-a)’ +(y-b) =r}. 


Figure to be 


inserted here. 





Example 218. XXX 
Example 219. XXX 


Example 220. XXX 
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10.1. Graphing Circles on the TI84 


Example 221. Graph the equation 2? + y? = 1. 

Unfortunately, you cannot directly input this equation, because the piece of junk that is 
the TI84 requires that you enter equations with y on the left hand side. And so here, we'll 
have to tell the TI84 to graph two separate equations: y = V1- x? and y=-V 1-2”. 


1. Press to turn on your calculator. 
2. Press to bring up the Y= editor. 


Most buttons on the TI84 have three different functions. Simply pressing a button ex- 
ecutes the function that’s printed on the button itself. Pressing the button and 
then a button executes the function that’s printed in blue above the button. And pressing 
the and then a button executes the function that’s printed in green above the 
button. 


3. Press the button and then the button to execute the \/ function and enter 
eed, press (1) @ eg atean 8. Altogether you’ve entered V1 — 2?. 


Warning: Confusingly, the TI84 has two different minus buttons: @ and ((-)}. The @ is 
used as the minus operation in the middle of an expression, as was just done. In contrast, 
the |(-)| is used to denote negative numbers at the beginning of an expression, as we’ll do 
in Step 5 (below). Unfortunately, you really have to key in the correct button, or else 
you'll get a frustrating error message. 


4. Now press UNNI and the blinking cursor will move down, to the right of “Y, =”. 


Step 1. Step 2. Step 3. Step 4. 


(Example continues on the next page ...) 





142, Contents www.EconsPhDTutor.com 


(Example continues on the next page ...) 


We'll now enter the second equation. 


5. Press the button. Now repeat what we did in step 3 above: Press the blue 
button and then \/ (which corresponds to the button) to enter “SEP. Next press 
@ GSR is to enter “1 — X°”. Altogether you will have entered -V1 - 2?. 


6. Now press (VN) and the calculator will graph both y = V1 - 2? and y =-V 1-2”. 
Notice the graphs are very small. To zoom in, 


7. Press the ZOOM) button to bring up a menu of ZOOM options. 


8. Press (2) to select the Zoom In option. Nothing seems to happen. But now press 
NEVE and the TI84 will zoom in a little for you. 


We expected to see a perfect circle—instead, we get an ellipse. Hm, what’s going on? 
The reason is that by default, the x- and y- axes are scaled differently. To set them to 
the same scale, 


9. Press the WOON) button again to bring up the ZOOM menu of options. Press (5) to 
select the ZSquare option. The T184 will adjust the x- and y- axes so that they have 
the same scale and thus give us a perfect circle. 


Step 5. Step 6. Step 7. 


P.S. An alternative to Step 5 is to enter “—Y,” instead of “-V1- X?”. To do so, replace 
Step 5 with these instructions: First press |(-)] to enter the minus sign, as was done in 
Step 5. Next press QWAURS to bring up the VARS menu. Then press })] to go to the 


Y-VARS menu. Now press (3Q08359 to select “1: Function...”. Press (MBE again to 
select “1: Y,”. Altogether, we will have entered “Yj = -Y,”. Now go to Step 6. 


Exercise 88. Graph (x -3)° + (y+2)° on your TI84. (Answer on p. 143.) 
A88. TBD. 
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10.2. The Number 7 (optional) 


Theorem 3. The ratio of a circle’s area to the square of its radius is constant. 


Proof. Omitted. See e.g. Euclid’s Elements (c. 300 BC, XII.2—Joyce, 1997).!?" 


Definition 52. The circumference of a circle is its length. 


Remark 34. In writing down the above definition, we’re actually cheating a little: So far 
we’ve only defined what the length of a line segment is; we have not defined the “length” 
of any curve (e.g. a circle). 














It turns out that the question of the “length” of a curve is a little harder than one might 


think. In particular, it is beyond the scope of H2 Maths (and is dealt with in H2 Further 
Maths, but only cursorily and as a mindless formula that students are to mug and apply). 


Nonetheless, just to screw students over, the question of length abruptly appeared as a 
curveball question in the 2018 A-Level H2 Maths exam (see Exercise 673). 





Theorem 4. The area of a circle with radius r and circumference C is Cr/2. 


Proof. Omitted. This result was first proven by Archimedes (c. 250 BC, Measurement of 
a Circle, Proposition 1—see e.g. Heath, 1897). 


Corollary 1. The ratio of a circle’s circumference to its radius is constant. 


Proof. Suppose two circles have circumferences C and C and radii r; and ro. 
Then by Theorem 4, their areas are Cyr,/2 and Cor2/2. 
And by Theorem 3, 














On OI gO 


2 2 











We’ve just shown that the ratio of a circle’s circumference to its radius is constant. 





Given Corollary 1, we can define the number 7t: 


Definition 53. The real number 7 is the ratio of a circle’s circumference to its diameter. 


Remark 35. The above is the definition of 7 that should be familiar to you from primary 
school and will be good enough for us. 


Note though that more advanced mathematical texts will usually adopt some other defi- 
nition of 7—for example, after defining the sine function sin, they might define zt to be 
the smallest positive number such that sin7t = 0. 





Fact 29. A circle with radius r has circumference 2nr and area Ttr?. 


7But this result was probably proven before Euclid. 
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Proof. Let C' be the circle’s circumference. 
By Definition 53, 7 =C/(2r). Rearranging, C' = 27rr. 
By Theorem 4, the circle’s area is Cr/2 = (27r) r/2 = mtr’. 














Fun Fact 


There’s a movement to replace 7 = 3.14... with T= 6.28... as the “standard” constant, 
where T = 6.28... is the ratio of a circle’s circumference to its radius. See “The Tau 
Manifesto” or Wall Street Journal (2020-03-13). 
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11. Tangent Lines, Gradients, and Stationary Points 


11.1. Tangent Lines and the Gradient of a Graph at a Point 


Example 222. Graphed below is the equation y = x? + 1. 


The red line is tangent to the graph at the point (—2,5). It can be shown that this line’s 
gradient is —4. So, the graph’s gradient at this point is —4. 


Gradient 0 








The blue line is tangent to the graph at the point (0,1). It can be shown that this line’s 
gradient is 0. So, the graph’s gradient at this point is 0. 


The green line is tangent to the graph at the point (3,10). It can be shown that this 
line’s gradient is 6. So, the graph’s gradient at this point is 6. 





Here’s the informal definition of tangent lines you probably learnt in secondary school: 


Definition 54 (informal). A tangent line (to a graph at a point) is a line that “just” 
touches that graph at that point. 


Definition 55 (informal). The gradient (of a graph at a point) is the gradient of the 
tangent line (to that graph at that point). 


Tangent comes from the Latin for touching. Later on in Part V (Calculus), we’ll define tan- 


gent lines more formally and precisely (see Definition XXX). But for now, these definitions 
will be good enough. 
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Example 223. Consider the graph of y = 7° + 1. 


The red line is tangent to the graph at the point (—2,-7). It can be shown that this 
line’s gradient is 12. So, the graph’s gradient at this point is 12. 


IX 
Y 








The blue line is tangent to the graph at the point (0,1). It can be shown that this line’s 
gradient is 0. So, the graph’s gradient at this point is 0. 


The green line is tangent to the graph at the point (3,28). It can be shown that this 
line’s gradient is 27. So, the graph’s gradient at this point is 27. 
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11.2. Stationary Points 





Equivalently, a point of a graph is called a stationary point if the tangent line (to that 
graph at that point) is horizontal. 


Example 227. Graphed below is the equation y = x? + 1. 


The point (0,1) is a stationary point of the graph because the gradient of the graph 
at that point is zero (or equivalently, the tangent line to the graph at that point is 
horizontal). 


Gradient 0 











The points (—2,5) and (3,10) are not stationary points of the graph. 


Example 228. XXX 


Example 229. XXX 


Example 230. XXX 
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12. Maximum, Minimum, and Turning Points 


In secondary school, you learnt about maximum points and minimum points. 


We’ll now relearn these, but in a little more depth. 


12.1. Maximum Points 


In particular, for maximum points, we’ll learn to make these two distinctions: 


¢ Global vs local maximum points; 

¢ Strict vs non-strict maximum points. 

Informally, a global maximum (point) of a graph is a point that’s at least as high as 
any other point (that’s also in that graph).'? 


And a strict global maximum (point) of a graph is a point that’s strictly higher than 
every other point (that’s also in that graph). 


Example 231. Consider y = —xz?. The point D = (0,0) is a global maximum (of the 
eraph of y = —x”), because it is at least as high as any other point (in that graph). 











Indeed, it is also a strict global maximum, because it is strictly higher than every 
other point. 








29H ere, we'll merely give the informal definitions of the eight types of extrema introduced. For their 
formal definitions, see Definition 277 (Appendices). 


149, Contents www.EconsPhDTutor.com 


Example 232. Consider y = sinx and these three points: 


A-(-1), B-(2,1), and C =). 
2: 2 D 


Each of A, B, and C is a global maximum (of the graph of y = sin x), because each is at 
least as high as any other point (in that graph). 








Indeed, y = sinz has infinitely many global maxima: For every integer k, the following 
point is a global maximum: 
il 
((2« + 5) i} : 
2 


Note though that y = sinz has no strict global maximum, because no point is strictly 
higher than every other point. 





Obviously, if a point is strictly higher than any other point, then it must also be at least 
as high as any other point. And so, 


Fact 30. Every strict global maximum is also a global maximum. 


However, the converse is not true. That is, a global maximum need not be strict. (In 
the above example, each of A, B, and C is a global maximum, but none is a strict global 
maximum. Indeed, the graph of y = sin has no strict global maximum.) 


We next introduce the concept of a local maximum: 
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Informally, a local maximum (point) of a graph is a point that’s at least as high as any 
“nearby” point (that’s also in that graph). 


Example 233. Consider the graph of y = 62° — 15a -— 10a? + 302”. 


Neither FE = (-1,19) nor F = (1,11) is a global maximum, because neither is at least as 
high as any other point. 


E = (-1,19) 





Yoon loa — 107 + 30 





However, each of EF and F is a local maximum, because each is at least as high as any 
“nearby” point. 





Informally, a strict local maximum (point) of a graph is a point that’s strictly higher 
than any “nearby” point (that’s also in that graph). 


Example 234. In the last example, the points & and F are also strict local maxima, 


because each is strictly higher than any “nearby” point. 





Again and obviously, if a point is strictly higher than any “nearby” point, then it must also 
be at least as high as any “nearby” point. And so in general, 


Fact 31. Every strict local maximum is also a local maximum. 


Again, the converse is not true. That is, a local maximum need not be strict: 
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Example 235. Consider y = 3. The point G = (1,3) is a local maximum, because it’s at 
least as high as any “nearby” point. However, it is not a strict local maximum, because 
it is not strictly higher than any “nearby” point. 


Actually, every point in y = 3 is a local maximum (though not a strict local maximum)! 


Indeed, every point in y = 3 is a global maximum (though not a strict global maximum)! 


(The last two sentences may seem puzzling. If so, take a moment to convince yourself 
that they’re true.) 


y=3 
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12.2. Minimum Points 


Similarly, for minimum points, we can make the same two distinctions: 


¢ Global vs local minimum points; 

e Strict vs non-strict minimum points. 

Informally, a global minimum (point) of a graph is a point that’s at least as low as any 
other point (that’s also in that graph). 

And a strict global minimum (point) of a graph is a point that’s (strictly) lower than 
any other point (that’s also in that graph). 


2 


Example 236. Consider y = 7°. The point K = (0,0) is a global minimum (of the 


eraph of y = x”), because it is at least as low as any other point (in that graph). 


nN 
Y 








K = (0,0) 





Indeed, it is also a strict global minimum, because it is (strictly) lower than every 
other point. 
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Example 237. Consider y = sinx and these three points: 


571 7 on 
cle) ee) eee. (2a) 


Each of H, J, and J is a global minimum, because each is at least as low as every other 
point (in the graph of y = sin). 


y= sine 








Indeed, y = sinx has infinitely many global minima: For every integer k, the following 
point is a global maximum: 
1 
2k-—]7,1). 
((2-5) =) 


Note though that y = sinz has no strict global minimum, because no point is strictly 
lower than every other point. 





Obviously, if a point is strictly lower than any other point, then it must also be at least as 
low as any other point. And so, 


Fact 32. Every strict global minimum is also a global minimum. 


However, the converse is not true. That is, a global maximum need not be strict. (In 
the above example, each of H, J, and J is a global minimum, but none is a strict global 
minimum. Indeed, the graph of y = sinz has no strict global minimum.) 
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Informally, a local minimum (point) of a graph is a point that’s at least as low as any 
“nearby” point (that’s also in that graph). 


Example 238. In the graph of y = 6x° — 15x24 — 102° + 30x”, neither L = (0,0) nor 
M = (2,-8) is a global minimum, because neither is at least as low as every other point 
(in that graph). However, each is a local minimum, because each is at least as low as any 
“nearby” point. 


Or lon —l0n On 











Informally, a strict local minimum (point) of a graph is a point that’s strictly lower 
than any “nearby” point (that’s also in that graph). 


Example 239. In the last example, the points L and M are also strict local minima, 





because each is strictly lower than any “nearby” point. 


Again and obviously, if a point is strictly lower than any “nearby” point, then it must also 
be at least as low as any “nearby” point. And so, 


Fact 33. Every strict local minimum is also a local minimum. 


However, the converse is not true. That is, a local minimum need not be strict: 
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Example 240. Consider y = 3. The point G = (1,3) is a local minimum, because it’s at 
least as low as any “nearby” point. However, it is not a strict local minimum, because 
it is not strictly lower than any “nearby” point. 


Actually, every point in y = 3 is a local minimum (though not a strict local minimum)! 
(This statement may initially seem puzzling, but you should take a moment to convince 
yourself that it is true.) 


y=3 





Indeed, every point in y = 3 is a global minimum (though not a strict global minimum)! 
(Ditto.) 
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12.3. Extrema 


It will be convenient to have a word for all maximum and minimum points. 


Definition 57. An extremum (plural: extrema) is any maximum or minimum point. 





A strict extremum is any strict maximum or minimum point. 


So, so far in this chapter, we’ve learnt about eight types of extrema, now summarised: 


(a) A global maximum is at least as high as any other point. 
(b) The strict global maximum is higher than any other point. 
(c) <A local maximum is at least as high as any “nearby” point. 
(d) A strict local maximum is higher than —_ any “nearby” point. 
(e) A global minimum is at least as low as any other point. 
(f) The strict global minimum is lower than —_ any other point. 
(g) A local minimum is at least as low as any “nearby” point. 
(h) A strict local minimum is lower than any “nearby” point. 


Exercise 89. Explain whether each statement is true. (Answer on p. 1769.) 


(a) A global maximum must also be a strict local maximum. 
(b) A global maximum must also be a local maximum. 


(c) A strict global maximum must also be a global maximum, strict local maximum, and 
local maximum. 





(d) A global maximum cannot be a local minimum. 
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12.4. Turning Points 


Informally, a turning point is any point at which a graph “turns”. Formally, 


Example 241. The table below says that A = (-8,-81) in the given graph is a local 
maximum, a global minimum, and a local minimum, but is not any of the other five 
types of extrema and is not a turning point. (Verify that the entire table is correct.) 


Ay 











Global maximum 





Strict global maximum 





Local maximum 





Strict local maximum 





Global minimum 





Strict global minimum 





Local minimum 





Strict local minimum 
Turning point v "A 
































Besides the eight points A-H, does this graph have any other extrema?!” 





130Ves. In fact, there are infinitely many other extrema: 





For every p < -3, the point (p, -81) is—like A—a global minimum, local maximum, and local minimum. 
And for every q > 5, the point (q,125) is—like H—a global maximum, local maximum, and local 
minimum. 
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Example 242. The graph of y = x (left) has no extrema. 











In contrast, the graph of y = x with the constraint x > -1 (right) has one extremum, 
namely J = (-1,-1), which is a global minimum, strict global minimum, local minimum, 
and strict local minimum. Observe though that J is not a turning point. 


Example 243. Consider the graph G = {A, B,C}. 


Informally, the points A, B, and C’ aren’t “close” to each 


other. (More formally, they are isolated points.)'*! Be) 


@ 
And so, interestingly, each of A, B, and C is a strict 


local maximum of G. This is because each of A, B, and 
Chas no “nearby” point. It is thus trivially or vacuously 
true that each of A, B, and C is strictly higher than any 
“nearby” point. 








By the same token, it is likewise trivially or vacuously true that 
each of A, B, and C is a strict local minimum of G. 


Note though that there’s only one strict global maximum or global maximum, namely A. 
Likewise, there is only one strict global minimum or global minimum, namely C’. 


Observe also that none of A, B, or C is a turning point. 


Exercise 90. Identify each graph’s extrema and turning points. (Answer on p. 1767.) 


(a) y=27+1. (b) y=a27+1, -1 
(CG) ay ="coez- (d)= secu, =1<7< 1 








131Tnformally, a point of a set is isolated if no other point in the same set is “nearby”. For the formal 
definition, see Definition 276 (Appendices). 
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13. Solutions and Solution Sets 


To solve an equation is to find all its solutions (or roots). Equivalently, to solve an 
equation is to find that equation’s solution set. 


Example 244. Solve the equation x-1=0 (xe R). 


Here are three perfectly good (and equivalent) answers to the above problem: 


é “om = 1” 
6 nie. 


« “The equation’s solution set is {1}.” 


Example 245. Solve the equation x +5=8 (~€R). 

Here are three perfectly good (and equivalent) answers to the above problem: 
ae oy 

« “getal 


« “The equation’s solution set is {3}.” 





The above two examples were easy enough to solve. In the next chapter, we'll review the 
solution to the quadratic equation ax? + br +c = 0. For now, here’s a quick example: 


160, Contents www.EconsPhDTutor.com 


Example 246. Consider the equation 2? -1=8 (x€ R). % 
Let’s see how graphs can help us solve this equation. 


Graph y = «7-1 and y=8. We find that these two graphs intersect at x = —3 and z = 3. 








Thus, the equation x? - 1 = 8 (x € R) has two solutions: -3 and 3. Equivalently, its 
solution set is {-3,3} or {+3}. 
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Example of a cubic equation: 


Example 247. Consider the equation x? — 32 = 32 - x? (x € R). % 


Graph y = 2° —- 3x and y = 34-2”. We find that these two graphs intersect at x = —3, 
o= OU and 222, 








Thus, the equation 23-32 = 32-2? (x € R) has three solutions: —3, 0, and 2. Equivalently, 
its solution set is {-3,0, 2}. 


By the way, we won’t be learning to solve the general cubic equation. But we will be 
learning to solve specific instances of the cubic equation that are not too difficult to 
factorise (Ch. 38). 


Also, we are required to know how to use a graphing calculator to find the solutions to 
this and indeed just about any equation. We’ll learn how to do so later. 
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Like the above examples, most equations we’ll encounter will have only finitely many solu- 
tions. But this isn’t always the case: 


Example 248. Consider the equation sin xz = 0 (a € R). % 


It has infinitely many solutions—for every k € Z, k7t is a solution. The equation’s solution 
set is {k7t: ke Z}. 
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We now look at inequalities. To solve an inequality is to find all its solutions (or 
roots). Equivalently, to solve an inequality is to find its solution set. 


Inequalities usually have infinitely many solutions: 


Example 249. Solve x-1 50 (2 eR). 

Three perfectly good (and equivalent) answers to the above problem: 
og Sl” 

e “x €(1,00).” 


1 
¢ “The solution set of > is (1, 00).” 


2 
Example 250. Solve 7+ 5>8 (we R). 


Three perfectly good (and equivalent) answers to the above problem: 


e Lap SB 


4 ‘on fe [3, 00 ).” 


2 
¢ “The solution set of > is [3,00).” 





The above two inequalities were easy enough to solve. In Ch. 41.3, we’ll learn the general 
solution to the quadratic inequality ax? + bx +c >0. For now, here’s a quick example: 
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Example 251. Solve x? -1>8 (x «R). x 


Again, let’s see how graphs can help us solve this inequality. Graph y = 7-1 and y= 8. 


We find that the graph of y = x7-1 is above that of y = 8 in the light-red regions indicated 
below (excluding the dotted red lines). 


Thus, three perfectly good (and equivalent) answers to the above problem are 
ae 

e “x € (—00, -3) U (3, c0) = R\ [-3,3].” 

¢ “The solution set is (—oo, -3) U (3, 0) = R\ [-3,3].” 


Also, we may say in words that every real number except those between —3 and 3 (inclu- 
sive) is a solution. 
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Example 252. Solve x? - 3x > 3a - 2”. at 


Graph y = x° — 3x and y = 34 — x”. We find that the graph of y = x? — 3a is above that of 
y = 3x — x? in the light-red regions (including the solid red lines) indicated below. 


Thus, three perfectly good (and equivalent) answers to the above problem are 
s =26050, 827. 

e “x €[-3,0] U[2, 00).” 

¢ “The solution set is [-3,0] U [2, 00).” 


Also, we may say in words that every real number between —3 and 0 (inclusive) and every 
real number greater than or equal to 2 is a solution. 








Here are the formal definitions of a solution and a solution set: 


Definition 59. Given an equation (or inequality) in one variable, any number that 
satisfies the equation (or inequality) is called a solution (or root) of that equation (or 


inequality). And the set of all such solutions is called the solution set of that equation 
(or inequality). 





For now, we’ll be dealing only with real numbers. And so for now, we’ll be looking only at 
real solutions and real solution sets. 


But in Part IJ (Complex Numbers), we’ll learn also that solutions can include non-real or 
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complex numbers. Here’s a quick preview: 


Example 253. The equation 77+1 = 0 (x € R) has no solution. (We say that its solution 
set is @, the empty set.) But this is only because we’ve specified that x € R. 


In contrast, the equation x? + 1 = 0 (x € C) has two solutions: —i and i. (We say that its 
solution set is {-i,i}.) 
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14. O-Level Review: The Quadratic Equation y = az’ + bx + ¢ 


In this chapter, we’ll review the quadratic equation: 
O=ax*+brt+e (a #0). 


(If a =0, then the equation is not quadratic but linear: 0 = bx +c.) 


Let’s find the solutions (or roots) of this quadratic equation: 


b 
Divide both sides by a # 0: O=27+-24 = 
a a 
b2 b b2 b2 
Add and subtract Tae Ce a era aes S) 


By the way, © is an example of the Plus Zero Trick:'*? It is always perfectly legitimate 
to add zero to any expression. So, given any quantity q (in this case q = b?/4a7), it is also 
always legitimate to add +q- gq =0 to any expression. 








bY RB e¢ 
Complete the square: O=(x4+—]) -—++-. 
2a 4a? a 
Rearrange: (2+ b ) = b ne b? — dac 
ee 2a) 4a2 as 4? 
b b2-4 Vb2 -4 
Take the square root: ct—=s siege Aiea 





2a Aaq2 2a 


Rearrange to get the quadratic formula (i.e. the two solutions or roots of the quadratic 
equation): 


oe \/b? — dac 
7 2a ) 


I 


The quadratic formula is not on the List of Formulae (MF26) and so sadly, you’ll have to 
memorise it. Unfortunately, I’ve never come across a good mnemonic that works (for me). 
Look for one that works for you. (Lemme know if you think you’ve found a good one!) 





'82Tao (2016) calls this the “middle-man trick”. 
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One mnemonic is to sing the quadratic formula to the tune of Pop Goes the Weasel.!%* 
Arranged by a musical genius so that each syllable matches each note: 


Moderato 


=== 


x e-quals to ne-ga-tive b, 


————e er 


Plus or mi-nus square root, 


SS 


b squared mi-nus four a c, 


é SSS 


All! O-ver two a. 






























































Definition 60. The discriminant of the quadratic equation 0 = ax? + bx +c is b? — 4ac. 





On the next page, Fact 34 summarises the key features of the quadratic equation and also 
reviews some of the concepts we’ve gone through in previous chapters. Six examples follow. 





133T checked out about ten versions on YouTube and unfortunately I found all of them to be very annoying 
and cannot recommend them. Maybe I’ll make one—don’t worry, I won’t be the one singing. 
134The symbol A is the upper-case Greek letter delta. 
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By the way, the equation y = az? + bx +c is also a quadratic equation, but in two variables. 


Fact 34. Consider the graph of the quadratic equation y = ax? + bx +c. 
(a) The only y-intercept is (0,c). 
(b) There are two, one, or zero x-intercepts, depending on the sign of b? - dace: 
(i) If b?-4ac>0, then there are two x-intercepts: 
be PA 
2a 


-b+ VP et) («- -b- Vb =~) 








2a 2a 


And as? +be+e=a(e- 


(ii) If b? - 4ac = 0, then there is one x-intercept x = —b/2a (where the graph just 
touches the x-axis). 


2 
And aa? +bn+c-a(a+=-) . 
2a 
(iii) If b’-4ac <0, then there are no x-intercepts. There is also no way to factorise 
the quadratic polynomial ax? + bx + ¢ (unless we use complex numbers). 
(c) It is symmetric in the vertical line x = —b/2a. 


(d) The only turning point is (-—b/2a,-b?/4a +c), which is a strict global (i) minimum if 
a>0O; or (ii) maximum if a <0. 














Proof. See p. 1568 in the Appendices. 





Corollary 2 (informal). Let G be the graph of the quadratic equation y = ax? + ba +c. 


(a) Ifa>0, then G is U-shaped. 
(b) Ifa<0, then G is n-shaped. 





“Proof” “Clearly”, the graph is either U-shaped or n-shaped.'®? 


(a) By Fact 34(d)(i), if a> 0, then the graph has a strict global minimum. So, the graph 
must be U-shaped. 


(b) By Fact 34(d)(ii), if a <0, then the graph has a strict global maximum. So, the graph 
must be n-shaped. 


135T¢ is this bit here that makes the result informal. We have to more precisely define the terms “U-shaped” 
and “n-shaped”. One simple possibility is to replace these two terms with the terms convex and concave. 
However, we’ll only be introducing these latter two terms in Part V (Calculus). So, for now, we’ll just 
stick with the informal “U-shaped” and “n-shaped”. 
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We can distinguish between six cases of the quadratic equation, depending on whether of 
a 20 and whether b? - 4ac : 0. 


Here are six examples to illustrate the six cases: 


Example 254. Consider the quadratic equation y = x? + 3a + 1. 





1. The only y-intercept is D = (0,1). 


2. The equation 0 = x? + 32+ 1 has two real solutions or roots: 


jp in/ p= Age  =38 2/2 4 1, Be 
re 


2a 2x1 Ja) 


-3-VJ5 -3 oS 
So, the two x-intercepts are A = ( a) and (= ( = 0) 


Z 2 


2.28) 


And we may write 27+3r+1= (« = 





3 
3. The graph is symmetric in the line = -— = ——. (We'll learn more about 


symmetry in Ch. 16.) 


b b? 3 a Goals 
4. Th ] int is B = |-—,c- —] =|--=,1 - ——] =| -=, -- }. 
e only turning point is ( a =) ( 5 ial ( 5 *) 
Since a=1>0, B is the strict global minimum and the graph is U-shaped. 
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Example 255. Consider the quadratic equation y = x? + 2x + 1. 





. There is one y-intercept: B = (0,1). 


. The equation y = x7 + 2x +1 has one real root: 


a OE VE dae _-24 V2P-4x1x1 _-24V0_ | 
7 2a : 2x1 [a ae oN 


And so, there is one x-intercept, where the graph just touches the x-axis: 
A = (-1,0). 
We can factorise the quadratic polynomial: 
eer eal) =G 4). 


b 2 
. There is one (vertical) line of symmetry: x = -— =- =-1. 
2a 2x1 


. In general, if a quadratic equation has only one real root, then the turning point is 


also the x-intercept: 
b b? ee 
—-—,c-—]=|-1,1-——]=(-1,0). 
2a" a ( rel ow 


. Since the coefficient 1 on x? is positive, the graph is U-shaped, with the turning point 
being the strict global minimum. 
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Example 256. Consider the quadratic equation y = 2? + 7+ 1. 








. There is one y-intercept: B = (0,1). 
. The equation y = x? + z+ 1 has no real roots: 


=) 4%/b? Ane be Vy le eae Hl ey 
ra Sea ee ee ee 


2a 2x1 2 


And so, there are no z-intercepts. We cannot factorise the quadratic polynomial 
(unless we use complex numbers). 
b 1 1 


. There is one (vertical) line of symmetry: + = -— =- =-_. 
2a 2x1 2 





. The one turning point is 


ms 2 
(eB) Gd) 
2a 4a 2 4x1 2 4 


. Since the coefficient 1 on x? is positive, the graph is U-shaped, with the turning point 
being the strict global minimum. 
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Example 257. Consider the quadratic equation y = —x? + 32-1. 


N 
Y 





. There is one y-intercept: A = (0,1). 
. The equation y = —x? + 3x —1 has two real roots: 


a eV 4ac _ -3# 3?-4x(-1)x(-1)_ -34V5_ 34#V5 
: 2a 7 2 x (-1) . 7 





And so, there are two x-intercepts: 


= (Fo 


We can factorise the quadratic polynomial: 
ni? s3r-1=-(z- 


. There is one (vertical) line of symmetry: + = -— =- 


. The one turning point is 


b b? 3 3 ao) 
cole B)-(b-1-nta) 69) 
2a Aa y) 4 x (-1) 24 
5. Since the coefficient —1 on x? is negative, the graph is n-shaped, with the turning point 
being the strict global maximum. 
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Example 258. Consider the quadratic equation y = —7? + 2x - 1. 


IN 
Y 





. There is one y-intercept: A = (0,-1). 


. The equation y = —x? + 2% — 1 has one real root: 


_-b4 Vb? -dac  -24\/2?-4x(-1)x(-1)_ -24V0_ 
ee 2% (1) oe 


ie 
And so, there is one x-intercept, which is also where it just touches the x-axis: 
B= (1,0). 
We can factorise the quadratic polynomial: 
Pe =) 


b 2 
—— —_. 1 
2a 2x (-1) 
. In general, if a quadratic equation has only one real root, then the turning point is 
also the x-intercept: 


B= (-s.e-E)=(11- pay ]- 0.0). 


2 





. There is one (vertical) line of symmetry: x = - 


. Since the coefficient —1 on x* is negative, the graph is n-shaped, with the turning point 
being the strict global maximum. 
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2 


Example 259. Consider the quadratic equation y = -x* + 7-1. 


IN 
Y 


fy > 





. There is one y-intercept: A = (0,-1). 


. The equation y = —x? + x - 1 has no real roots: 


me —bt V2 -dac -14V1?-4x(-1)x(-1)_ -1l4V-3 


2a : 2x (-1) = 


And so, there are no z-intercepts. We cannot factorise the quadratic polynomial 
(unless we use complex numbers). 


j 
. There is one (vertical) line of symmetry: x = ee “Ox (al) 2° 
. The one turning point is 


b b? 1 12 Ly 
Poe) (tn) 4) 
2a Aa 2 4x (-1) 2 A 
. Since the coefficient —1 on x? is negative, the graph is n-shaped, with the turning point 
being the strict global maximum. 
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The last six examples in one figure: 


ee+at+l y 


ag" 4 35 = 


See al VI 
2 2 I 
eon + 1 Ze +27-1 


ea] 
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15. The Distance Between a Point and a Line 





The above definition is general in that G can be any graph (i.e. any set of points). But to 
keep things simple, we’ll look only at cases where G is a line. 


We can find B—the point on / that’s closest to A—using what we’ve learnt about quadratic 
equations: 
Example 260. Let A= (5,1) be a point and the line / be (the graph of) y = 2”. 


Pick® any arbitrary point P < (p,2p) on 1, where pe R. 
By Definition 48, the distance between A and P is 


(p — 5)" + (2p - 1)" 2 5p? — 14p + 26. 


This last surd expression is minimised when the quadratic expression inside is min- 
imised.!3" By Fact 34(d), this occurs when 


Pip Al 
Dp = = = 


Se 
Oa? 2 (5) 


Now, plug p = 1.4 into + to get B= (1.4, 2.8), the point on / that’s closest to A. 
By Definition 61, the distance between A and / is simply the distance between A and B, 


which we get by plugging p = 1.4 into the surd expression for distance in 2. 


5p? —14p+26| = 1/5 (1.4)? - 14x 1.44 26 = V16.2. 


p=1.4 








136Clarification in case you’re not sure what’s going on here: Here we first set the v-coordinate of P to be 
any arbitrary real number p. Then, since P is on J, its y-coordinate must be 2p. 

137 Actually, to justify this assertion, we also need to show that \/- is an increasing function. But we’ll 
gloss over this for now as we haven’t yet introduced the concepts of a function and when a function is 
increasing. 
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Example 261. Let A = (-4,0) be a point and the line | be y = -x/3 +2. 


Pick any arbitrary point P 2 (p,-—p/3 +2) on 1. 


By Definition 48, the distance between A and P is 


\/ (p+ 4)° + (-p/3 + 2)° 2 \/10p2/9 + 20p/3 + 20. 


Again, the above expression is minimised when 





ape 20/3 


~ 2%a” ~~ 2(10/9) 


Now, plug p = —-3 into + to get B = (-3,3), the point on / that’s closest to A. 
Also, plug p = -3 into 2 to get the distance between A and /L 


/10p2/9 + 20p/3 +20} = 1/ 10(-3)* /9 + 20(-3) /3 + 20 = V0. 


p= 





We can use the “quadratic equations" idea illustrated in the above examples to find the 
unique point B on the line / that’s closest to A: 


Proposition 3. The unique point on the line ax + by+c=0 that is closest to the point 
(p,q) is 


a ap+bq+ce ap+bq+c 
ee ete OF” aR 





Proof. The proof is not difficult conceptually, but does involve tedious and messy algebra. 
So, we'll relegate it to the Appendices—see p. 142.6. 














Knowing the point B on the line / that’s closest to A, we can easily find the distance 
between A and / (this is simply the distance between A and B): 


Corollary 3. The distance between a point (p,q) and a line ax + by+c=0 is 


lap + bq + c| 


Jat +b? 





Proof. By Proposition 3, the point on the line that is closest to (p,q) is 


( ap+bq+e “| 
=— Q—_—— — ————— ae ae 
e+e 4 a? +b? 


So, the distance between this point and (p,q) is 
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| -( etary | -( - peer reyy 
oe a? +b? oe a? + b? 




















( pein *ey (au barey ap +bq+c ap lap + bq + c| 
= a———_——— —— i ee a S ————————————_,, 
a? +b? a? +b? Gee a? +b? 
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15.1. Foot of the Perpendicular (from a Point to a Line) 


It turns out that if B is the point on the line / that’s closest to the point A, then I is 
perpendicular to the line AB. 





C 


Moreover, the converse is also true: If / is perpendicular to AB (where B is some point on 
l), then B is the point on / that’s closest to A. 


The next result simply combines these last two assertions: 


Fact 35. Let A be a point that is not on the line 1. Suppose B is a point onl. Then 


B is the point on | that is closest to A —, 11 AB. 





Proof. Here we'll prove only <—. (For a proof of ==, see p. 1564 Appendices.) 

( <— ) Suppose 1 1 AB. Let C # B be any other point on J. Then ABC forms a right 
triangle!** with hypotenuse AC. 

By Pythagoras’ Theorem, |AC] =\/|AB|’ +|BC/’ > |AB|. So, B is closer to A than C. 


Since C' was arbitrarily chosen, we conclude that B is closer to A than any other point on 
i 














It will be convenient to give B a special name in the case where | 1 AB: 


Definition 62. Let A be a point that isn’t on the line /. The foot of the perpendicular 


from A to lis the (unique) point B on / such that AB 1 l. 





Altogether, given a point A not on the line J, 


1. By Proposition 3, there exists a unique point B on the line / that’s closest to A; 
2. Moreover, by Fact 35( == ), B is the foot of the perpendicular from A to 1. 


3. Now, B is also the unique foot of the perpendicular. This is because if there were some 
other point C' # B that were also the foot of the perpendicular from A to 1, then by Fact 
35( <= ), C would also be a point on / that’s closest to A, contradicting Statement 1. 


In summary, 


Corollary 4. Suppose A is a point not on the line l. Then there exists a point B that 


is both (a) the unique point on | that’s closest to A; and (b) the unique point on | such 
Cede tae 








138This proof may be considered slightly informal as it makes use of geometry and triangles. In Ch. 62 
(Part IV, Vectors), we'll prove this result again using the language of vectors. 
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16. Reflection and Symmetry 


16.1. The Reflection of a Point in a Point 


Example 262. Let P = (-2,0) and Q = (0,1) be points. 
Suppose we reflect P in Q to produce the point R. Then what would R be? 
Well, Common Sense suggests that the reflection R should satisfy these two properties: 


(a) |PQ|=|QR (b) # is on the line PQ. 


(Of course, we should also have R # P.) 











How can properties (a) and (b) be satisfied? Common Sense suggests that 


e Since Q, — P, = +2, we must also have R, - Q, = +2. 


e Since @, — Py =+1, we must also have R, - Q, = +1. 


+2 +2 


Le Sa. 
Hence, P=(-2,0) m0 =(0)1) i = (22), 
ee. 


+1 +1 





Intuitively, it seems “obvious” that there should be exactly one point R that satisfies both 
properties (a) and (b). Happily, it turns out that this intuition is correct.’ 


And so, we are allowed to write down this formal definition of a reflection: 





'39See Proposition 26 (Appendices). 
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Definition 63. Let P and Q be distinct points. The reflection of P in Q is the unique 
point R + P that satisfies these two properties: 


(a) |PQ| = |QRI. (b) # is on the line PQ. 





By Common Sense, if P = (a,b) and Q = (c,d), then the reflection of P in Q should be the 
point 


R= (c+(c-a),d+(d-b)) = (2c-a,2d- 5). 
For future reference, let’s jot this down as a formal result: 


Fact 36. Jf P=(a,b) and Q = (c,d) are distinct points, then the reflection of P in Q is 
the point 


R= (2c-a,2d-b). 








Proof. See Exercise 93. 











Rather than mug Fact 36, it’s probably easier to find reflection points by directly using our 
Common-Sense Reasoning: 


+4 +4 
Example 263. The reflection of (1,2) in (5,8) is (9,14). 
eS 


+6 +6 


Exercise 92. What is the reflection of (8,5) in (—2,4)? (Answer on p. 1770.) 


Exercise 93. Prove Fact 36—to do so, simply verify that the point R = (2c- a,2d-b) 
satisfies properties (a) and (b). (Answer on p. 1770.) 
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16.2. The Reflection of a Point in a Line 


Definition 64. The reflection of a point P in a graph is the reflection of P in the point 





on the graph that’s closest to P. 


The above definition is general in that G can be any graph (i.e. any set of points). But to 
keep things simple, we’ll look only at cases where G is a line. 


Example 264. Find the distance between the point P = (—3,2) and the line / given by 
=i, 


Let Q be the point on / that’s closest to P. 
Let’s use what we learnt in Ch. 15 to find Q: 


Let S + (s,2s+3) be an arbitrary point on I. Then the distance between P and S is 


\V (8 +3) + (28 +3-2) = V5s2 + 10s + 10, 


which is minimised at s = -10/(2 x 5) =-1. 


Plug this into = to get Q = (-1,1). (As we learnt in Ch. 15, Q is also called the foot of 
the perpendicular from P to / and / 1 PQ.) 








And now, by Fact 36, the reflection of P in Q is R= (1,0). 
By Definition 64, R is the reflection of P in l. 
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Some simple but useful results: 


Example 265. The reflection of (-2,1) in y = x is (1,-2). The reflection of (0,1) in 
yam is (1,0), 











We'll give two proofs of the above result. First, a formal proof using the usual “quadratic 
equations” idea: 


Proof. The distance between (p,q) and an arbitrary point (a,a) on y= is 


(p—a)’ +(q—-a) =\/2a2-2(p+q)at pt @. 


This is minimised at a = 2(p+q)/2-2 = (p+q)/2. So, the point on y = x closest to (p,q) 
is D = ((p+q) /2,(p+q) /2). Hence, the reflection of (p,q) in D and thus also in the line 
y= is (q,p). 














Second, an informal proof-by-picture: 


Proof. Let A= (p,q) and B be the point on y = x that’s closest to A. 
Mark also the points C= (p,p), D = (q,q), and E = (q,p). Clearly, ACED is a square. 
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y A= (p,q) D=(4,49). 

Io. yo! 

I am ; I 

I ‘ 7 I 

! N ye ! 

! * 7, ! 

l N ¢ l 

I LV" I 

s 

I sé I 

Y BS. 

I rs I 

I Z 7, I 

I 7 ny I 

l ge we l 
OSD) Qo vccciceeseccenee ® 

ee E =(q,p) 
ayo 
“ oe 
=} > 





The lines AF and y =x are the square’s diagonals, which are perpendicular to each other. 
(This is a result from geometry we haven’t formally stated and proven in this textbook.) 


Moreover, |AB| =|BE]. 


Altogether, by Definition 63, EF is the reflection of A in B. Hence, by Definition 64, EF is 
also the reflection of A in /. 














Fact 38. The reflection of the point (p,q) in the line y =-«x is the point (-q,-p). 


Example 266. The reflection of (-2,1) in y = -x is (-1,2). The reflection of (0,1) in 
y= is (1,0). 











Again, we'll give two proofs of the above result. First, a formal proof using the usual 
“quadratic equations” idea: 


Proof. The distance between (p,q) and an arbitrary point (a,-a) on y = —2 is 
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(p-a)° + (qta)’ =\/202+2(q-p)atp+e@? 


This is minimised at a = 2(q-p)/(2-2) = (q-p)/2. So, the point on y = -2 closest to 
(p,q) is D = ((q-p) /2,(p-q) /2). Hence, the reflection of (p,q) in D and thus also in the 
line y = —x is (-q, -p). 














Second, an informal proof-by-picture: 


Proof. Informal proof-by-picture: 
Let A= (p,q) and B be the point on y = -x that’s closest to A. 


Mark also the points C' = (p,-p), D = (-q,q), and E = (-q,-p). Clearly, ACED is a square 
(four sides have equal length and four angles are right). 





_ D=(-49 A= (p,q) 
| ‘ 7 
I XN 7 I 
I Me Zo I 
I \ 7 I 
! ‘ af ! 
! . ! 
! ‘ oa rs l 
I ‘* - I 
‘“ ~ 7 
I 4 I 
N 7 
! x P. l 
! N F I 
! \. 4 I 
7 
! < 3 l 
I XN 7 I 
I > 7 I 
N 7 
l VAN l 
! Na’ ! 
I & I 
I 7 ‘N I 
7 XN 
! 7 B *s ! 
I 7 N I 
7 NX 
! 5 . l 
I 7 \ I 
! 4 \ l 
I . be I L 
7 XN 
I ¢ N I > 
7 ny 

1 P < I 
I 77 N I 
! ra ss ! 
! , . ! 
! ’ \ l 
I 7 MS I 
I ‘\ I 
17 = (<< = em 

fH =(-;-p) ‘e 

C . 
= — ‘\ 
(p, p) N 





The lines AF and y = -x are the square’s diagonals, which are perpendicular to each other. 
(This is a result from geometry we haven’t formally stated and proven in this textbook.) 


Moreover, |AB| =|BE]. 


Altogether, by Definition 63, EF is the reflection of A in B. Hence, by Definition 64, EF is 
also the reflection of A in /. 

















Fact 39. The reflection of the point (p,q) in the vertical line x = d is the point (2d —- p,q). 


Example 267. XXX 
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Again, we'll give two proofs of the above result. First, a formal proof using the usual 
“quadratic equations” idea: 


Proof. The distance between (p,q) and an arbitrary point (d,y) on x =d is 


(p-d)’ + (q-y) = Vy? - 2ay + p? + @? + a? - Qpd. 


This is minimised at y = —(-2q)/(2-1) = q. So, the point on x = d closest to (p,q) is 
A =(d,q). Hence, the reflection of (p,q) in A and thus also in the line x = d is (2d - p,q). 














Second, an informal proof-by-picture: 


Proof. Informal proof-by-picture:!” 








“Clearly”, the point on x = d that’s closest to (p,q) is (d,q). 


And by Fact 36, the reflection of (p,q) in (d,q) is (2d- p,q). Hence, by Definition 64, the 
reflection of (p,q) in x =d is (2d-p,q). 














Fact 40. The reflection of the point (p,q) in the horizontal line y = d is the point 


(2d =a): 





Example 268. XXX 


Again, we'll give two proofs of the above result. First, a formal proof using the usual 
“quadratic equations” idea: 


Proof. The distance between (p,q) and an arbitrary point (2,d) on y =d is 


V (p-2) + (q-d) = \/x? - 2px +p? + 92+ 2 - ¢d. 


This is minimised at x = —(-2p)/(2-1) = p. So, the point on y = d closest to (p,q) is 
A=(p,d). Hence, the reflection of (p,q) in A and thus also in the line y = d is (p, 2d - q). 














Second, an informal proof-by-picture: 


Proof. Informal proof-by-picture: 





40For a formal proof, see p. ?? (Appendices). 
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Vv: 





“Clearly”, the point on y = d that’s closest to (p,q) is (p,d). 


And by Fact 36, the reflection of (p,q) in (p,d) is (p,2d-q). Hence, by Definition 64, the 
reflection of (p,q) in y =d is (p,2d-q). 














Most generally, 


Fact 41. The reflection of the point (p,q) in the line ax + by+c=0 is the point 


( pi ceanedicn’ 
“ a2 + 0? 


(oe) 


a? + b? 





Proof. By Proposition 3, the point on the line ax + by +c =0 that is closest to (p,q) is 


( ap +bq+e a 
= Q———_—— —_ —EEEEEE—— 
e+e 4 a? + b? 


So, the reflection of (p,q) in ax +by+c = 0 is the reflection of (p,q) in the above point. And 
by Fact 36, this reflection point is 


ap+bq+c ( ned ( ap +bq+c ee) 
2 |p - a—.>———_ | - p, 2| g - b ————_] - ¢] = | p - 2a ———_., ¢ - 26 ] . 
( (» Oa +P ) eee a? + b? i ie ar a? +b? 














As with some similar results in Ch. 15, you should definitely not bother mugging Fact 41. 


Instead, seek to understand the methods by which it was derived. 


Exercise 94. Find the reflections of (3,2) in y=az and y=-z. (Answer on p. 1770.) 
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16.3. Lines of Symmetry 


Definition 65. Let G and L be graphs. The reflection of G in L is the set of points 


obtained by reflecting every point in G in L. 





Again, the above definition is general in that L can be any graph. But again, to keep things 
simple, we'll look only at cases where L is a line: 


Example 269. By informal observation,!*! the reflection of y = x? in the line y = 2 is 











Definition 66. Suppose the reflection of the graph G in the line / is G itself. Then we 
say that G is symmetric in l, or that that | is a line (or axis) of symmetry for G. 


2 is symmetric in the line 


2 


Example 270. By informal observation,’ the graph of y = x 
x = 0 (also the y-axis). Equivalently, x = 0 is a line (or axis) of symmetry for y = 2°. 


Figure to be 
inserted here. 








141Here’s a formal proof: By Fact 40, the closest point on y = 2 to any point (a, a’) in y = x? is (a, 2). 
The reflection of (a, a’) in (a,2) is the point (a,4 - ay. So, the reflection of y = x” in the line y = 2 is 
y=4-2". 

12Formal proof: By Fact 39, the closest point on x = 0 to any point (a, a”) in y = x? is (0;a7): The 
reflection of (a, a’) in (0, a?) is the point (-a,a’), which is also a point in the graph of y = x”. We've 
just shown that the reflection of any point in the graph of y = x? in the line x = 0 is itself also in the 
graph of y = x”. So, the graph of y = 2? is its own reflection in the line x = 0. 
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Example 271. Let G be the graph of y = 1/2. We show that G is symmetric in the lines 
(a) y =<; and (b) y=-z: 

Pick any point (a,1/a) in G: 

(a) By Fact 37, the reflection of any point (a,1/a) in G in the line y = z is (1/a,a), which 
is also in G. We’ve just shown that the reflection of any point in G in the line y = & is 
also in G. So, the reflection of G in y= is G. Hence, G is symmetric in y = 2. 








(b) By Fact 38, the reflection of any point (a,1/a) in G in the line y = -z is (-1/a,-a), 
which is in G. We’ve just shown that the reflection of any point in G in the line y = —z is 
itself also in G. So, the reflection of G in y = -x is G. Hence, G is symmetric in y = —-2. 
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Example 272. By informal observation,!*’ the graph of x?+y? = 1 is symmetric in every 
line through the origin. 














'43The formal proof is not conceptually difficult, but does involve tedious and messy algebra. So, I’ve 
relegated it to the Appendices—see Fact 252. 
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16.4. Symmetry in Vertical and Horizontal Lines 


Fact 42. Suppose G is a graph andaéeR. Then 


G is symmetric in For every (b,c) €G, 
—> 





the vertical line x =a (2a-b,c) eG. 


Proof. Immediate from Fact 39: The reflection of the point (b,c) in the vertical line x =a 
is the point (2a - b,c). 














Illustration of above result: 


(4,5) 








(0+1+1,3) = (2,3) 





Plug a = 0 into the above result to get the next result: 


Corollary 6. Suppose G is a graph. Then 


G is symmetric in the For every (b,c) €G, 
—> 


vertical line x =0 (y-axis) (-b,c) €G. 





Illustration of above result: 
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Fact 43. Suppose G is a graph andaeR. 


G is symmetric in For every (b,c) €G, 
—> 





the horizontal line y =a (b,2a-—c)éeG. 


Proof. Immediate from Fact 40: The reflection of the point (b,c) in the horizontal line y = a 
is the point (6, 2a-c). 














Illustration of above result: 


Figure to be 


inserted here. 
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17. Functions 


Undoubtedly the most important concept in all of mathematics is that of a func- 
tion—in almost every branch of modern mathematics functions turn out to be the 
central objects of investigation. 


— Michael Spivak (1994). 


In secondary school, you probably learnt to describe functions like this: 


“Let f (x) = 2x be a function.” 


Strictly speaking,!“* the above description of functions is incorrect. In particular, it suffers 


from these two big problems: 


1. It fails to make any mention of the domain and the codomain. 
Whenever we specify a function, we must also specify the domain and codomain.'*° 

2. It incorrectly suggests that a function must always be some sort of a “formula”. 
But it needn’t be. A function simply maps or assigns every element in the domain to 
(exactly) one element in the codomain. There need be nothing logical or formulaic about 
how this mapping or assignment is done. We will illustrate this important point with 
many examples below. 


Here is one correct way to describe functions:!“° 


Definition 67 (informal). A function consists of these three objects: 


1. A set called the domain; 


2. A set called the codomain; and 


3. A mapping rule that maps (or assigns) every element in the domain to (exactly) 
one element in the codomain. 





Two very simple examples of functions: 





14Pedagogical note: Many bits of H2 Maths are equivalent to first- and even second-year university 
courses in many countries. (Well, at least on paper. See my Preface/Rant.) My view is that while the 
above incorrect description of a function may have been suitable at earlier stages of a student’s maths 
education, it is no longer so at this stage. Definition 67 (below) is not at all difficult (especially when 
compared with a lot of the junk that’s already in H2 Maths). At the cost of very little additional pain 
and time, the student gains a far better understanding of what functions are and how they work, and 
thus saves herself more grief in the long run. 

'45Things become so much simpler if we make it clear from the outset and indeed insist that the very 
definition of a function includes the specification of a domain and codomain. Instead we have the 
present rigmarole where we ask students to explain which values “to exclude” from a function’s domain, 
as if we were doing some ad hoc repair to make the function “work”. 

M61 efinition 67 will serve us very well. Note though that it is a little informal. For the formal definition, 
see Definition 280 (Appendices). 
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Example 273. Let f be the function with 


1. Domain: The set {1,5,3}. 
2. Codomain: The set {100, 200}. 
3. Mapping rule: f (1) = 100, f (5) = 100, f (3) = 200. 


The function f is well-defined, because every element in the domain maps to or “hits”!4” 
(exactly) one element in the codomain: 


1 “hits” 100, 5 “hits” 100, 3 “hits” 200. 


There is no apparent logic or formula behind how the mapping rule works. 


Why should 1 “hit” 100, 5 The function f 


“hit” 100, and 3 “hit” 200? 

No matter. To qualify as 

a well-defined function, all \ f/f 100 
that matters is that every 

element in the domain is 

mapped to or “hits” (exactly) 200 
one element in the codomain. 

Which is true of f. So, f is 

well-defined. 


Domain f Codomain f 








"The term “hits” is informal (hence the scare quotes). The formal term is maps to. But “hits” is probably 
clearer, more evocative, and easier to understand. 
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Example 274. Let g be the function with 


1. Domain: The set {1,5,3}. 
ae Codomain: The set {100,200}. 
3. Mapping rule: g (1) = 100, g(5) = 100, g (3) = 100. 


Observe that the element 200 The function g 
in the codomain is not “hit”. 
Nonetheless, g is again a well- 


defined function, because it \ _f 
satisfies the requirement that 


every element in the domain pe 


maps to or “hits” (exactly) 
one element in the codomain: 


1 “hits” 100, 
5 “hits” 100, 


: Cod . 
3 “hits” 100. Domain g odomain g 


For a function to be well-defined, there is no requirement that every element in the 
codomain be “hit”. 


And so here, even though 200 in the codomain isn’t “hit”, g is a perfectly well-defined 
function. 
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The concept of a function is of great generality. To qualify as a function, all we require is 
that every clement in the domain maps to or “hit” (exactly) one element in the codomain. 
In each of our above examples, both the domain and codomain were sets of real numbers. 
But in general, they could be any sets whatsoever. 


We'll use these six symbols in this chapter and again in Ch. 24: 


Cow Chicken Dog Produces eggs Guards the home Produces milk 
wv * ) a7] 


Example 275. Let h be the function with 


Le Domain: The set {FR wy}. 
2 Codomain: The set {@, 5, p}. 
3. Mapping rule: “Match the animal to its role.” 


Here the mapping rule is informally stated in words. 


Nonetheless, it is clear eenameniongn 
enough. If we wanted to, we 


could write it out formally, 
like this: 


h (PR) =), 
h(W) =o. 


The function h is well- 
defined, because it satisfies 
the requirement that every 
element in the domain “hits” 
(exactly) one element in the codomain. 


Domain h Codomain h 





In the above example, the mapping rule “makes sense”—we simply map each animal to its 
role. In the next example, the mapping rule “makes no sense”. Nonetheless and all the 
same, our function is perfectly well-defined: 
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Example 276. Let 7 be the function with 


iL. Domain: The set {FR w}. 
2. Codomain: The set {@, q, yi. 
3. Mapping rule: i (PR) -@, i(@) = @. 


The function 2 
This time, the mapping rule 


“makes no sense”—it maps 


SR to 0 and W to F. 
Nonetheless, this is a well- 
defined function, because it 
maps every element in the 
domain to (exactly) one ele- 
ment in the codomain. 


Domain 2 Codomain 2 
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Another pair of examples: 
Example 277. Let 7 be the function with 
i. Domain: The set of UN member states. 


2. Codomain: The set of ISO three-letter country codes. 
3. Mapping rule: “Match the state to its code.” 


The domain, codomain, and mapping rule are all informally stated in words. Nonetheless, 
they are clear enough. 


The domain contains 193 elements, namely Afghanistan, Albania, Algeria, ..., Zambia, 
and Zimbabwe (list). The codomain contains 249 elements, namely ABW, AFG, AGO, 


AIA, ALA, ..., ZMB, and ZWE (list).' 


And the mapping rule maps every element in the domain to (exactly) one element in the 
codomain. For example, 


j (The United States of America) = USA, 
j (China) = CHN, 
j (Singapore) = SGP. 


Altogether then, 7 is a well-defined function, because it satisfies the requirement that 
every element in the domain “hits” exactly one element in the codomain. 





Again, a very similar example, but this time the mapping rule “makes no sense”: 


Example 278. Let k be the function with 


1. Domain: The set of UN member states. 
2. Codomain: The set of ISO three-letter country codes. 


3. Mapping rule: For every x, k (2) = SGP. 


The function k simply maps every UN member state to the three-letter code SGP. This 
is strange and seems to make no sense. Nonetheless, k& is a well-defined function, because 
it satisfies the requirement that every element in the domain “hits” exactly one element 
in the codomain. 





So! Specifying a function is jolly simple. Simply write down the 
1. Domain (could be any set); 


2. Codomain (could be any set); 


3. Mapping rule (maps every element in the domain to exactly one element in the 
codomain). 





'48There are more ISO three-letter country codes than UN members because not every “country” is a UN 
member. For example, Greenland is assigned the ISO code GRL, but is a territory of Denmark and 
is not a UN member state. The Holy See (or Vatican City) is a fully sovereign state and is assigned 
the ISO code VAT, but is not a UN member state. Taiwan is, for all intents and purposes, a fully 
sovereign state and is assigned the ISO code TWN, but is not a UN member state. 
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Now, a function consists of the above three pieces. Hence—and here’s a somewhat subtle 
point—two functions are identical (i.e. equal) if and only if they have the same domain, 
codomain, and mapping rule. 


Example 279. Recall from above the function h: 


The function h 


Domain h Codomain h 


Now consider the very similar-looking function hy: 


The function hy 


P< 


Domain h, Codomain h 


The functions h and h, look very similar. Indeed, they have the same domain and 
mapping rule. 

However, their codomains are different. And so h and hy are distinct:!° 
RF Pine 


One might think, “Azyah, the codomain not very important wat. Both h and h, map 
SPR to | and W to J. So just call them the same function lah!” But this is wrong. 
Again, to reiterate, stress, and emphasise, a function consists of three pieces: the domain, 
the codomain, and the mapping rule. So, 


Two functions are identical <= Same domain, codomain, and mapping rule. 








49 Distinct is a synonym for not equal. 
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Example 280. Let / be the function with 


1. Domain: The set {1,2}. 
Pa Codomain: The set {1,2,3,4}. 
3. Mapping rule: For every x, | (x) = 22. 


The function / 


Domain / Codomain | 
Now consider the very similar-looking function /,, which has 


1, Domain: The set {1,2}. 
Pe, Codomain: The set {1,2,4}. 


3. Mapping rule: For every z, 1; (2) = 22. 


The function 1, 


Domain |; Codomain 1; 


Again, the functions / and 1, look very similar—they have the same domain and mapping 
rule. However, their codomains are different. And so, / and l; are distinct: 


ate 
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17.1. What Functions Aren’t 


To better understand what functions are, we’ll now look at what functions aren’t. That is, 
we'll now look at examples of non-functions: 


Example 281. It is alleged that f is the function with 


i Domain: The set {1, 5,3}. 
2 Codomain: The set {100,200}. 
3. Mapping rule: f (1) = 100, f (3) = 200. 


The “function” f 


Domain f Codomain f 


Unfortunately, f is not a function, because f fails to map every element in the domain 
to (exactly) one element in the codomain—in particular, f fails to map 5 to any element 
in the codomain. 
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Example 282. It is alleged that g is the function with 


1. Domain: The set {1, 5,3}. 
a Codomain: The set {100,200}. 
3. Mapping rule: g(1) = 100, g (1) = 200, g (5) = 100, g (8) = 200. 


The “function” g 


Ww 100 
oy 200 


Domain g Codomain g 


Unfortunately, g is not a function, because g fails to map every element in the domain 
to (exactly) one element in the codomain—in particular, g maps | to two elements, 
namely 100 and 200. 


Example 283. It is alleged that h is the function with 


1. Domain: The set of ISO three-letter country codes. 
2: Codomain: The set of UN member states. 
3. Mapping rule: “Match the code to the corresponding state.” 


Unfortunately, h is not a function, because h fails to map every element in the domain 
to (exactly) one element in the codomain. 


For example, h fails to map VAT in the domain to any element in the codomain. This 
is because VAT corresponds to the Holy See (or the Vatican City), which is not a UN 
member state and thus not an element of the codomain. 


Example 284. It is alleged that 7 is the function with 


i Domain: The set of real numbers R. 
2. Codomain: The set of real numbers R. 
3. Mapping rule: For all x, i(x) = Vz. 


Unfortunately, 2 is not a function, because 7 fails to map every element in the domain 
to (exactly) one element in the codomain. 

For example, 7 fails to map -1 in the domain to any element in the codomain. This is 
because V-1 is not a real number. 
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17.2. Notation for Functions 
As shorthand, we’ll denote a function f’s domain and codomain by Domain f and Codomain f. 


Example 285. Let f be the function with 


ie Domain: The set {3,4}. 
a Codomain: The set {5,6,7,8}. 
3. Mapping rule: “Double it.” 


The function f 


Domain f Codomain f 


Domain f = {3,4}, 
Codomain f = {5,6,7,8}. 


We may write 





We now discuss how the mapping rule is written. 


To write down a mapping rule, there is usually more than one way. In the above 
example, the mapping rule was written informally as “Double it”. But we could also have 
written it more formally as 


“For every x € Domain f, we have f (x) = 22.” 


Alternatively, we could’ve written the mapping rule explicitly, stating what each and every 
element in the domain is to be mapped to: 


“f(3)=6,  f (4) =8? 


The mathematical punctuation mark + means maps to. And so, here’s yet another way 
to write the mapping rule: 
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“f:3 6, f:4 8” 


So, altogether, in the above example alone, we’ve given four different but entirely equivalent 
ways to write out the mapping rule. 


You can choose to write the mapping rule however you like. What’s important is that you 
make clear how the mapping rule maps each element in the domain to (exactly) one 
element in the codomain. If you haven’t made it sufficiently clear, then you have failed to 
communicate to others what your function is and your function is not well-defined. 


Here are eight ways to say aloud “f (4) = 8” or “f:4 8”: 






























































“f of 4 is 8.” “f of 4 equals 8.” “f of 4 is equal to 8.” 
“f maps 4 to 8.” “The value of f at 4 is 8.” “f evaluated at 4 is 8.” 
“The value of f when applied to 4 is 8.” “The image of 4 under f is 8.” 








If « ¢ Domain f, then f(a) is simply undefined. So in the last example, 0 ¢ Domain f; 
and thus, f (0) is simply undefined. 


Example 286. Let g be the function with 


ie Domain: The set of positive integers Z”. 
2. Codomain: The set of real numbers R. 
3. Mapping rule: “Double it.” 


Then we have, for example, 


g(1)=2, g(2)=4, and g(3)=6. 
Or equivalently, g:1%2,9:244, and g:3 6. 


Since -1,1.5,7,0 ¢ Domaing, g(-1), g (1.5), g (7), and g(0) are undefined. 
Likewise, since FR, W ¢ Domaing, g (<PR) and g (ww) are undefined. 


The mapping rule, “Double it,” was somewhat informal. We could’ve written it more 
formally as 


For every z € Z*, we have g(x) = 2z. 


With the aid of an ellipsis, we could also have written it down explicitly:!” 


g(1) =2, 9 (2) = 4, 9(3) =6, 9(4) =8, ... 
Or equivalently, gilp2,g:24,9:36, 9:4 8,... 





So far, we’ve written down functions with the aid of tables and/or figures. But going 
forward, we’ll want to write down functions more concisely and without the aid of 





0But in general, this may not be possible. 
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tables or figures. 

Here are nine entirely equivalent and formal ways to fully write down the function g from 

the last example: 

1. Let g be the function that maps every element x in the domain Z* to the element 2x in 
the codomain R. 

. Let g be the function that has domain Z*, codomain R, and maps every x € Z to 2x ER. 

. Let g: Z* > R be the function defined by g (x) = 22. 

. Let g:Z* > R be the function defined by g: x2 22. 

. Let g: Z* > R be defined by g(x) = 22. 

. Let g: Z* > R be defined by g: x2 22. 

. Define g: Z* > R by g (2) = 22. 

. Define g:Z* > R by g: x 22. 

. Define g: Z* > R by x 2z. 


Oo Con DD o8 FP W LD 


In Statements 3-9, the domain comes after the colon “:”, the codomain after the right 
arrow “+”, and the mapping rule at the end of the statement. 


The general version of Statement 9 is 


Domain f Codomain f 
K 7 
Define f: A> B by re f (2). 
eS 


Mapping rule 


In words, the function f is defined to have domain A, codomain B, and mapping rule 
cr f(z). 
Example 287. Define h: R* > R by h(x) = 2z. 
The domain of h is R* (the set of positive real numbers). The codomain is R (the set of 
real numbers). Expressed informally in words, the mapping rule is “Double it”. 
So for example, nil s=2, h(2.3)=4.6, and h (3) =6. 
Or equivalently, h:1%2,h:2.3 4.6, andh:3+ 6. 


Observe that —1,0, 9%, W ¢ R* = Domainh. Thus, the following are all undefined: 


h(-1), 2(0), h (APR), and h(). 
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Example 288. Define i: Z— R by i(x) = 2”. 

The domain of 7 is Z (the set of integers). The codomain is R (the set of real numbers). 
Expressed informally in words, the mapping rule is “Square it”. 

So for example, (= 1247) = 49, and 7({—-6) = 36. 

Or equivalently, t:1%1,171: 74 49, andi:-6+ 36. 


Observe that 1.5, 71, 9, W ¢ Z = Domaini. Thus, the following are all undefined: 


h(1.5), i (70), ¢ (AM), and i(W). 


Example 289. Define j: R* > R by j (x) = 2”. 


The domain of j is R* (the set of positive real numbers). The codomain is R (the set of 
real numbers). Expressed informally in words, the mapping rule is “Square it”. 


So for example, 7) = 1 2) = 4 oe ond 9 (6) = 26. 
Or equivalently, gi lel, 9:2.2% 4.84, and 7:64 36. 


Observe that —1,-3.2, ¥, W ¢ Z = Domain j. Thus, the following are all undefined: 


j(-1), J (-3.2), 7 (BR), and j (®). 


Note that as usual, the symbol x here is merely a dummy variable that can be replaced 
by any other symbol, such as y, z, ©, or %&. So, we could also have written this example’s 
first sentence as either 


“Define 7: R* > R by j (y) = y?” “Define 7: Rt > R by 7 (©) = ©””. 





(Consider the last four examples. Is g = h? Is i = 3?) 


In the above examples, we actually cheated a little. Or rather, we took it for granted that 
the specified mapping rule applied to all elements in the domain. If we wanted to be extra 
careful/pedantic, then here’s what we should “really” have written: 


Example 290. Define g:Z > R by g(a) = 2a for all x€ Z. 


Example 291. Define h: R* > R by h(x) = 22 for all x € R*. 


Example 292. Define i: Z > R by i(x) = 2? for all x ¢ Z. 


Example 293. Define j: R* > R by j (x) =2? for all x ¢R*. 


This is because we can have a piecewise function, where there are different mapping 





151 The functions g and h have different domains and so g + h. Likewise, the functions 7 and j have different 
domains and so 7 # j. 
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7 06 


rules for different “sections”, “pieces”, or intervals of the domain: 


Example 294. Define k: Z > R by 


f 
ee 2a Or <5) 


xt, fOr 2. 


In words, the function k doubles integers that are less than or equal to 5, but adds one 
to those that are greater than 5. 


So for example, k (-10) = -20, k (0) =0, and k (4) =8. 
But, k (7) =8, k(15) = 16, and k (100) = 101. 


Example 295. Define /: R* > R by 


for x <5, 


Ma)=4" 
2. Lor 2 > 3: 


In words, the function | squares positive real numbers that are less than or equal to 5, 
but cubes those that are greater than 5. 


So for example, NM ije te 1 = Vol and) (4) = 16: 
But, 1(5.3) = 148.877, 1(7) = 343, and 1(9) = 729. 
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Exercise 100. Evaluate each function at 1. (Answer on p. 1772.) 
Define a:R->R an) =n 
b:[-1,1]-R og) =1a:; 
c:Z* +R CQ) =35 
d:Z +R La = 
eo kk ea) aie 





Exercise 101. Four functions are given below. (Answer on p. 1772.) 


(a) Verify that each is well-defined. 
(b) Which (if any) of them are equivalent? 


Function Domain Codomain Mapping rule 
a leo} ele eon “Double it” 
b ay ees si eae ay or “Double it” 
C ee aca) luau antlr “Double it” 
d [0,4)nZ (-3,6] NZ “Double it” 























Exercise 102. Define f : R™ > R by “round off to the nearest integer (and round up 
half-integers)”. (Answer on p. 1772.) 


(a) What are f (3), f (7), f (3.5), f (3.88), and f (0)? 


Would the function still be well-defined if we changed , respectively, 


(b) Its domain to R and codomain Z? 


(c) Its domain to Z and codomain R? 


Exercise 103. Below, the functions f, g, and h are described in words. Write down a 
formal and concise statement that defines each. (Answer on p. 1772.) 


(a) The function f has domain the set of positive integers, codomain the set of integers, 
and mapping rule, “Triple it”. 


(b) The function g has domain the set of negative real numbers, codomain the set of 
non-zero real numbers, and mapping rule, “Square it”. 


(c) The function h has domain the set of even integers, codomain the set of positive even 
integers, and mapping rule, “Square it”. 
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17.3. Warning: f and f(z) Refer to Different Things 


If you cannot say what you mean, Your Majesty, you will never mean what you 
say and a gentleman should always mean what he says. 


— Peter O’Toole as Reginald Johnson, in The Last Emperor (1987). 


A common mistake is to believe that f (7) denotes a function. But this is wrong. 


f and f (2) refer to two different things. 


f denotes a function. f (x) denotes the value of f at x. 
In the next two examples, S is the set of human beings. 


Example 296. Let h: S > R be the height function. That is, h gives each human being’s 
height (rounded to the nearest centimetre). 


So for example, h (Joseph Schooling) = 184. 


h (Joseph Schooling) is not a function. 
Instead, h (Joseph Schooling) is the number 184. 


It would be silly to say that h(Joseph Schooling) is a function, because it isn’t— 
h (Joseph Schooling) is the number 184. 


And in general, for any human being x, h() is her height (rounded to the nearest cm). 
Again, h(a) is not a function—it’s x’s height (rounded to the nearest cm), which is a 
number. 


Example 297. Let w: S > R be the weight function. That is, w gives each human 
being’s weight (rounded to the nearest kilogram). 


So for example, w (Joseph Schooling) = 74. 


w (Joseph Schooling) is not a function. 
Instead, w (Joseph Schooling) is the number 74. 


It would be silly to say that w (Joseph Schooling) is a function, because it isn’t— 
w (Joseph Schooling) is the number 74. 


And in general, for any human being x, w(x) is her weight (rounded to the nearest kg). 
Again, w(x) is not a function—it’s x’s weight (rounded to the nearest kg), which is a 
number. 





This may seem like an excessively pedantic distinction. But maths 7s precise and pedantic. 
In maths, we are gentlemen (and ladies) who say what we mean and mean what we say. 
There is no room for ambiguity or alternative interpretations. 
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17.4. Nice Functions 


Example 298. Consider the function f : [1,5] > R defined by f (x) = 2’. 

The domain of f contains only real numbers. Hence, f is a function of a real variable. 
The codomain of f contains only real numbers. Hence, f is a real-valued function. 
Both the domain and codomain of f contain only real numbers. Hence, f is a nice 
function. 


Definition 68. We call a function whose 


1. Domain contains only real numbers a function of a real variable; 


2. Codomain contains only real numbers a real-valued function (or more simply, real 
function); 


3. Domain and codomain both contain only real numbers a nice function. 


Example 299. Define the function g: { R, wy} > Z by g (PR) =0 and g (w) alle 


Domain g contains objects that aren’t real numbers. So, g is not a function of a real 
variable. 


Codomain g contains only real numbers. So, g is a real-valued function. 


Either Domain g or Codomaing contains objects that aren’t real numbers. So, g is not 
a nice function. 


Example 300. Define the function h : (-2,5) > { FR, ws} by h(x) = PR. 
Domain h contains only real numbers. So, h is a function of a real variable. 
Codomain h contains objects that aren’t real numbers. So, h is not a real-valued function. 


Either Domainh or Codomainh contains objects that aren’t real numbers. So, h is not 
a nice function. 





Happily, most functions we’ll encounter in H2 Maths are nice functions. That is, we’ll often 
encounter functions like f, but not functions like g or h. 
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17.5. Graphs of Functions 


Given a function f, its graph is simply the graph of the equation y = f(x) with the 
constraint « € Domain f. A little more formally, 


Definition 69. Given a function f, its graph is this set of points: 


{(x,y): x € Domain f,y = f (x)}. 
Example 301. Define f: R- R by f (x) = 22. 


IN 
The graph of f is the set | Y 


VG Ure y= 2a, 








By the way, strictly speaking, we should say 


“The point (3,6) is in the graph of f.” 


That is, we should explicitly state both the x- and y-coordinates of any point we are 
talking about. 


However, since the x-coordinate is sufficient for identifying a point on the graph of a 
function (why?),!°? we will sometimes be lazy and say things like 


“The point x = 3 is on the graph of f.” 


Or even, “The point 3 is on the graph of f.” 








1524 function maps each x-coordinate to exactly one y-coordinate. So, given only an 2-coordinate, we 
immediately also know what the corresponding y-coordinate is. 
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Example 302. Define f, : R* > R by f; (x) = 22. 


IN 
The graph of f; is the set | Y 


\2qjeceR ga 27). 








Example 303. Define fo: Z—> R by fo (x) = 22. 


The graph of fo is the set a 
(eg) eZ, y= 22). 








The graph of fo is simply this set of isolated points: 


IRB? Ai I (OK (4), (eG) ee 
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Example 304. Define g: R > R by g(x) = 2”. 


IN 
The graph of g is the set 
(Gig) elie |. 








Example 305. Define g;: R* > R by gi (x) = 2”. 


IN 
The graph of g; is the set | Y 


{ (x,y) eel’ .y= Dele 








Note that again, the graph of g; does not contain the point (0,0), because 0 ¢ R*. 
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Example 306. Define g.:Z > R by go (x) = 2”. 


IN 
The graph of go is the set | Y 


eae e Lae | 








The graph of go is simply this set of isolated points: 


A ERR Rh imo) pala hit oa cic) eee | 
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17.6. Piecewise Functions 


Informally, a piecewise function is a function that has different mapping rules for different 
intervals (or “pieces”): 


Example 307. Define h: R > R by 


2 
ag 
a for ¢ > 2: 


tor @ = 2; 
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Example 308. Define 7: RR > R by 


for x ¢ Z, 


ie) = 


for x € Z. 











An important piecewise function is the absolute value function (definition reproduced 
from Ch. 5.2): 


Definition 25. The absolute value (or modulus) function, denoted |-|, is defined for all 
xéR by 


toro = 0) 


for 7 =< 0) 


Figure to be 
inserted here. 





Another important piecewise function is the sign function: 
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Definition 70. The sign function sgn: IR > R is defined by 


-1, for 4 < 
sgn z= 40, for =), 


1. for x > 0. 


Figure to be 
inserted here. 


Example 309. sgn7 = 1, sgn -5 =-1, sgn0=0, sgn 100 = 1, sgn — 531 = -1. 





The absolute value function is closely related to the sign function: 


Example 310. Ele ies 1=sgn7 and —— = — =-1=sgn -7. 


7] 7 =5] 





In Ch. 5.5, we gave this result: 


Fact 13. cee le 


a 





And so, Examples 129 and 130 from Ch. 5.5 may be restated thus: 


Example 311. In Exercise 66 (Ch. 5.5), we gave this false statement: 


Va? Va? 


For all x eR \ {0}, — 
Ee ae 


The correct statement should instead be this: 


Var _ |e 1, 


For all « € Rx {0}, we = lesen = 
x x = 
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1. 


for x > 0, 


for x < 0. 
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Example 312. Similarly, in Exercise 67 (Ch. 5.5), we gave this false statement: 


ii 
For all ze Rv {0}, —== 
a 


The correct statement should instead be this: 


for @.> () 


iz 
For allxae Rv {0}, —=2=— = 
Vr? for 2<0) 
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Exercise 106. Let A = {Lion, Eagle} and B = {Fat, Tall}. Can we construct a well- 
defined function using A as the domain and B as the codomain? (Answer on p. 1773.) 


Exercise 107. What do we call a function whose ... 


(a) Domain contains only real numbers? 


(b) Codomain contains only real numbers? 


(c) Domain and codomain contain only real numbers? (Answer on p. 1773.) 


Exercise 108. Below are 17 alleged functions named a through q, with a given domain, 
codomain, and mapping rule (the last being given informally in words). 


(i) Explain whether each alleged function is actually well-defined. 
And if it is, 
(ii) Write down a statement that formally and concisely defines it.(Answer on p. 1773.) 


Function Domain Codomain Mapping rule 
{or 0.0) Z, “Double it” 
yon} “Double it” 
15,0; 1} “Double it” 

1o.4, 0,7) “Double it” 

(ora, 0, a * “Double it” 

toi “Any larger number” 























ya, at “Any larger number” 





103+ “Any larger number” 





{3,4} “Any larger number” 





42,4} “Any smaller number” 
{1} “Stay the same” 
{1} “Stay the same” 

{12% “Stay the same” 

R “Take the square root” 

















R “Take the reciprocal” 
R “Add one” 
ore “Add one” 

















Exercise 109. Continuing with the above exercise, how can we change the domains of 
n and o so that n and o become well-defined?!” (Answer on p. 1774.) 








3The sharp student may have noticed that one trivial answer here is to simply change the domain to 
the empty set @. Then it is trivially or vacuously true that every element in the domain is mapped 
to exactly one element in the codomain. If you’ve noticed and are bothered by this, please change the 
question to, “What are the ‘largest’ subsets of R to which the domains of n and o can be changed, so 
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17.7. The Vertical Line Test 


Example 313. Let G be the graph of the equation x? + y? = 1. Recall that G is simply 
the unit circle centred on the origin: 


Figure to be 
inserted here. 


Could G be the graph of a function? That is, does there exist some (nice) function f 
whose graph is G? 


Answer: No. 


it 11 
The reason is that for example, the points (5. i) and (- | are both in G. So, if G 


il if il 
were the graph of f, then we’d have (5) =— an 5 1 That is, f maps the 


element — to two elements in the codomain. But this contradicts our requirement that a 
function maps each element in its domain to exactly one element in its codomain. 


Hence, G cannot be the graph of any function. 


Example 314. XXX 


More generally, we have the Vertical Line Test (VLT): 





Fact 45. (Vertical Line Test) Suppose G is a graph (i.e. any set of points in the 


cartesian plane). Then G is the graph of a nice function <> No vertical line intersects 
G more than once. 








Proof. See p. 1572 (Appendices). 











The above result says merely that a graph that fails the VLT can’t be the graph of one 
function. It could however be the (union of the) graphs of two or more functions: 





that they become well defined?” 
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Example 315. Again, let G be the graph of 2?+y? = 1. Although G cannot be the graph 
of one function, it could be the (union of the) graphs of two functions. 


Figure to be 
inserted here. 


Define f :[-1,1] > R by f(x) = V1-2? and h: (-1,1) > R by A(z) =-V1-2?. 
Let F and H be the graphs of f and h, respectively. 


Observe that G = F. UH. So, we have successfully constructed two functions, whose 
graphs’ union form G. 


We could of course further split G into the graphs of even more functions. 


Figure to be 
inserted here. 


For example, we could define f; : [-1,0] > R by fi (z) = V1-2? and fp: (0,1] > R by 
fo(a) = V1-2?. Let F, and F be the graphs of f; and fo. Then G= F,U Fy UH. 


Example 316. XXX 
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17.8. Range 


Informally, the range (of a function) is the set of elements in the codomain that are “hit”. 
Formally, 


Definition 71. The range of a function f, denoted Range f, is the set defined by 
Range f = {f (x): 2 ¢€ Domain f}. 


Example 317. Define f : [1,2] > R by f (2) =2+3. Then 


¢ Codomain f = R (the set of reals). 
¢ Range f = [4,5] (the set of reals between 4 and 5, inclusive). 


Figure to be 
inserted here. 


So, Range f c Codomain f and, in particular, Range f # Codomain f.!™ 


Example 318. Define g: R > R by g(x) = x+3. Then Codomain g = R (the set of reals). 


Every element in the codomain R is “hit”. And so, the range is the same as the codomain: 
Range g = R. 


Figure to be 
inserted here. 


So, Range g = Codomain g (and Range g © Codomain g). 





The range will always be a subset of the codomain. Equivalently, it will always be either 
a proper subset of the codomain (Example 317) or equal to the codomain (Example 318). 





4Tnformally and intuitively, we are tempted to say that the range of f is “strictly smaller than” its 
codomain. However, as explained in Remark 13, we shall avoid such language. 
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Example 318 shows that the range can sometimes be identical to the codomain. But 
as Example 317 shows, this is not generally true—in general, not every element in the 
codomain will be “hit”. Because this is such a common point of confusion, let me repeat: 


9 The range is not the same thing as the codomain. 9 


Example 319. Define h: R > R by h(x) = e*. Then 


Codomainh = R and Range h = R*. 


Figure to be 
inserted here. 


So, Rangeh c Codomainh and, in particular, Rangeh # Codomainh. 


Example 320. Define 7: R* > R by i(2) =Inz. Then 


Codomain? = R and Range? = R. 


Figure to be 
inserted here. 





So, Rangei = Codomainz (and Rangei € Codomain2?). 





15°Of course, I can’t be sure because your A-Level syllabus and exams are never clear about what. they 
mean by the term range and never use the term codomain. 

156Some recommend avoiding the term range (has different meanings for different writers) and using only 
image (has a single meaning for almost all writers). as sts 
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Example 321. Define j: {1,2} > Rj by 7 (x) =2+3. Then 


Codomain j = Rg and Range j = {4, 5}. 


Figure to be 
inserted here. 


So, Range7 c Codomain 7 and, in particular, Range 7 # Codomain j. 


Example 322. Define k: {1,2} > {4,5} by k(x) =2+3. Then 


Codomain j = {4,5} and Range, = {4p}. 


Figure to be 
inserted here. 


So, Rangek = Codomaink (and Rangek ¢ Codomaink). 


Is the function k the same as the function j in the previous example?!’ 


Definition 72. A function whose range and codomain are equal is called onto. 








1S7No. Although k and j have the same domain and mapping rule (indeed, they even have the same range), 
their codomains differ. Hence, k # 7. 
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Example 323. In the above examples, g, 7, and k were onto because for each, the range 
was the same as the codomain: 


g: R -R defined by g(x) =2+3, Rangeg =R; 
i: R* +R ‘ i(t) =Iex, Range, =; 
k: {1,2} > {4,5} ko) =a+3, Ranger ={4,5). 


In contrast, f, h, and 7 were not onto because for each, the range was a proper subset 
of the codomain: 


f: [1,2] ~R defined by f(z) =2+3, Range f =[4,5]; 


h: R =-R hia) =e, Rangeh = R*; 
j: {1,2} > Ro j(a)=2+3, Rangej = [4,5]. 
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Exercise 110. Find the range of each function. Which (if any) of these functions is 


onto? 


(a) 
(b) 
(c) 
(d) 
(e) 
(f) 


Define a:R} +R by a(x) =vVz. 
Define b:Z>R Gs) ee 
Define c:Z > Z Cau. 
Define d:Z > Z CO) =7 
Define e:Z>R e(ayraa + t. 
The function f 


Domain f Codomain f 


The function g 


\—-f 


Domain g Codomain g 


(Answer on p. 1774.) 


Exercise 111. Let f be a function. Of the following six statements, which must be true? 


(a) Range f ¢ Domain f. 
(b) Range f ¢ Codomain f. 
(c) Range f c Domain f. 
(d) Range f c Codomain f. 
(e) Range f = Domain f. 
(f) Range f = Codomain f. 
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(Answer on p. 1775.) 
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18. An Introduction to Continuity 


Informally, if you can draw a function’s graph without lifting your pencil, then that function 
is continuous.!°> Most functions we’ll encounter in A-Level maths will be continuous: 


Example 324. The function f : R > R defined by f(x) = 2° — x? - 1 is continuous, 
because you can draw its entire graph without lifting your pencil. 


IN 
Y 





f is continuous because 
you can draw its entire graph 
without lifting your pencil. 





Example 325. The sine function sin is continuous. 


ZN Ree : 
sin is continuous because 


you can draw its entire 
graph without lifting 
your pencil. 














158The converse though is false—that is, a function can be continuous even if its graph cannot be drawn 
without lifting your pencil. 


233, Contents www.EconsPhDTutor.com 


Example 326. The exponential function exp is continuous. 


IX 
Y 


exp is continuous because 
you can draw its entire graph 
without lifting your pencil. 








Example 327. The absolute value function |-| is continuous. 


|-| is continuous 

because you can draw 
its entire graph without 
lifting your pencil. 
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As stated, most functions we’ll encounter are continuous. However, we’ll occasionally en- 
counter functions that aren’t continuous everywhere. For example, functions with “holes” 
in them are not continuous: 


Example 328. Define 7: R > R by 


es tora + 1, 


O- . tora = 1: 


ia) = 


The function 7 is not continuous. In Part V (Calculus), we’ll learn why precisely this is 
so. For now, an intuitive understanding will suffice. 


IN 
Y 





Note though that 7 is continuous on (—oo,1) because you can draw this portion of the 
graph without lifting your pencil. 


Similarly, 7 is continuous on (1, co) because you can draw this portion of the graph without 
lifting your pencil. 


And so, we can actually say that 7 is continuous on (—0o0, 1) U (1, 0). 
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This isn’t something you need to know, but just to illustrate, here’s a somewhat exotic 
function whose domain is R but which is nowhere-continuous. 


Example 329. The Dirichlet function!? d: R > R is defined by 


1, forxreQ, 
0, forr¢€éQ 


Gla )y= 


Then d is not continuous everywhere, because to draw its entire graph, you must lift 
your pencil. 


In fact, it’s worse than that—d is nowhere-continuous! Informally, this means you 
can’t draw more than one point of the graph without lifting your pencil.1® 


The Dirichlet function ay 


d: IR - R is defined by: z 


1 for r€eQ, 
0 forxr¢Q. 


an)-= 


The graph of d contains the 
point (7,1) for every re Q. 


The graph of d contains the 
point (7,0) for every x ¢Q. 


1S OSES SSO6686S680G6666665969S56E089S6SESS9608SSS6SS88 
z 











90Or the characteristic function of the rationals. Named after the German mathematician Peter 
Gustav Lejeune Dirichlet (1805-59). W 
160We'll revisit this example in Part V (Calculus). 
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18.1. A Subtle Point about Continuity 


We'll formally define continuity only in Part V (Calculus). 


It turns out that given a function f, we say that f is 


e Either continuous or not (and not both) at each point x ¢ Domain f (Definition 198); 


¢ Continuous (and called a continuous function) if it is continuous at every point x € 
Domain f (Definition 199); 


e Neither continuous nor discontinuous at every point x ¢ Domain f 


The last bullet point is somewhat subtle and surprising. Here are three examples to illus- 
trate it: 


Example 330. Define f :[1,2]U[3,4] > R by 


iF for 2 | 1,2). 
f(z) = ae 
2, for x e€|3,4]. 


Figure to be 
inserted here. 


It turns out that f is continuous at every x € Domain f = [1,2] U[3, 4]. 


And so, perhaps surprisingly, f is a continuous function. 
Example 331. Define g: (0,1) U (1,2) U (2,3) > R by 
if for x € (0,1), 


g(z)=42 for x€ (1,2), 
3, tor ve (2,3) 


Figure to be 
inserted here. 


It turns out that g is continuous at every x € Domain f = (0,1) U (1,2) u (2,3). 





And so, perhaps surprisingly, g is a continuous function. 
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Example 332. Here’s the graph of the tangent function (to be reviewed in Ch. 33): 


Figure to be 
inserted here. 


Again, it turns out that tan is continuous at every x ¢€ Domain tan 
{Odd integer multiples of 71/2}. 


And so, perhaps surprisingly, tan is a continuous function. 





The last three examples also show that the (very) informal “you can draw a function’s 
graph without lifting your pencil” definition of continuity is actually wrong!'®! 


We could try to modify this informal statement to, “For each connected subset of the 
function’s domain, you can draw the graph without lifting your pencil.” This is “more” 
correct but still somewhat imprecise and informal (“lifting your pencil” isn’t exactly a 
precise or formal statement). 


Again, the formal and correct definitions of continuity will be given in Part V (Calculus)— 
Definitions 198 and 199. For now it suffices to be aware of the above subtle point. 





161\fore precisely, satisfying this condition is sufficient but not necessary for continuity. 
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19. When a Function Is Increasing or Decreasing 


Informally, we know what it means for a function to be increasing, decreasing, strictly 
increasing, and/or strictly decreasing: 


Example 333. Consider the function f :R > R defined by f (x) = 2”. 


Figure to be 
inserted here. 


The function f is decreasing—and indeed, strictly decreasing—on Rp = (-09, 0]. 


Also, it is increasing—and indeed, strictly increasing—on Rj = [0, 0). 





Formal definitions: 
Definition 73. Let f be a nice function with domain D. Suppose S ¢ D. Then f is 
(a) Increasing on S if for any a,b¢€ S with a< b, we have f (a) < f(b). 
(b) Strictly increasing f(a)< f (0). 
(c) Decreasing f (a) Ftd). 
(d) Strictly decreasing F(a Gib). 
If f is increasing on D, then f is an increasing function. 


If f is strictly increasing on D, then f is a strictly increasing function. 


If f is decreasing on D, then f is a decreasing function. 





If f is strictly decreasing on D, then f is a strictly decreasing function. 
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Example 334. Consider the function g: R->R defined by g(x) = 2°. 


Figure to be 
inserted here. 


The function g is increasing on R, because if we pick any a,b € R with a < b, then by 
informal observation,’©? we have 


a or g(a) <g(b). 


Since g is increasing on its domain R, it is an increasing function. 


Indeed, by informal observation, we also have 
S008 
a’ <b or g(a) <g(b), 
so that g is also strictly increasing on its domain R and is a strictly increasing function. 


Fact 46. Let f be a nice function and S ¢ D = Domain f. 


(a) If f is strictly increasing on S, then it is also increasing on S. 





(b) If f is strictly decreasing on S, then it is also decreasing on S. 


Proof. (a) Suppose f is strictly increasing on S. Then by Definition 73(a), f (a) < f(b) for 
all a,bée S with a< b. So, it must also be true that f(a) < f(b) for all a,be S with a< b. 
Hence, by Definition 73(b), f is increasing on S. 


(b) Similar, omitted. 














Fact 46 says that “strictly increasing implies increasing” and “strictly decreasing implies 
decreasing”. However, the converses of these two statements are false, as the next example 
shows: 





'62Formal proof that g is strictly increasing (and hence also increasing): 
Suppose 2; < £2. Observe that x? - #3 = (41 - 72) (aj + 2122+ mS): We shall prove that x? - 73 < 0. 
1 1 il 
Next, 27+ 21%)+ 273 = = (2a; + 2% Xo + 223) aa (vi +034 07 429122 + v3) Bs Ee +024 (2, + x2) |. 
Each of the three terms in brackets is non-negative. Moreover, at least one of x; or %9 is non-zero, 
so that at least one of x? or x3 is (strictly) positive. Hence, x7 + 1172 + x5 > 0. 
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Example 335. Consider the function 7: R > R defined by i (a) = 1. 


Figure to be 
inserted here. 


The function 2 is increasing on R, because if we pick any a,b € R with a < b, we have 
ial or i(a) <i(b). 


However, 7 is not strictly increasing on R, because if we pick any a,b «€ R with a < b, we 
have 


ie aa or Laat b), 
Similarly, 7 is decreasing on R, because if we pick any a,b ¢ R with a < b, we have 
ies or i(a) >7(b). 


However, 7 is not strictly decreasing on R, because if we pick any a,b € R with a < b, we 
have 


17 1 or i(a) i(b). 


Altogether, 7 is both an increasing function and a decreasing function. However, 7 is 
neither a strictly increasing function nor a strictly decreasing function. 


Exercise 112. Consider the function j : R > R defined by j (x) = x". 


Figure to be 
inserted here. 


Explain why j is (a) strictly decreasing on (—00, 0]; (b) strictly increasing on [0, 00). 


Hence, conclude (fill in the Ae (c) By , J is also decreasing on 
and increasing on (Answer on p. 1776.) 
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20. Arithmetic Combinations of Functions 


Example 336. Define f,g:R—>R by f (x) =7r+5 and g(x) =x". Let k=2. 
Then we can also define these six functions: 
1. (f+9):R-R by (f +9) (2) =f (z)+9(x) =7a+5+2°; 
2. (f-g):R-R by (f-9) (2) =f (@)-g(#) =7a+ 5-2"; 
3. (f-g):R-R by (f-9) (x) =f (@) g(a) = (7a +5) 2°; 
4 (kf): R-R by ChE (@) = hy (= 2 (a 45) = 14274 10; 





(4): Rs {0} +R by (A) - Le) _ Tes, 


6. (ft+k):R-R by (f +k) (2) =f (v7) +k=7r+54+2=7r+7. 





For lack of any standard names, I'll call the above six functions the sum, difference, 
product, constant multiple, quotient, and translated functions, respectively. 


Evaluating each of these six functions at 1, we have 


1. (f+g)(1) =7-14+5419 = 13; A, (hfy(1) = i4el + 10 = 24 
Lida 


B 2 


gC Beye Sra eese een 5. (Z)a) 


SC ona ( 145) 6. (f+k)(1) =7-1+7=14. 


By the way, as usual, parentheses can be helpful for making things clear. Here for 
example, they make clear that f +g is a single function—it’d be confusing and unclear if 
we wrote f + g(1) instead of (f + 

he ah he 


Note the domain of each of the above six functions: 
For kf and f +k, the domain is simply Domain f. 


For each of f +g, f-g, and f-g, the domain is Domain f nm Domaing (i.e. the set of 
numbers that are in both the domains of f and g). 


For f/g, it’s a little trickier figuring out what the domain is. Observe that g(0) = 0. 
And so, in order for f/g to be well-defined, we must restrict the domain by removing 
the element 0. Otherwise, (f/g)(0) would be undefined and f/g would fail to be a 
well-defined function. Thus, the domain of f/g is 


Domain fn Domaing \ {x: g(x) =0}=Rv {0}. 


In words, the domain of f/g is the set of numbers x that are in both the domains of f 
and g, except those for which g(a) equals zero. 
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Example 337. Define h,i:[-1,00) > R by h(x) =x+1 and i(z) = Vr+1. Let 1=5. 
Then we can also define these six functions: 

1,co) > R by (Ah +72) (@) =A(z) 44(2) =24+14V241; 
1,00) > R by (h-7) (a) =A(x)-i(a) =24+1-Ve+1; 
1,00) +R by (h-t)(x) =h(2)i(x) = (4@+1)V 241; 
l,o)>+Rby (lh) (2) =lh(x) =5(e4+1) = 5x45; 

h(a) _ xrt+1 

6. (h+1):[-1,00) >R by (h41) (a) =h(x) +l axv4+1+5=2+6. 


1. (h+i): 
2. (h-i): 
(h-i): 
(Ih) : 


(1,00) =R by (2) (e) = 


[- 
[- 
[- 
[- 





Evaluating each of these six functions at 1, we have 
1. (A+i)(1) =14+14+V14+1=2+ V2; A. (ih)(1) =5(1+1)=10; 
Ce tte eee 5. (=) eee 
3. (h-i)(1) = (141) V141=2Vv2; 6. (h+1)(1) =14+14+5=7. 


Again, note the domain of each of the above six functions: 
For each of /h and h+1, the domain is simply Domain h. 


For each of h +i, h-i, and h-i, the domain is Domainhn Domaini (the set of numbers 
that are in both the domains of h and 7). 


But for h/i, it’s again a little trickier figuring out the domain. Observe that i(-1) = 0. 
And so, in order for h/i to be well-defined, we must restrict the domain by removing 
the element 0. Otherwise, (h/7) (0) would be undefined and h/i would fail to be a well- 
defined function. Thus, the domain of h/7 is 


Domainhn Domaini \ {x : 1 (x) = 0} = [-1, 0) \ {-1} = (-1, 0). 


In words, the domain of h/i is the set of numbers x that are in both the domains of h and 
i, except those for which i (a) equals zero.'°? 








'63Here the sharp student may wonder, “But isn’t Vx+1 perfectly well-defined for all x € [-1, 00) and, in 
particular, even for x = -1? So couldn’t the domain of h/i instead be [-1, 00)?” 
Great point! But the thing is, we really want to think of (h/i) (x) as being equal to h(a) divided 
by i(x) (without doing any simplification beforehand and without thinking of h/i as being simply the 
“formula” Va +1). So, if ¢(x) is undefined, then we want (h/i) (x) to also be undefined. 
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Formal definitions of the six functions: 


Definition 74. Let k¢ Rand f: A> B and g:C > D be nice functions. Define! 


1. The sum function (f +g): ANC >R by (f +g) (2) =f (x) + 9(2); 

2. The difference function (f -g): ANC >R by (f -g) (x) =f (x) -g (2); 

3. The product function (f-g): ANC >R by (f-g) (x) = f (x) - g(x); 

4. The constant multiple function (kf): A—> R by (kf) (x) =kf (2); 

5. The quotient function (4) >:AnC \ {x: g(x) =0} > R by (£) (a= fz). 
g g(x) 

6. The translated function (f +k): A—>R by (f +k) (2) = f (2) +k. 


Exercise 113. Let k = 2 and 1=5. Define (Answer on p. 1777) 


f:R-R by f (x) =7r+5, 

g:R>R by g(x) =2°, 
h:[-1,00) +R by A(z) =2+1, 
i:[-1,00) > R by i(x) =Ve41. 


Evaluate the following. 


(a) (f+9)(2) (ad) (Ag) 1) (g) (+i) G) (@)Q) 
(b) G@- AG) (Ce) GAG) = (b) @-A)G) (Kk) Gi/h) C1) 
() @AQ @ F+)M) W@W &HZ) WM @+)Q) 


(k) Write down the domain of each of the functions f+h, f-h, f-h, and f/h. Then 
define each function. 











1647> 








ve simply set the codomain of each of the six functions as R. 





One alternative would be to let the codomains of the six functions be, respectively, 
{y= f(z) +9(x):2€ ANC}, {y= f(x)-g(x):re ANC}, {y= f (x) g(x): ce ANC}, 
{y=kf(x):ceA}, {y= f(x) /g(z):xe ANC \ {x: g(x) =0}}, and {y= f(r) +k: xe A}. In 
which case, each of the six functions would be onto. 

165Unfortunately and very confusingly, a minority of writers do use fg to mean f-g. In the A-Level 
syllabus and exams and in this textbook, we will follow majority practice by insisting that fg # f-g. 
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21. Domain Restriction 


We can take a given function and create new ones through domain restrictions: 


Example 338. Define f:R—- R by f (x) = 22. 
Then the restriction of f to [1,2] is the function flys 9)" [1,2] ~ R defined by fin 2) (a) = 
f (x). 


Figure to be 
inserted here. 


Note well that the functions f and fla gy are distinct: 7 # fla a" 


Example 339. Define g: Rj > R by g(2) = Vz. 
Then the restriction of g to [2,3] is the function ae 3) [2,3] > R defined by Irs 3) (x)= 
g (2). 


Figure to be 
inserted here. 


Note well that the functions g and Irs gy are distinct: ¢= Irs 3): 


Exercise 114. For each given function, what are its restrictions to [0,1] and R*? Draw 
the graphs of the given function and also the two restrictions. (Answer on p. 245.) 


(a) xxx 


(b) xxx 





A114. 
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22. Composite Functions 


Let f and g be nice functions. Earlier we learnt to construct the product function f -g 
(read aloud as “f times g”). 


We now learn to construct a new function fg (read aloud simply as “f g”), called the 
composite function. 


The functions fg and f-g look similar but are completely different. 


Example 340. Define f,g:R—R by f (x) = 2x and g(x) = x+1. Then the composite 
function fg:R-— R is defined by 


(fg) (x) = f (g(x)) = f (2 +1) =2(a4+1) = 2242. 


(To get the composite function fg, plug g into f.) 
In contrast, the product function f-g:R-— R is defined by 


(f-9) (2) =f (2) 9 (2) = (22) (@ +1) = Oa On. 


(To get the product function f-g, multiply f and g.) 


The functions fg and f-g are different. For example, the values of each at 0 are 


(fo (Q=2x04+2=2 and (Fo) (0) =2 0° 22 < 020, 


Note that f-g = g-f, but fg # gf. We say that the product (of two functions) is 
commutative, but the composition (of two functions) is not (generally) so. 


Here for example, the composite function gf : R > R is defined by 


(of) (2) = 9 (f (@)) = g (2x) = 2a +1. 


(To get the composite function gf, plug f into g.) 
The value of gf at 0 is (gf) (0) =2x0+1=1, which is different from (fg) (0) = 2. 


Note also that for the composite function fg, we plug g into f. So for example, to compute 
fg (7), we first compute g(7) = 7+ 1=8, then compute f (g(7)) = f (8) =2-8=16. (A 
common mistake is to do the opposite because one goes instinctively from left to right.) 
Conversely, for the composite function gf, we plug f into g. So for example, to compute 


gf (7), we first compute f (7) = 2-7 = 14, then compute g(f (7)) = 9 (14) = 144+1= 15. 
(Ditto.) 
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Example 341. Define h,i: R > R by h(x) = 27-1 and i(2) = x/2. 
Then the composite function hi: IR —> R is defined by 


(hi(@)=hG(2)) =hisi2)=s27 41. 


In contrast, the product function h-2:R— R is defined by 


(i-)i@)-Ue iG) =(2 Nas 222. 


The functions hi and h-i are different. For example, the values of each at 0 are 
(hi) (0) =0?/4-1=-1 and (h-7) (0) = 0°/2- 0/2 =0. 
Again, note that h-2=7-h, but hz #ih. The composite function 7h: R > R is defined by 


ChyCa =a (aan = Ire 2 — 1/2. 


1 
= ~~, which is different from (hi) (0) =-1. 


2 1 
The value of ih at 0 is (th) (0) = = SR 





Formal definition of a composite function: 


Definition 76. Let f and g be functions with Range g € Domain f. Then the composite 
function fg is the function with 


Domain: Domain g; 


Codomain: Codomain f; and 
Mapping rule: (fg) (x) = f(g(2)). 





So, a composite function’s domain is the same as the innermost (or rightmost) function. 


And its codomain is the same as the outermost (or leftmost) function. 





The condition Range g € Domain f is important.'©° It ensures that for every x ¢ Domain g, 
we have g(a) € Domain f and hence that f (g(x)) is well-defined. 


If this condition fails, then we simply say that the composite function fg does not exist 
or is undefined: 





166 Just so you know, many writers take a different approach: They define the composite function f og if 
and only if Codomain g = Domain f. 
In contrast, in Definition 76, we use the weaker requirement Range g ¢ Domain f. (Many other writers 
also do the same as us—see e.g. MathWorld). 
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Example 342. Define f: R* > R by f (x) =Inz and g:R-R by g(x) =2+1. 


Observe Range g = R is not a subset of Domain f = R*. And so for example, 
Cie yg) ae) ee) 


would be undefined. Hence, the composite function fg simply does not exist. 


The rest of this ecample is explicitly excluded from your syllabus.'®’ Nonetheless, spending 
a minute or two reading it will earn you a better understanding of composite functions. 
(Plus, this is the sort of thing that your A-Level examiners could easily include as a 
curveball question.) 

Consider the restriction of g to (-1, 0), i.e. the function Ie wo) 
Observe that Range g| os R* is a subset of Domain f = R*. 


So, we have the composite function fol yy (-1, 00) > R defined by 


(£9,100) ) (2) =F (l-1 00) (@)) =f @+1) =n +1). 


Example 343. Define i: Rj > R by i(xz) = Vz and 7: R>R by j (x) =2-3. 
Observe Range j = R is not a subset of Domainz = Rj. And so for example, 
(ij) (-5) = ¢ (9 (-5)) = 7 (-5 - 3) =2(-8) 


would be undefined. Hence, the composite function 77 simply does not exist. 


Again, the rest of this example is explicitly excluded from your syllabus. 


Consider the restriction of j to [3, 00), ie. the function is oo)’ 
Observe that Range j|,, ae Rg is a subset of Domain? = Ro. 


So, we have the composite function idl as [3, 00) > R defined by 


(44] 5 wy) @) = # (1s 00) (2) = 5 @-3) = Ve=3. 








'67Your syllabus (p. 5) states, “Exclude ... restriction of domain to obtain a composite function.” 
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We can use a single function to build a composite function. 


Example 344. Define f:R—- R by f (x) = 22. 


Observe that Range f = R is a subset of Domain f = R. Hence, the composite function 
ff:R-—R exists and is defined by 


(ff) (2) = FCF (@)) = f (2a) = 40. 
So for example, Cy a4 and Cin Gye 2 


The composite function ff is usually simply written as f?. So, the above line can also 
be written as 


fe) =4 and Oe) 


Next, observe that Range f? = R is also a subset of Domain f = R. Hence, the composite 
function f f? = f?:.R > R exists and is defined by 


Par Gf @))-7 4aase 
So for example, oe lees and (Ga 


We also have f*, f°, etc.: 
f':R-R is defined by f* (x) = 16z, 


f°: R-R is defined by f? (x) = 32z, 





Following common practice (and also your syllabus and exams), we’ll use f? to mean the 
composite function ff, f? to mean ff’, f* to mean f f*, etc. For future reference, let’s jot 
this down formally: 


Definition 77. Let f be a function. The symbol f? denotes the composite function fo f. 
For any integer n > 3, the symbol f” denotes the composite function fo f”"!. 
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Example 345. Define g: R>R by g(x) =1-2/2. 


Observe that Range g = R is a subset of Domaing = R. Hence, the composite function 
g’: R= R exists and is defined by 





# (2) =9(9(2))=9(1- 2) =1-2F -34t 
3 


5 
So for example, G7 cance (3) = A 


4 


Next, observe that Range g” = R is also a subset of Domain g = R. Hence, the composite 
function g® : R > R also exists and is defined by 





)=1- Peet te 


9° (v) = 99° (2) = 9 (Fw) =9(5 +5 oe. 
3 


So for example, Chl Ge = ; andra? (3) = : 





Exercise 115 continues with this example. 
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Exercise 115. Continue with the above example. (Answer on p. 1778.) 
(a) Let n=4. 
(i) Explain why the composite function g” exists. 

(ii) Write down the function g”. 

(iii) Evaluate g” (1) and g” (3). 
(b) Repeat (a), but now for n = 5. 
(c) Repeat (a), but now for n = 6. 
The remainder of this question is a little harder, but is also the sort of curveball that the 
A-Level examiners might throw at you: 


So far, we have 


g° (x) -3+(-2)2 


ci ? 
G@)=. Oe, 9 (z)=54+(2)z, 
where the question marks in the second line can be filled in by your answers to parts 
(a)-(c). 
As the above suggests, it turns out that for each positive integer n, we have 


An 
g" (a)= 7 + (Cyn) x. 


You are given that a general expression for @,, is 


(d) Write down a general expression for b,, and c,,. Hence, write down a general expression 
for g (2) 





(e) Explain why as n approaches oo, each g" (az) approaches 2/3. 
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Exercise 116. For each of (a)—(d), determine if the composite functions fg and gf 
exist, then compute fg(1), fg(2), gf (1), and gf (2) (if these exist). (Answer on p. 
1779.) 


(a) Define f,g:R—R by f (x) =e* and g(x) =27 +1. 

(b) Define f,g:R\. {0} > R by f(z) =1/x and g(x) =1/ (22). 

(c) Define f :R\ {0} > R by f (x) =1/z and g:R->R by g(x) =27+1. 
(d) Define f :R\ {0} >R by f (x) =1/z and g:R—R by g(x) =27-1. 


Exercise 117. For each of (a)—(d), determine if the composite function f? exists. If it 
does, write it down and compute f? (1) and f? (2). 

(a) Define f : R—R by f (x) =e’. 

(b) Define f: R—R by f (x) = 3x42. 

(c) Define f:R—R by f (x) = 227 +1. 

(d) Define f:R* > R by f (x) =Inz. (Answer on p. 1780.) 
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23. One-to-One Functions 


Recall that a function is onto (or surjective) if every element in its codomain is “hit” at 
least once: 


Example 346. XXX 


Example 347. XXX 


We now introduce a closely related concept: A function is one-to-one (or injective) if 
each element in its codomain is “hit” at most once. More precisely, 


Definition 78. A function f is one-to-one if for all y ¢ Codomain f, there is at most one 


x € Domain f such that f (x) = y. 





Equivalently and graphically,!® 


Fact 47. (Horizontal Line Test) A function f is one-to-one if and only if no horizontal 
line intersects the graph of f more than once. 








Proof. See p. 1574 (Appendices). 











Example 348. Consider f :R > R defined by f (x) = 2? + 2z. 


Observe that f (-4) = 8 and f (2) = 8—the element 8 in Codomain f is “hit” twice.'®? So, 
by Definition 78, f is not one-to-one. 


Figure to be 
inserted here. 


Equivalently, the horizontal line y = 8 intersects the graph of f twice.‘”? So, by Fact 47, 
f is not one-to-one. 








'68Im the book Mathematics Education in Singapore (2019), an unnamed JC maths teacher is quoted as 
claiming that, 


to ‘proof’ |sic| that a function is 1-1, one uses the horizontal line test which is 
incorrect. 


The above quote is false. The horizontal line test is perfectly correct. 
The only possible objection is that the horizontal line test is informal as it relies on geometry. But even 
this objection can be dismissed because the horizontal line test can be stated and proven analytically 
(as is done here and in the Appendices). 

169Indeed, every element in Codomain f that’s greater than -1 is “hit” twice. (In contrast, -1 is “hit” 
once and every other element is not “hit” at all.) 

‘Indeed, for every c > -1, the horizontal line y = c intersects the graph of f twice. 
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Example 349. Consider g: R > R defined by g(x) = 22 +1. 


Observe that every element in Codomain g is “hit” at most once.’ So, by Definition 78, 
g is one-to-one. 


Figure to be 
inserted here. 


Observe also that no horizontal line intersects the graph of g more than once.!” So, by 
Fact 47, g is one-to-one. 


Example 350. Consider the absolute value function |-|. 


Observe that |-2| = 2 and |2| = 2—the element 2 in Codomain || is “hit” twice.’ So, by 
Definition 78, |-| is not one-to-one. 


Figure to be 
inserted here. 


Equivalently, the horizontal line y = 2 intersects the graph of |-| twice.!"* So, by Fact 47, 
|-| is not one-to-one. 





Example 351. XXX 


Indeed, every element in Codomain g is “hit” exactly once. 

“Indeed, for every c€ R, the horizontal line y = c intersects the graph of g exactly once. 

Indeed, every element in Codomain |-| that’s greater than 0 is “hit” twice. (In contrast, 0 is “hit” once 
and every other element is not “hit” at all.) 

'4TIndeed, for every c > 0, the horizontal line y = c intersects the graph of |-| twice. 

'>Nope. A one-to-many function would be one where an element in the domain “hits” “many” (i.e. more 

than one) elements in the codomain. But this violates our definition of a function—a function maps 

each element in the domain to exactly one element in the codomain. 
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Example 352. XXX 





Exercise 118. XXX (Answer on p. 255.) 
A118. 
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23.1. Two More Characterisations of One-to-One 


The following result gives two more characterisations! of one-to-one functions: 


Fact 48. The function f ts one-to-one if and only if for all x1, x2 € Domain f, 


(a) Ifa, # 2%, then f (x1) # f (x2). 
(Da) Ga) =r (en) hen a = ee. 





Proof. Let P, Q, and R be, respectively, these statements: 


e« The function f is one-to-one. 
¢ For all 21,72 € Domain f, if x1 # x2, then f (21) # f (22). 
¢ For all 21,72 € Domain f, if f (7) = f (v2), then x = 2. 
(a) The claim is “P <=> Q” is equivalent to “NOT-P <— >» NOT-Q”, which we’ll now 
prove: 
NOT-P 
—> “There exists some y € Codomain f such that there are more than 
one x € Domain f (call them x, and x2) with f (7) =y and f (x2) =y” 
—- NOT-Q 


(b) In (a), we showed that P <= Q. But by the contrapositive,'”’ Q is equivalent to R. 
Hence, we also have P => R. 














Example 353. Consider f :R > R defined by f (x) = 2? + 2z. 
Observe that —4 # 2, but f (-4) = f (2). So, by Fact 48(a), f is not one-to-one. 


Figure to be 
inserted here. 


Equivalently, f (-4) = f (2), but -4 #2. So, by Fact 48(b), f is not one-to-one. 








176 \ characterisation (of a property) is a condition that’s equivalent (to that property). 
Here, the property is “one-to-one” and the (characterising) condition is “distinct elements are mapped 
to distinct elements”. 
Earlier the horizontal line test gave a graphical characterisation of one-to-one: Again, the property was 
“one-to-one”, but the (characterising) condition there was “no horizontal line intersects the graph more 
than once”. 

'7Tf you don’t know what the contrapositive is, read Ch. 3.11. 
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Example 354. Consider the absolute value function |-|. 
Observe that -2 #2, but f (-2) = f (2). So, by Fact 48(a), |-| is not one-to-one. 


Figure to be 
inserted here. 


Equivalently, f (-2) = f (2), but -2 #2. So, by Fact 48(b), |-| is not one-to-one. 


Example 355. Consider g: R > R defined by g(x) = 2x +1. 


To prove that g is one-to-one, we can use either Fact 48(a) or (b): 


1. Suppose 271,22 € Domaing = R with x; # x9. Then 22, # 2x5 or 27, + 1 # 2%) +1 or 
g (21) #9 (x2). So, by Fact 48(a), g is one-to-one. 


2. Suppose g (71) = g(#2). Then 22, + 1 = 242 +1 or 221 = 2x2 or 21 = X2. So, by Fact 
48(b), g is one-to-one. 


Figure to be 
inserted here. 


Example 356. Consider h:R — R defined by h(x) = 2. 


To prove that h is one-to-one, we can use either Fact 48(a) or (b): 


1. Suppose 21, 22 € Domainh = R with x; # x2. Then 2? # x3 or h(a1) # h(22). So, by 
Fact 48(a), h is one-to-one. 


2. Suppose h (x,) = h(ar2). Then x? = 3 or 21 = £2. So, by Fact 48(b), h is one-to-one. 


Figure to be 
inserted here. 
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Example 357. XXX 


Example 358. XXX 


Exercise 119. XXX (Answer on p. 258.) 
A119. 
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23.2. <A Strictly Increasing/Decreasing Function Is One-to-One 


Fact 49. If a function is strictly increasing (or strictly decreasing), then it is also one- 





to-one. 


Proof. Let f be a function. 

Suppose f is strictly increasing (or strictly decreasing). 

Pick any distinct a,b € Domain f with a < b. 

Then f (a) < f(b) (or f (a) > f (b))—and, in particular, f (a) # f (0). 
Hence, by Fact 48(a), f is one-to-one. 














Example 359. XXX 


Example 360. XXX 
Unfortunately, the converse of Fact 49 is false: 


Example 361. XXX 


Happily, Fact 49 has this partial converse. ‘This converse is only partial because it adds 
two assumptions: The function is (a) continuous; and (b) defined on an interval (i.e. its 
domain is an interval). 


Proposition 4. Suppose a nice function is continuous and its domain is an interval. If 





this function is one-to-one, then it is also either strictly increasing or strictly decreasing. 














Proof. See p. 1575 (Appendices). 
Combining Fact 49 and Proposition 4, we have 


Corollary 7. Suppose a nice function f is continuous and its domain is an interval. 
Then 





f ts one-to-one f is strictly increasing or strictly decreasing. 


Example 362. XXX 
Example 363. XXX 


Exercise 120. XXX (Answer on p. 259.) 
A120. 
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24. Inverse Functions 


Example 364. Consider the function f : {PH wy > {@, )} defined by “match the 


animal to its role”: 


The function f 


Domain f Codomain f 


Its inverse function (or more simply inverse) is the function f™ : {@, )} > {PR}. 
defined by “match the role to the corresponding animal”: 


The function f+ 


Codomain f7! 
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Example 365. Consider the function g: { PR, , ww} = {@, B, y} defined by “match 
the animal to its role”: 


The function g 


Domain g Codomain g 


Its inverse is the function g”! : {@, q, )} = {FR x, ww}. defined by “match the role 
to the corresponding animal”: 


Domain g™! Codomain g™+ 
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Example 366. Consider the function h: R > R defined by h(x) = x + 2: 


Figure to be 
inserted here. 


Its inverse is the function h7!: R ~ R defined by h7! (y) = y -2: 


Figure to be 
inserted here. 


Remark 53. In the above example, we use the symbols x and y as our dummy variables 
for the functions h and h7, respectively. As usual, these being dummy variables, the 
symbols x and y can be replaced by any other variable. For example, all of the following 
would also have been perfectly correct: 


1. Define h: RR by h(©) =©+2 and h!:R-R by h' (0) =0-2. 
2. Define h: R>R by h(*) =*+2 and h':R-R by h'(€)=¢ -2. 
3. Define h: R>R by h(x) =2+2 andh!:R-R by A! (2) =2-2. 
4. Define h: R>R by h(y)=y+2 andh!:R-R by h''(y)=y-2. 


In #3, we use the same symbol x twice as the dummy variable for both h and h'!. 
There is nothing wrong with this. (Indeed, we’ve always been doing this—e.g., “Define 
f,g:R—-R by f (2) = 382 and g(a) = 52.”) 


Ditto for #4, where we use the same symbol y twice as the dummy variable for both h 
and ht. 
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Example 367. Consider the function 7: [0,1] — [0,2] defined by i (a) = 2a: 


Figure to be 
inserted here. 


Its inverse is the function i~! : [0,2] > [0,1] defined by i7! (y) = y/2: 


Figure to be 
inserted here. 





We'll formally define the inverse function only on p. 266. But for now, try this exercise: 


Exercise 121. XXX (Answer on p. 263.) 
A121. 
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24.1. Only a One-to-One Function Can Have an Inverse 


If a function isn’t one-to-one, then it does not have an inverse. Let’s see why: 


Example 368. Consider the function f : { PR, 3, wy} > {Yum, Yuck} defined by 
f (PR) = avian: (5) = Vilek, and. (w) = Vani: 


Figure to be 
inserted here. 


The element Yum in the codomain is “hit” twice. So, f is not one-to-one. 


Let’s see what goes wrong if we try to find f, the inverse of f: 


Figure to be 
inserted here. 


The “function” f~! is not well-defined because an element in the domain, Yum, is mapped 
to two distinct elements in the codomain. 


We may simply say, “f has no inverse,” or “the inverse of f does not exist,” or “the 
inverse of f is undefined”. 
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Example 369. Define g: {0,1,2} > {3,4} by g(0) =4, g(1) =4, and g(2) =3. 


Figure to be 
inserted here. 


The element 4 in the codomain is “hit” twice. So, g is not one-to-one. 


Let’s see what goes wrong when we try to find g™', the inverse of g: 


Figure to be 
inserted here. 


The “function” g™ is not well-defined because an element in the domain, 4, is mapped 
to two distinct elements in the codomain. 


We may simply say, “g has no inverse,” or “the inverse of g does not exist,” or “the inverse 
of g is undefined”. 





We’re now ready to give our formal definition of the inverse: 
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24.2. Formal Definition of the Inverse 


Definition 79. Given a one-to-one function f, its inverse function (or inverse), denoted 
f-', is the function with!” 


Domain: Range f. 
Codomain: Domain f. 
Mappme tule: g=)(2) =f (=a. 





Example 370. XXX 


Example 371. XXX 





Again, if a function is not one-to-one, then we need not bother finding its non-existent 
inverse: 


Example 372. The function f : IR > R defined by f (2) =0 is not one-to-one. 


So, f does not have an inverse. (Or, “the inverse of f does not exist or is undefined.”) 


Example 373. The function g:R > R defined by g(x) = x? is not one-to-one. 


So, g does not have an inverse. (Or, “the inverse of g does not exist or is undefined.”) 





Exercise 122. XXX (Answer on p. 266.) 
A122. 





178 Actually, Definition 79 is not quite the standard definition of an inverse function. It turns out under 
the standard definition, to have an inverse, a function must also be onto. We discuss this in Ch. 142.15 
(Appendices). 

But this isn’t something you need worry about. For A-Level Maths, Definition 79 will be perfectly good 
and will be used everywhere in this textbook except Ch. 142.15 (Appendices). 

But see n. 178. 

180Tn your A-Level exams, no distinction is ever made between the codomain and the range—the implicit 
assumption seems to be that they are the same (“usually” or “whenever we need them to be”), so that 
any function is onto (“usually” or “whenever we need them to be”). 

181T et f be a one-to-one function. By Definition 79, Domain f = Range f. If f is also onto, then Range f = 
Codomain f, so that Domain f = Codomain f. 
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24.3. A Method for Showing a Function Is One-to-One and 
Finding Its Inverse 


Let f: A> B bea function with range C’. Suppose we know f is one-to-one and its inverse 
is f-':C + A with this mapping rule: 


y=f(e) == f*y)=e. 


Now, suppose instead we do not know whether f is one-to-one. But, we’re able to find 
some function g: C > A with this mapping rule: 

y= f (2) = g(y) = 2. 
The function g looks very much like it must be the inverse of f. And so, we ask two 
questions: 


1. Is f even one-to-one? 


2. Is g the inverse of f? 


Happily, the answer to both of the above questions is affirmative: 


Fact 50. Suppose f : A > B is a function with range C. Then f is one-to-one if and 
only if there exists a function g:C > A where for every ye C, we have 


y= f (2) = g(y) ==. 


(Moreover, if g exists, then by Definition 79, it is the inverse of f.) 

















Proof. See p. 1576 (Appendices). 


As the following examples show, Fact 50 provides a method for (a) showing a function is 
one-to-one; and (b) finding its inverse at the same time. Actually, this method will allow 
us to skip the work that previously went into (a) showing a function is one-to-one. 


Example 374. Define f:R—-R by f (2) =r+1. 
Let y ¢ Range f =R. 
Since y € Range f, there must exist some x € Domain f = R such that y = f (x) or y= ax+1. 


Do the algebra to get x =y-1. 


So, let’s construct the function f~!:R > R defined by f(y) = y-1. 
Observe that for each y € Range f = R, we do indeed have 


pa) =a i O)-eay 





Hence, by Fact 50, f is one-to-one and its inverse is f~!. 
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Example 375. Define g: R > R by g(a) = 22. 
Let y € Range g = R. 


Since y € Range g, there must exist some x € Domaing = R such that y = g(x) or y = 22. 


Do the algebra to get x = y/2. 


So, let’s construct the function g7! : IR + R defined by g™! (y) = y/2. 
Observe that for each y € Range g = R, we do indeed have 


y=g(a)=20 0 == gt (y) =a =y/2. 


Hence, by Fact 50, g is one-to-one and its inverse is gt. 





In the above two examples, we were very careful in spelling out each step of the argument. 
Going forward, we'll be a little lazier and distill the argument into this Three-Step Method: 


Three-Step Method to Show a Function Is One-to-One and Find Its Inverse 


Consider the function f : A> B defined by x f (x) and with range C. 
In Witte) (a). 
2. Do the algebra to get x = f7' (y). 


3. Conclude: f is one-to-one and its inverse is the function f-' : C > A defined by 


Wee 





Let’s redo the above two examples: 


Example 376. Define f: R—->R by f(x) =2+1. 


1. Write y = f (x) =a +1. 
2. Do the algebra: x = y—-1. 


3. Conclude: f is one-to-one and its inverse is the function f-! : R > R defined by 
FY (y)ey-l. 
Example 377. Define g: R>R by g(a) = 22. 


1 Write y= g(a) =2e. 
2. Do the algebra: x = y/2. 


3. Conclude: g is one-to-one and its inverse is the function g' : R > R defined by 
g* (y) = 9/2. 





More examples: 


268, Contents www.EconsPhDTutor.com 


Example 378. Define h: R \ {0} > Rv {0} by h(a) = 1/z. 


1. Write y=) = Lia. 

2. Do the algebra: x = 1/y. 

3. Conclude: h is one-to-one and its inverse is the function h7! : R\ {0} + R\ {0} defined 
by ho" (y) = 1/y. 


By the way, observe that h =h7!. So, h is an example of a self-inverse function (more 
on this in Ch. 24.7). 


Example 379. Define i: Rj > Ro by i (x) = 2. 


. Write y =i (x) = 2". 

. Do the algebra: x = +,/y. 
Since Domain? = Rj, we have x > 0. So, we discard the negative value and are left 
with x = ,/y. 

. Conclude: i is one-to-one and its inverse is the function i! : Rj} > Rj defined by 


g* (y)=V¥- 


Example 380. Define j: Rj > Rj by j (x) = 2”. 


Write =) (=a 

. Do the algebra: x = +,/y. 
Since Domainj = Rp, we have x < 0. So, we discard the positive value and are left 
Wibhy = —/ G7, 

. Conclude: j is one-to-one and its inverse is the function j7~! : Rj > Rp defined by 


a 7 


Example 381. Define k: R* > R* by k(z) = 1/2”. 


. Write y = k(x) = 1/2’. 
. Do the algebra: « = +1/,/y. 


Since Domain k = R*, we have x > 0. So, we discard the negative value and are left 
with x = 1/,/y. 
. Conclude: k is one-to-one and its inverse is kt : R* > R* defined by k* (y) = 1/\/y. 


Example 382. Define 1: R~ > R* by I(x) = 1/2”. 


. Write y= (a) = l/z7. 
. Do the algebra: x = +1/,/y. 


Since Domain! = R’, we have x < 0. So, we discard the positive value and are left with 


a =-1//y. 
. Conclude: | is one-to-one and its inverse is 7! : R* + R™ defined by I! (y) = -1/V/¥. 
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Exercise 123. Show that each function is one-to-one and find its inverse. 
(a) a: R +R defined by a(n) = be. 

(b) 6: R >R defined by b( 

(c) c:Ri > [1,00) defined by c(x) = 27 +1. 
(d) d: Rp > [1,0) defined by d(x) = 2? +1. 
( 
(x 


an 


(e) e:Rj > (0,1] defined by e(z) =1/(2?+1). 


(f) f:Rp > (0,1] defined by f(x) =1/ (2 + De (Answer on p. 1782.) 


Exercise 124. Is each function one-to-one? If it is, find its inverse. 

(a) a:R-R defined by =o 

(b) b:R-[-1,00) defined by b(x) = 27-1. 

(c) c:Rj > [-1, 00) defined by c(x) = 27-1. 

(d) d:Rj — [-1, 0) defined by d(x) = 27-1. (Answer on p. 1782.) 
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24.4. Inverse Cancellation Laws 


Fact 51. Suppose the function f has inverse f-+. Then 


y= f(z) = f a=: 





Proof. (==> ) By Definition 79 (of inverses). 
( <= ) See p. 1576 (Appendices). 














Proposition 5. (Inverse Cancellation Laws) If the function f has inverse f~', then 
(a) f-' Cf (@)) =2 for all x € Domain f; and 
(b) f Cis (y)) =y for all y € Domain f. 





Proof. (a) Let x ¢ Domain f, y = f (x). By Fact 51 (==>), ft (y)=2 or fl (f (2)) =a. 
(b) Let y € Domain f, x = f-' (y). By Fact 51 (<=), f (x) =y or Ca (y)) =U. 














Example 383. Define f:R—- R by f (2) =a+1. 
The inverse of f is the function f~':IR > R defined by f(y) =y-1. 


And so, by the Cancellation Laws for Inverses (Proposition 5), we have 


f-4(f (2)) =2, for all x ¢ Domain f = R, 


if (ft (2) =x, for all xe Domain f =R. 


For example, { (/@))-) @)y=2and (GG) =F (Oat. 





Exercise 125. XXX (Answer on p. 271.) 
A125. 
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24.5. The Graphs of f and f7! Are Reflections in the Line y = x 


We reproduce Fact 51 from Ch. 24.4: 


Fact 51. Suppose the function f has inverse f+. Then 


y= f(z) 7 Pena 





Fact 51 yields this corollary: 
Corollary 8. Suppose the function f has inverse f-'. Then 


(x,y) is in the graph of f (y,x) is in the graph of fu". 





Hence, by Fact 37, the graphs of f and f7! are reflections in the line y = z: 


Example 384. Define f:R—- R by f (2) =a+1. 
The inverse of f is the function f~':IR > R defined by f-' (x) = 2-1. 


The graphs of f and f7! are reflections of each other in the line y = 2: 
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Example 385. Define g: R>R by g(a) = 22. 

The inverse of g is the function g! : R > R defined 
by og (a) =a/2. 

The graphs of g and g7! are reflections of each other 
in the line y = a: 








Example 386. Define h: R \ {0} > R\ {0} by h(2) = 
Vee 

The inverse of h is the function h7!: R \ {0} > R\ {0} 
defined by h™' (2) = 1/2. 


The graphs of h and h™ are reflections of each other in 
the line y = 2: 


Indeed, h = h™'. (Again, h is an example of a self- 
inverse function—more on this in Ch. 24.7). 
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Example 387. Define i: Rj > Rj by i(x) = 2”. 
The inverse of 7 is the function i! : Rj > Rj defined by i”! (x) = Vz. 


The graphs of i and i“! are reflections of each other in the line y = z: 








Example 388. Define j: Rj > Rj by j (x) = 2”. 
The inverse of j is the function j~!: Rj > Rj defined by j7' (x) = -Vz. 


The graphs of j and j~! are reflections of each other in the line y = z: 


y & 
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Example 389. Define k: R* > R® by 


kg) = ta. 


Its inverse is the function k-' : R* > Rt 
defined by k7! (x) = 1/2. 

The graphs of k and k™ are reflections of 
each other in the line y = x: 


Some observations to help graph k and k7?: 


Since 2 (1) = i= and (a), = 
1/V/1 = 1, both intersect the line y = x 
an P= (11). 

For x < 1 (left of P), & is above'®? k71. 
Hora > 1 (tishivol P), kis below? k—. 








Example 390. Define /: R° > R* by 
Vea Via? 


Its inverse is the function J7! : R* > R- 
defined by 1-1 (x) = -1/./z. 


The graphs of | and I are reflections 
of each other in the line y = x: 











Sometimes, we may know a function is one-to-one and so has an inverse; however, we may 
be unable to find the mapping rule of the inverse. In such cases, Fact 52 is very useful: 





182 l = 1 _1i. -1 1 a a 
ps. &(z)= p= gies Ok (G)- sae? 
183g. & (4) = 1/4? = 1/16 < k-1(4) = 1/V4 = 1/2. 
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Example 391. Define m:R > R by m() = 2° + 2. 


184 


Since m is strictly increasing (can you show this?),°* it is one-to-one and has an inverse 


m'!:R-R 


Unfortunately, not having learnt to solve cubic equations, we don’t know how to find 
“i 


Nonetheless, even though we have no idea what m~' (2) is, if we already have the graph 
of m, we can use Fact 52 to sketch the graph of m7!: 


IX 
Y 











In the above example, it’s actually possible to find the mapping rule of the inverse: 
m7 (x) = ala y jeer ~\/—+—. % 
2 4 27 2 4 27 


We don’t know how to show the above, because we haven’t learnt to solve cubic equations. 


In the next example, it’s tmpossible. But again, we can use Fact 52 to graph the inverse. 





184 Method 1. Suppose x, < rq. Inn. 162, we already proved that x?-23 < 0, so that x? < #3, x3+21 < 23+ 


and m is indeed strictly increasing. 
Method 2 (calculus). Since m' (x) = 3x7 + 1>0 for all x € R, m is strictly increasing on R. (We'll learn 
more about this method in Ch. 43 and Part V). 
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Example 392. Define n: R > R defined by n(x) = 2? +2. 


We can show that n is strictly increasing.!®? Hence, n is one-to-one and has an inverse 
=i 
n-:R-R. 


Unfortunately, it is impossible to write down an algebraic expression for n~' (x).'°° 


Nonetheless, even though we have no idea what n~' (2) is, if we already have the graph 
of n, we can use Fact 52 to sketch the graph of n7!: 


IN 
Y 














185 Method 1. Suppose x1 < x2. Observe that x? - x} = (2 - 2) (aj +c}r2 + xjx3 +2103 +2$). We shall 


prove that x} -— 23 <0, so that x} < ©, 7} +21, < 23+ 22 and n is indeed strictly increasing. 
Since x1 < X29, 1 -X_ <0. Next 
d y) 


1 
a; + Ono + crak +2108 + 75 = 5 (21 + Qa3xo + 2x70? + 20,03 + 2H) = 


; [(o? +03)” +af +044 2r02 (0? + 23)]. 
If x, and x2 have opposite signs, then 21 < 0 < x2, so that x2? <0 <5. So, assume 21 and 22 do not have 
opposite signs. Then each of the four terms in brackets is non-negative and at least one of xf or 73 is 
strictly positive. Hence, 2j + 2322 + 2242+ 2,03 +25 >0. Thus, 2? - 23 <0. 
Method 2 (calculus). Since n’ (x) = 52* +1 > 0 for all 2 € R, n is strictly increasing on R. (We'll learn 
more about this method in Ch. 43 and Part V). 

'86We can find algebraic expressions for the roots of polynomial equations of degree 2 (quadratic formula), 
3 (not taught in A-Level Maths), and 4 (ditto). However, it is impossible to do the same for polynomial 
equations of degree 5 or higher—this is Abel’s Impossibility Theorem. 


























277, Contents www.EconsPhDTutor.com 





278, Contents www.EconsPhDTutor.com 


24.6. When Do the Graphs of f and f~! Intersect? 


Let f be a nice, one-to-one function. By Fact 52, f and f~ are reflections of each other in 
the line y = a. 
So, if f intersects the line y = x at some point P, then f~! must also intersect y = x at P. 


Hence, any point at which f intersects y = x is also a point at which f and f~! intersect: 
Example 393. Define f: R— R by f (x) = 22-1. 


Its inverse is f-' : IR > R defined by 


i 
5 


f(e)=50+ 


The graph of f intersects the line y = x at (1,1). 
So, f and f~' also intersect at (1,1). 





Example 394. Define g: Rj > Rp by 
g(a) =a7. 
Its inverse is g7!: Rj > Rj defined by 
g (2) = Vz. 


The graph of f intersects the line y = x at (0,0) 
and (1,1). 


So, g and g™' also intersect at (0,0) and (1,1). 











For future reference, let’s jot down the above observation: 





Or, Any point at which f intersects y = x is also a point at which f intersects f~!. Or, 


joc = f (a) = f(a). 


Now, consider the converse of Fact 53. That is, consider this next statement: 
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“If f intersects f~' at some point P, then f also intersects the line y= at P.” &X 
Or, “Any point at which f intersects f~! is also a point at which f intersects y=2.” X 


Or, “f (a) = fF (a) — f (a) = a” x 


The above statement sounds perfectly plausible. But unfortunately, it is false. And even 
more unfortunately, those writing your A-Level exams incorrectly assumed it to be true in 
N2008-II-4 (iv), (Exercise 563).1°" 


Two counterexamples to the above false statement: 


Example 395. Define h: R \ {0} > Rx {0} by h(a) = 1/z. 
The inverse of h is the function h7' : R \ {0} > R \ {0} 
defined by h7' (x) = 1/z. 


Observe that h = h~'. So, h and h! share infinitely many 
intersection points. 


Of these, only (-1,-1) and (1,1) are 
on the line y = x; every other intersec- 


tion point is not. For example, h and 
h intersect at the point (3,1/3), but 
this point is not on y = 2. 











One might object that the function in the above counterexample is unusual because it is 
(a) not defined on R; and (b) its own inverse. So consider the next example: 





187Similarly, one set of published TYS answers incorrectly states, “As y = f (x) is a reflection of y = f~! (x) 
about the line y = x, the point of intersection of the two curves must meet on y = 2.” 
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Example 396. The function 7: R > R 
defined by i(x) = -a® is (a) defined 
on R (and moreover continuous every- 
where); and (b) not its own inverse. 


Its inverse is 7 : R > R defined by 
f(a) =—7s. 

The graphs of i and i intersect at 
(0,0), (-1,1), and (1,-1). But only 
(0,0) is on the line y = x. 

















So again (and as shown by the last two examples), the following statement is false: 


“If f intersects f~' at some point P, then f also intersects the line y=ax at P.” &X 
Or, fla=f'(@) = — flaj=a. v 
Nonetheless, here are two results that come kinda close: 


Fact 54. Let D be an interval and f: D>R be a continuous function with inverse f-. 


If f and f! intersect at least once, then at least one of their intersection points is on 
the line y=. 








Proof. See p. 1582 (Appendices). 











Fact 54 is illustrated by the functions f, g, and i from the last four examples. (How?)'*® 


Our second result says that by adding the weird assumption that f and f~! intersect at an 
even number of points, we can obtain the stronger result that all of the intersection points 
are on the line y = 2. 


Fact 55. Let D be an interval and f: D>R be a continuous function with inverse f-'. 


If f and f intersect at an even number of points, then all of their intersection points 
are on the line y=. 

















Proof. See p. 1582 (Appendices). 


Fact 55 is illustrated by the function g (Example 394). 


188Tn each case, the given function was continuous on an interval and intersected its inverse at least once. 
And so sure enough, in each case, the function intersected its inverse on the line y = x at least once. 





Fact 54 may also be illustrated by the functions hy and h| 


281, Contents www.EconsPhDTutor.com 


24.7. Self-Inverses 


Your A-Level examiners like to write questions about self-inverses:'®° 


Definition 80. The function f is self-inverse if 


f(f(z))=2, for all x € Domain f. 


Example 397. XXX 
Example 398. XXX 


Your A-Level examiners also like to use this result: 





Fact 56. Let f be a function and a¢ Domain f. Suppose f has inverse f~-'. Then 


f(f(a))=a f* (a) = f (a). 





Proof. (==> ) Suppose f (f (a)) =a. 

Since a ¢ Range f = Domain f, we can apply f~' to get f+ (f (f (a))) 2 7 (a). 

But by Proposition 5(a) (Cancellation Law), we also have f~' (f (f (a))) 2 f (a). 

So, by = and 2, f7 (a) = 7 (a). 

<= ) Suppose f~' (a) = f (a). 

Since f~' (a) ¢ Codomain f~' = Domain f, we can apply f to get f (f- (a)) Sf (Ff (a)). 


“— 


But by Proposition 5(b) (Cancellation Law), we also have f (f7 (a)) =a. 
So, by 2 and 4, iG (@)) =a. 














Example 399. XXX 


Example 400. XXX 


Example 401. XXX 


Exercise 127. XXX (Answer on p. 282.) 
A127. 





189See e.g. N2018/1/5, N2017/II/3, N2016/1/10, N2012/1/7 (respectively, Exercises 531, 536, 539, 551). 
Indeed, N2012/1/7 explicitly uses the term self-inverse. 
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24.8. If f Is Strictly Increasing/Decreasing, Then So Is f7! 
In Ch. 23.2, we learnt that if a function f is strictly increasing (or decreasing), then it is 
also one-to-one—and so has some inverse f~!. It turns out that f~! must also be strictly 


increasing (or decreasing): 


Proposition 6. (a) A strictly increasing function’s inverse is strictly increasing. 


(b) A strictly decreasing function’s inverse is strictly decreasing. 





Proof. (a) Let f be a strictly increasing function with inverse f~'. 
Pick’? any a,b € Domain f with a < b. 


Let c= f-'(a) and d= f~1(b), so that f (c) =a and f (d) =. Since f is strictly increasing 
and a <b, it must be that c< d. 


We’ve just shown that a <b implies f7'(a) < f7' (0), ie. f7! is strictly increasing. 
(b) Similar, omitted. 














Example 402. XXX 


Example 403. XXX 


Exercise 128. XXX (Answer on p. 283.) 
A128. 





90TF no such a,b exist, then it is vacuously true that f~ is strictly increasing. 
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24.9. When Is a Function the Inverse of Its Inverse? 


ai 
Let f be a function with inverse f~!. Suppose f~! itself has inverse (=) ae 


So, (Ff) is the inverse of the inverse of f. 


zi 
One question we might ask is this: How are f and ( 7) related? 


Here’s one possible answer: 


Fact 57. Suppose the function f has inverse f-' and (fF!) is the inverse of f-'. Then 


y= (fy f is onto. 

















Proof. See p. 1576 (Appendices). 


As discussed in Remark 54, we’ll assume you'll only ever be asked to find the inverse of a 


Si 
function that is onto. And so, by Fact 57, we may assume that f = ( a Tt) is always true 
(even though it’s actually false if f isn’t onto). 


Example 404. Define f :R—R by f (x) = 32-1. (Is f onto?)!” 

The inverse of f is the function f~!:R > R defined by f(y) = (y+1) /3. 

The inverse of f~' is the function ee : R= R defined by ( Se (2)=39—-1, 
Since f and ( f =: have the same domain, codomain, and mapping rule, f = ( ie 
Example 405. Define g: R > R by g(x) = 52+ 2. (Is g onto?)!”? 

The inverse of g is the function g~! : R > R defined by g™! (y) = (y-2) /5. 


The inverse of g™ is the function iG : IR > R defined by (g!) (2) = Sat 2. 


= =i 
Since g and (G7) ; have the same domain, codomain, and mapping rule, g = (g') 


Exercise 129. For each function given below, find (i) its inverse; and (ii) the inverse of 
its inverse. Also, is the function (iii) onto; (iv) and equal to the inverse of its inverse? 


(a) h: (0,1] > [1, 00) defined by h(x) = 1/2. 
(b) 4: (0,1] > (0,1] defined by i (a) = 2”. (Answer on p. 1784.) 








191T¢ turns out that ( 7) must exist because f~' must be one-to-one—Fact 254 (Appendices). 

192Ves, because every element in Codomain f = R is “hit” at least once. (Or equivalently, “Yes, because 
Range f = R = Codomain f.”) 

193Ves, because every element in Codomaing = R is “hit” at least once. (Or equivalently, “Yes, because 
Range g = R = Codomain g.”) 
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24.10. Domain Restriction to Create a One-to-One Function 


Again, if a function is not one-to-one, then it does not have an inverse. 


Nonetheless, given a function that isn’t one-to-one, we can always! restrict its domain to 
create a new function that is one-to-one (and so has an inverse): 


Example 406. Define f :R— RO by f (x) = 2”. 
We can show”? that f is not one-to-one and so has no inverse. 


But consider the function f\,, (i.e. the restriction of f to Ro). 
0 


We can show that f|,, is strictly increasing’®° and hence one-to-one. 


Ro 


Figure to be 
inserted here. 


The inverse of f Ri is f e : Rj > Rj defined by f . (y) =/¥- 


Exercise 130. Continue to define f :R > Rj by f (x) = 2. (Answer on p. 1785.) 
(a) In words, what is f Pee 
0 
(b) Show that f fe is one-to-one. 
(c) Find the inverse of f ae 
0 


(d) Are there any other sets S such that the function f | g is also one-to-one? 








4Given any function f, we can, in the worst case, consider its restriction to the empty set, ie. the 


function f | . Observe that the range of f | is the empty set. So, it is vacuously true that no element 
@ @ 


in the codomain of f | is “hit” more than once. Hence, f | is is one-to-one. 
@ @ 


The element 4 in its codomain is “hit” twice: f (2) = 2? =4 and f (-2) = (-2)? =4. 
Pick any distinct a,b € Rj with a <b. Then a-b< 0 and a+b > 0 (because a > 0 and b > 0), so that 
a? —b? = (a-b) (a+b) <0. 
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Example 407. Define g: R > Rj by g(2) = |zI. 
We can show?’ that g is not one-to-one and so has no inverse. 


But consider the function g|,. (i.e. the restriction of g to Rj). 
0 


We can show that g|,,, is strictly increasing’®® and hence one-to-one. 


I 


Figure to be 
inserted here. 


The inverse of f RS is g . : Ro > Ro defined by g . (y) =y. 


Exercise 131. Continue to define g: R > Rj by g(z) = |aI. (Answer on p. 149.15.) 
(a) In words, what is Ilp-? 
(b) Show that Ie: is one-to-one. 
(c) Find the inverse of gee 
0 
(d) Are there any other sets S such that the function g| g is also one-to-one? 


1 


Exercise 132. Define h: R\ {1} > Rj by h(a) = Bae 
9b = 


(Answer on p. 1785.) 


(a) Explain whether h is one-to-one. 
(b) In words, what is hl ae 


(c) Show that Pe eo) is one-to-one. 
(d) Find the inverse of P06) 

(e) In words, what is Leen 

(f) Show that loa) is one-to-one. 
(g) Find the inverse of Pl say: 


(h) Are there any other sets S such that the function h], is also one-to-one? 








‘The element 2 in its codomain is “hit” twice: g (2) = |2| = 2 and g (-2) =|-2| =2. 
*SPick any a,b € Rj with a<b. Then g],, (a) =|a|=a< 6 = |b] = gl. (0). 
0 0 
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25. Asymptotes and an Introduction to Limit Notation 


Informally, an asymptote is a line that a graph approaches (or “gets ever closer to”). 


1 
Example 408. Define f:R. {0} > R by f (x2) = —. Observe that 
z 


As x approaches —oo, f (x) approaches 0 (from below). 


Statement (I) can be rewritten in another three ways: 


The limit of f (2) as x approaches —oo is 0°. 
Asxz—>-oo, f(z) -0°. (1 jim f Gah oil) 


Statement (I) implies that 
The horizontal line y = 0 is a horizontal asymptote of the function f. 


Three remarks: 


1. The above four statements labelled (I) are en- CUI} dina 7 (2) = eo 
tirely equivalent and are perfectly formal and rig- ee 
orous. They serve as shorthand for a precise and 
formal but long-winded statement (Ch. 146.3, 
Appendices) that you need not know. For A- 
Level Maths, an informal and intuitive under- 
standing will suffice. Likewise with the terms 
limits and asymptotes. 


Graph of f 


(IV) lim f (2) = 0° 


2. The parenthetical “from 
below” and _ superscript 
minus sign are optional Horizontal asymptote 
and may be omitted. In- (I) lim f(x) =0 y =0 
deed, in A-Level Maths, T>—00 
they are usually omit- 
ted. Nonetheless, they 
should be easy to un- Vertical asymptote 
derstand and so I’ve in- r=0 
cluded them. (Rather 
than f(z) > 0, oth (II) lim f (a) = —0o 
ers may instead write 


f(a) 7 0 or f(x) t 0.) 


. We could’ve said “x approaches -co (from the right)”, but that would be redundant 
because there’s only one possible way to approach —oo. 


x 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Next, similarly observe that 
As x approaches 0 (from the left), f (2) approaches —oo. 
Statement (II) is equivalent to each of the following three statements: 


The limit of f(x) as x approaches 0 (from the left) is —oo, 


Asxz>0°, f (4%) > -oo. (ID) lim f (x) =-oo. (II) 


Statement (II) implies that 


The vertical line x = 0 is a ove) epee of the function f. 


Next, As x approaches 0 (from the right), f (2) approaches oo. 
Statement (III) is equivalent to each of the following three statements: 


The limit of f (x) as x approaches 0 (from the right) is oo. 


Asx +0, f(z) > 0. (II) lim f (x) = oo. (III) 


r>0* 
Statement (III) implies that 

The vertical line x = 0 is a vertical asymptote of the function f. 
Next, As x approaches oo, f (2) approaches 0 (from above). 
Statement (IV) is equivalent to each of the following three statements: 


The limit of f (x) as x approaches oo (from the left) is 0°. 
As x > 00, f(x) + 0°. (IV) lim f (= 0 1) 


Statement (IV) implies that 


The horizontal line y = 0 is a horizontal asymptote of the function f. 
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1 
Example 409. Define g: Rx {1} > R by g(x) = i 2. Observe that 
GG —_ 


(I) As x approaches —co, g(x) approaches 2 (from below). 
(I) The limit of g(x) as x approaches —oo is 2°. 
(I) As 7 + -00, g(a) > 2°. (I) jim g (ny=2-. 


(II) As x approaches 1 (from the left), g(a) approaches —oo. 


(II) The limit of g(x) as x approaches 1 (from the left) is —oo. 
(II) Asa —> 1°, g(x) > -00. (II) lim g (x) =—-o0. 


Figure to be 
inserted here. 


(III) As x approaches 1 (from the right), g(a) approaches oo. 
(III) The limit of g(x) as x approaches 1 (from the right) is oo. 
(III) As x > 17, g(x) > ©. (III) lim, g (7) S09, 


(IV) As x approaches oo, g(x”) approaches 2 (from above). 
(IV) The limit of g(a) as x approaches oo is 2°. 
(IV) As x > 00, g(x) > 2°. (IV) lim g(x) 5 


The horizontal line y = 2 is a horizontal asymptote of g. 


The vertical line x = 1 is a vertical asymptote of g. 
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25.1. Oblique Asymptotes 


Asymptotes need not only be vertical or horizontal—they can also be oblique (or slanted). 


| 
Example 410. Define f: R. {0} > R by f(z) = [+e 
Sb —5 


First, lim f (x) =-00 and lim f (x) = ©, so that a vertical asymptote of f is x = 1. 
t>1- z—>l1t 


(11) lim f (x) = 0* 


al ‘Oblique asymptote 
Yr 





Vertical asymptote 
ca 





As x approaches —oo, f (7) approaches x (from below). 
The limit of f(x) as x approaches —oo is x”. 
As x -o0, f(x) >a. (I) lim f(x)=a. 


Similarly, As x approaches oo, f (x) approaches x (from above). 
The limit of f(x) as x approaches oo is x*. 
As xz > 00, f (x) > a*. (I) lina (aaa, 


Either statement (I) or (II) (by itself) implies that 


The oblique line y = x is an oblique asymptote for the function f. 
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26. ‘Transformations 


26.1. y=f(x)+a 


The graph of y = f (x) +a is simply the graph of f translated (or shifted) upwards by 
a units. (If a < 0, then we have a “negative upward” shift—or more simply, a downward 
shift.) 

Example 411. Define f :R > R defined by f (x) = 2? -1. 

Translate (the graph of) f upwards by 2 units to get (the graph of) y = f (v)+2=2°+1. 


IN 
Y 





Translate f downwards by 3 units to get y = f (x) -3=2°- 4. 


When a graph is translated upwards or downwards, any y-intercepts, lines of symme- 
try, turning points, and asymptotes simply shift along with it.1% 


Here, f has y-intercept (0,-1). So, y= f(x) +2 and y = f (x)-3 simply have y-intercepts 
(0,1) and (0, -4). 


(But we can’t say anything general about how the x-intercepts change.) 








'99This assertion is formally stated as Fact 256(a) (Appendices). 
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Example 412. Define g: Rx {0} +R by g(a) = 1/z. 
Translate g upwards by 1 unit to get y=g(v)+1=1/r+1. 


Since g has horizontal asymptote y = 0 (the x-axis), y = g (x)+1 has horizontal asymptote 
a= 1 


Figure to be 
inserted here. 


Observe that with an upward or downward shift, any vertical asymptotes remain un- 
changed, because a vertical line translated upwards or downwards is simply the same 
vertical line. So, both g and y = g(x) +1 have the vertical asymptote x = 0 (the y-axis). 


The two lines of symmetry for g are y = x and y = -x. So, the two lines of symmetry 
for y = g(x) +1 are simply the same, but translated upwards by 1 unit—y = x +1 and 
ge k. 
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26.2. y=f(rt+a) 


The graph of y = f (a«+a) is simply the graph of f translated leftwards by a units. (If 
a <0, then we have a “negative leftward” shift, or more simply a rightward shift.) 


Why Jeftwards (and not rightwards as one might expect)? Because for f (7) and f (#2 +a) 
to “hit” the same value, it must be that x2 = 2%; -a. That is, x2 must be a units to the left 
of 1. 


Example 413. Define f:R— R by f (x) = 2° -1. 
Translate f leftwards by 2 units to get y= f (w+ 2) =(x+ a)" jee Shr 212s 27. 
Translate f rightwards by 1 unit to get y= f (a@-1) =(a2- Va = liege? = 32° + ono. 








When a graph is translated leftwards or rightwards, any x-intercepts, lines of sym- 
metry, turning points, and asymptotes simply shift along with it.?” 


In this example, f has x-intercept (1,0). And so, y = f(x+2) and y = f (x-1) simply 
have x-intercepts (-1,0) and (2,0). 


(But we can’t say anything general about how the y-intercepts change. ) 








200This assertion is formally stated as Fact 256(b) (Appendices). 
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Example 414. Define g: Rx {0} +R by g(a) = 1/z. 
Translate g leftwards by 1 unit to get y= g(xr+1). 


Since g has vertical asymptote x = 0 (the y-axis), y = g(7+1) has vertical asymptote 
= 


Figure to be 
inserted here. 


Observe that with a leftward or rightward shift, any horizontal asymptotes remain un- 
changed, because a horizontal line translated leftwards or rightwards is simply the same 
horizontal line. So, both g and y = g(x+1) have the horizontal asymptote y = 0 (the 
x-axis). 

The two lines of symmetry for g are y = x and y = -x. So, the two lines of symmetry 
for y = g(a +1) are simply the same, but translated leftwards by 1 unit—y = 7 +1 and 
y=-(e+1)=-a2-1. 
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26.3. y=af(x) 


Let a> 0. The graph of y = af (x) is simply that of f stretched vertically (outwards from 
the z-axis) by a factor of a. (If a <1, then the graph is compressed rather than stretched.) 


Example 415. Define f:R—R by f (x) =2?-1. 
Stretch f vertically (outwards from the z-axis) by a factor of 2 to get y = 2f (x) = 2x°-2. 


IX 
y 





Compress 2x ¢ 


Stretch 2x 


y = 2f(2) 





Compress f vertically (inwards towards the x-axis) by a factor of 2 to get y = 0.5f (x) = 
0.52% - 0.5. 


When a graph is stretched vertically, any y-intercepts, lines of symmetry, turning 
points, and asymptotes stretch along with it.22! 

Here, f has y-intercept (0,-1). So, y = 2f (a) and y = 0.5f (x) simply have y-intercepts 
(0,-2) and (0,-0.5). 

Under a vertical stretch, any xz-intercepts remain unchanged. Here, all three graphs have 
the same x-intercept (1,0). 








20lThis assertion is formally stated as Fact 256(c) (Appendices). 
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The graph of y = —f (x) is simply that of f reflected in the z-axis: 


Example 416. Define f:R— R by f (x) = 2? +3r+2. 


Figure to be 
inserted here. 


Reflect f in the x-axis to get y =—f (x) =- (2? + 3x + 2) say" = 35 =2. 





So, to get y = -af (x) (where a > 0), first reflect f in the x-axis to get y = —f (x), then 
stretch vertically by a factor of a: 


Example 417. Define f:R—R by f (x) =2?-1. 
Suppose we want to graph y = -2f (2). 
To do so, first reflect f in the z-axis to get y = —f (x) = -a? +1. 


IN 
Y 


y= —2f(2) 








Then stretch vertically by a factor of 2 to get y = -2f (x) = -227 +2. 
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26.4. y=f(ar) 


Let a> 0. The graph of y = f (ax) is simply that of f compressed horizontally (inwards 
towards the y-axis) by a factor of a. 


Why compressed inwards (and not stretched outwards as one might expect)? Because for 
f (a1) and f (ax2) to “hit” the same value, we must have x2 = x1/a. 


Example 418. Define f:R—R by f (x) =2?-1. 


Compress f horizontally (inwards towards the y-axis) by a factor of 2 to get y = f (27) = 
(2r)° -1 = 823-1. 





Compress 2x 
Stretch 2x 





Stretch f horizontally (outwards from the y-axis) by a factor of 2 to get y = f (0.5z) = 
(0.57)° - 1 = 0.1252 - 1. 


When a graph is stretched horizontally, any x-intercepts, lines of symmetry, turning 
points, and asymptotes stretch along with it.? 

In this example, f has x-intercept (1,0). So, y = f (2x) and y = f (0.57) simply have 
x-intercepts (0.5,0) and (2,0). 

Under a horizontal stretch, any y-intercepts remain unchanged. Here, all three graphs 
have the same y-intercept (0,-1). 








202This assertion is formally stated as Fact 256(d) (Appendices). 
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The graph of y = f (-x) is simply that of f reflected in the y-axis: 


Example 419. Define f:R—R by f (x) = 27 +3r+2. 


Figure to be 
inserted here. 


Reflect f in the y-axis to get y = f (-x) = (-x)’ +3 (-x) +2 = 22-3042. 





So, to get y = f (-ax) (where a > 0), first reflect f in the y-axis to get y = f (-2), then 
compress horizontally by a factor of a: 


Example 420. Define f :R—R by f (x) =2?-1. 
Say we want to graph y = f (-2z). 
To do so, first reflect f in the y-axis to get y = f (-x) = (-x)° sl=e7 =) 


IX 
Y 


Compress 2x 





y= f (-2) 


y= f (-22) 





Then compress horizontally by a factor of 2, to get y = f (-2x) = (-22r)° =]==—8y7 = 1. 
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26.5. Combinations of the Above 








Proof. See p. 1583 (Appendices). 
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Example 421. Some unknown function f is graphed below. You are told only that 
points A, B, and C are (approximately) (-1.4,0), (0.8,-1.1), and (1.4,0). 


Armed only with this knowledge, we will try to graph four equations: y = 2f (x) +1, 
y=2f (v+1), y= f (27) +1, and y= f (24 +1). 


IX 
Y 


Aw (-1.4,0) 


B x (0.8, -1.1) 





First stretch f vertically by a factor of 2 to get y =2/f (x), then translate upwards by 1 
unit to get y =2f (7) +1. 


y=2f(x)+1 








(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


First translate f leftwards by 1 unit to get y = f(x+1), then stretch vertically by a 
factor of 2 to get y =2f (x +1). 


y=2f(a+1) 








First compress f horizontally by a factor of 2 to get y = f (2x), then translate upwards 
by 1 unit to get y = f (27) +1. 








(Example continues on the next page ...) 
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(... Example continued from the previous page.) 
First translate f leftwards by 1 unit to get y = f («+1), then compress horizontally by a 
factor of 2 to get y = f (2x +1). 








Exercise 133. Continue with f from the last example. Graph the following eight equa- 
tions. (Hint: You can make use of what was already shown in the above example.) 


(a) y=-2f(x)-1. (b) y=2f (-x) +1. (Answers on p. 1788.) 
(c) y=-2f (#+1). (d) y=2f (-r+1). (Answers on p. 1789.) 
( 
( 


(e) y=—-f (2z) +1. (f) y=f (-2x) +1. Answers on p. 1790.) 
Answers on p. 1791.) 


(6) weap (2a 1). (yy a (2231). 
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26.6. y=|f(2)| 


Given the graph of f, we can easily graph y = |f (x)|: 


1. Where f > 0 (i.e. above the z-axis), the two graphs coincide. 
2. But where f <0 (i.e. below the x-axis), they’re reflections of each other in the x-axis. 


Example 422. Define f:R—R by f (x) =2?-1. 
The red dotted graph is of y =|f ()|. 


Where f <0, the two Where f > 0, the two 
graphs are reflections of graphs coincide. 
each other in the x-axis. 








Example 423. Define g: Rx. {0} > R by g(a) = 1/z. 
The red dotted graph is of y =|g(«)|. 


Where g > 0, the two 
graphs coincide. 


ee ee 





Where g < 0, they’re 
reflections of each 
other in the x-axis. 
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26.7. y= f(|2\) 


Given the graph of f, we can easily graph y = f (|z|): 
1. Where x > 0, the two graphs coincide. 


2. Where x < 0, they’re reflections of each other in the y-axis. 


Example 424. Define f:R—> R by f (x) = 2?-1. The red dotted graph is of y = f ((z]). 


IX 
Y 


1. The right portions coincide. 
2. To get the left portion, 
simply reflect the right 
portion in the y-axis. 





Example 425. Define g:R\ {0} +R 
by oe = a 
The red dotted graph is of y = g (|a|). 


y = 9 (al) 





J 1. The right portions coincide. 


2. To get the left portion, 
simply reflect the right 
portion in the y-axis. 
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1 
26.8. — 


; 


Given the graph of f, we can easily graph 1/f: 


1. Small becomes big (“0 becomes +oo”—z-intercepts become vertical asymptotes). 
2. Big becomes small (“+00 becomes near 0”—vertical asymptotes become z-intercepts). 


3. The two graphs intersect wherever f (x) = +1 (because 1/f (x) = +1 too). 
4. y-intercept (0,a) becomes y-intercept (0, 1/a). 


Example 426. Define f:R— R by 
f (c)=2°-1. To graph 1/f, 


1. Small becomes big: 


e Where f > 0°, 1/f > -o. 
¢ Where f > 0*, 1/f > 00. 
¢ So, x =1 is a vertical asymptote of 1/f . 


Horizontal asymptote ( (v2 ; 1) 





Se 
a 


Vertical 
asymptote 
ae 





2. Big becomes small: 


(a) Where f > -00, 1/f +0. 
(b) Where f > co, we have 1/f > 0°. 
(c) So, y =0 is a horizontal asymptote of 1/f. 


3. Intersect at f (2) =+1: So, f and 1/f intersect at (V2, 1) and (0,-1). 


4. y-intercept (0,-1) becomes y-intercept (0,1/-1) = (0,-1) (here it so happens that 
the y-intercepts are the same, but this won’t generally be the case). 





305, Contents 


www.EconsPhDTutor.com 


Example 427. Define g: R > R by g(x) = x7 +2. To graph 1/g, 


. Small becomes big: Not applicable here. 

. Big becomes small: 
e Where g > 00, 1/g > 0°. 
¢ So, y=0 is a horizontal asymptote of 1/9. 

. Intersect at g(a) = +1: But here there is no x at which g(x) = +1. So, g and 1/g do 
not intersect at all. 


. y-intercept (0,2) becomes y-intercept (0, 1/2). 


(0, 0.5) 


/ co ameoaeeee st Ie 





> 
Horizontal asymptote x 


y =0 
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Exercise 134. Graphed below is some unknown function f. You are told only that 
points A, B, and C are (approximately) (-1.4,0), (0.8,-1.1), and (1.4,0). Graph (a) 
y = |2f (2x)|; and (b) y= f (jx -1]) +2 (Answer on p. 1792.) 








Exercise 135. Describe a sequence of transformations that would transform the graph 
of 


(Answer on p. 1793.) 
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27. Reflection and Symmetry for Functions 


Everything we learnt in Ch. 16 about the reflection and symmetry of graphs also applies 
to the graphs of functions. 


Definition 81. Let f be a function and G be a graph. We say that f is symmetric in G 


if the graph of f is symmetric in G. 





27.1. (Almost) No Function Is Symmetric in a Horizontal Line 


Recall that a function must pass the vertical line test. So, not surprisingly, it’s (almost) 
impossible for a function to be symmetric in a horizontal line. 


Example 428. Graphed below are two functions f and g, and two horizontal lines y = 1 
and y = 0. 








(a) Define f: R—R by f (x) =1. 


(i) “Obviously”, f is symmetric in the horizontal line y = 1. 
(ii) f is not symmetric in y = 6 or any other horizontal line. 


(b) g is not symmetric in y = 1, y =}, or any other horizontal line. 
The function f illustrates the one and only (unusual) way by which a function could be 


symmetric in a horizontal line—when the graph of the function is identical to (or a subset 
of) the horizontal line. 
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More formally and generally, 





Proof. Let F' be the graph of f. 


(a)(i) By Fact 40, the reflection of any (z,a) € F in y =a is (x, 2a-a) = (2,a), which is 
also in F’. So, by Fact 43, f is symmetric in y = a. 


For a formal proof of (a) (ii) and (b), see p. 1585 (Appendices). 
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27.2. (Almost) No One-to-One Function Is Sym. in a Vert. Line 


Recall that a one-to-one function must pass the horizontal line test. So, not surprisingly, 
it’s (almost) impossible for a one-to-one function to be symmetric in a vertical line. 


Example 429. Graphed below are two one-to-one functions f and g, and two vertical 
linés a= 1 and & = 6, 


Figure to be 
inserted here. 


(a) Define f: {1} + R by f (1) =2. 


(i) “Obviously”, f is symmetric in the vertical line x = 1. 
(ii) f is not symmetric in x = 0 or any other vertical line. 


(b) g is not symmetric in x = 1, x =, or any other vertical line. 
The function f illustrates the one and only (unusual) way by which a one-to-one function 


could be symmetric in a vertical line—when the graph of the function is a single point 
on the vertical line. 





More formally and generally, 


Fact 60. Let ae R and f be a nice, one-to-one function (whose domain is not empty). 
(a) Suppose Domain f = {a}. Then 


(i) f is symmetric in the vertical line x =a; and 
(ii) f is not symmetric in any other vertical line. 


(b) Suppose Domain f contains more than one point. Then f is not symmetric in any 
vertical line. 





Proof. Let F be the graph of f. 

(a) Given Domain f = {a}, we have F = {(a, f (a))}—.e., the graph of f consists of only 
the single point (a, f (a)). 

(a)(i) By Fact 39, the reflection of (a, f(a)) in x =a is (2a-a, f (a)) = (a, f (a)), which 
is also in F’. So, by Fact 42, f is symmetric in x =a. 

(a)(ii) Let b # a, so that 2b-a#a. By Fact 39, the reflection of (a, f (a)) in the vertical 
line x = b is (2b-a, f (a)) # (a, f (a)), which is the only point in F’. So, by Fact 42, f is 
not symmetric in x = b. 


(b) See p. 1586 (Appendices). 
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27.3. An Even Function Is Symmetric in the Vertical Axis 


Definition 82. A nice function f is called an even function if f(a”) = f(-2) for all 
x € Domain f. 


Example 430. The function f : R > R defined by f (x) = x? is even because for every 
x € Domain f = R, 


f (@) = 2° = (-a)’ = f (-2). 


Example 431. The absolute value function |-|: IR > R is even because for every x > 0, 
It] = 2 =|-a]. 
And similarly, for every x < 0, 


le] = -@ = |-al. 





Fact 61. Suppose f is a nice function. Then f is even <=> f is symmetric in the 
y-axis. 





Proof. See p. 1586 (Appendices). 











Example 432. The function f : R > R defined by f (x) = x? is both even and symmetric 
in the y-axis: 


Figure to be 
inserted here. 


Example 433. The absolute value function |-| : R > R is even and symmetric in the 
y-axis: 


Figure to be 
inserted here. 
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Example 434. It turns out that for every positive even integer n, the function f, : R—> R 
defined by f,, (2) = x” is even (and hence also symmetric in the y-axis): 


Since n is a positive even integer, for each x € R, f, (x) = 2" = (-1)" x” = (-x)” = fy (-2). 


Figure to be 
inserted here. 





As we'll see shortly, the cosine function is another example of an even function. 
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27.4. An Odd Function Is Symmetric about the Origin 


Definition 83. A nice function f is called an odd function if —f (x) = f (-x) for all 
x € Domain f. 


Example 435. The function f : R > R defined by f (x) = 2° is odd because for every 
x € Domain f = R, 


-f (x) = -2° = (-2)" = f (-2). 


Example 436. The function g : R > R defined by g(x) = x is odd because for every 
x € Domain g = R, 


~f ()=-« = f (-2). 


Fact 62. Suppose f is a nice function. Then f is odd <=> f is symmetric about the 
origin. 


Proof. See p. 1586 (Appendices). 

















Example 437. The function f :R > R defined by f (x) = x is both odd and symmetric 
about the origin: 


Figure to be 
inserted here. 


Example 438. The function g: R > R defined by g(x) = x is both odd and symmetric 
about the origin: 


Figure to be 
inserted here. 
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Example 439. It turns out that for every positive odd integer n, the function f, : R— R 
defined by f,, (2) = x" is odd (and hence also symmetric about the origin): 


Since n is a positive odd integer, for each x € R, —f, (x) = —x”" = (-1) (-1)""' x” = (-2)" = 


In( 2). 


Figure to be 
inserted here. 





As we'll see shortly, the sine and tangent functions are also examples of odd functions. 
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28. In, exp, and e 


In secondary school, your teachers probably went in this order: 


1. First introduce Euler’s number e = 2.718 281 828459... 


2. Then define the natural logarithm In as the logarithm with base e. 
In this textbook, we’ll go the other way round. We’ll 


1. First define the natural logarithm function In. 


2. Then define Euler’s number e as being the number such that Ine = 1. 


dll 


This may seem strange but will pay off in Part V (Calculus). 


Definition 84 (informal). The natural logarithm 
function In: R* > R maps each t € R* to 


Int = Area bounded by the graph of 
y = 1/z, the x-axis, and the 


vertical lines 7 = | and x =f. 


Int is defined as 
this shaded area 











The above definition is considered informal because the mapping rule is described using 
geometry (and in particular, with reference to an “area” bounded by a curve and three 
lines). After we’ve learnt about the definite integral in Part V (Calculus), we’ll give Defini- 
tion 229—a formal definition of the natural logarithm function that supersedes the above 
informal definition. 
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Example 440. We have In4 = 1.386... and In5 = 1.609... 


In4= 1.386... 











Nha sr IO 








Note that right now, we don’t yet know how to calculate In4 or In 5. 


One possibility is to draw an extremely precise and large graph of y = 1/2 on some graph 
paper, then slowly count up the squares. This sounds like a ridiculous idea, but as we'll 
learn in Part V, it isn’t too different from how integration is actually done. 
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Example 441. By definition, In1 is the area under y = 1/xz, between x = 1 and x = 1. 
Hence, In 1 = 0. 


In 1 =0 because 
there’s no area! 








Note that if «<1, then Inz is negative: 


Example 442. In0.9 =-0.105... 


ro =o. 
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Example 443. In0.5 = -0.693... 











Note that since Domain In = R*, if x < 0, then Inz is simply undefined: 


Example 444. In0 and In (2.5) are undefined. 
Below is the graph of In. 
e Since In1 = 0, the x-intercept of In is (1,0). 
¢ For x€ (0,1), Inz<0. 
¢ For x <0, Inz is simply undefined. 


IN 
Y 








As x > 0*, Ina > -oo. 
Thus, In has the vertical 
asymptote x = 0 (also the y-axis). 
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28.1. The Exponential Function exp 


For x > 0, the graph of y = 1/2 is strictly positive. But In is defined as the area under the 
graph of y = 1/2. So, In is strictly increasing. Hence, by Fact 49, In is one-to-one and has 
an inverse. 


Now, we don’t know what exactly this inverse function is, but we know it exists. So, let’s 
simply name it the exponential function: 





By Definition 79 (of inverse functions), 


1. Domain exp = Range In = R; 
2. Codomain exp = Domain In = R*; 


3. Mapping rule: y=Inz = > expy=vz. 
Example 445. Earlier, we saw that 
In4d*« 1.386, Ind*#1.609, Inl=0, In0.9#-0.105, In0.5 » -0.693. 


Since exp is the inverse of In, we have 


exp1.386%4, exp1.609*5, exp0=1, exp(-0.105)#0.9, exp (-0.693) # 0.5. 





Since exp is the inverse of In, by Fact 52, re 
their graphs are reflections of each other in y ye 
the line y = a: 


We next introduce Euler’s number e. 
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28.2. Euler’s Number 


Definition 86. Euler’s number, denoted e, is the number that satisfies 


Ine=1 or equivalently, e=exp 1. 


And so by definition of In, Euler’s number?” e is the num- 
ber such that the area under y = 1/7, between x = 1 and x=e 
is equal to 1. 












Right now we have no idea how to compute e. But Ill cheat 
by telling you that 


e= exp 1 # 2.718 281 828459... 







We define e so 
that this area is 1. 








> 
1 e Ve 





In secondary school, you were probably taught that e” is simply another way to write exp x. 
It turns out that this is actually a result that must be proven using the definitions of the 
exponential function (given above) and exponentiation: 


Fact 63. For every x € R, e* = expz. 


Proof. See p. 1726 (Appendices). 














So, it is not that e” is simply another way to write exp. Instead, it takes some work to 
prove that e” = exp x—i.e. that the number e raised to the power of x is equal to the value 
of the exponential function at 2. 








°3Tt was Leonhard Euler (1707-1783) himself who first used the letter e to denote this number. Presumably 
he did not do this to honour himself. Calling e Euler’s number is simply an honour conferred by posterity. 
Confusingly, there’s also another number called Euler’s constant y ~ 0.577 215664... that, fortu- 
nately, we will not encounter in A-Level maths. W 

?0Indeed, one mathematician Walter Rudin (Real and Complex Analysis, 1966 [1987, 3e, p. 1]) wrote that 
the exponential function “is the most important function in mathematics”. 
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We also have these two lovely results: 


1 ra eee | 


i 
Theorem So i a 

















Proof. Happily, we'll learn to prove this in Part V (Calculus)—see p. 1007. 


1 n 
Theorem 6. e = lim (1 ee - | : 
n— co n 











Proof. Happily, we'll learn to prove this in Part V (Calculus)—see p. 1151. 





Although we can’t prove either of the above theorems right now, we can nonetheless nu- 
merically “verify” that they are plausible: 


il 1 
To “verify” Theorem 5, define f:Zj >R by f(n)= ai eee. 


n! 
Then write 
Lt a il 1 es 

F(0)= Geel F(3) = f(2)+ 5) = 2.546 = 2.6. 

1) =F 0 Dea EG A= 73 EG | o716 
Dat) aire: F(4) = f(3) + a = 2.64 57 = 2.716. 

2)= 7th 20 E56 bye 7 (4 | 9.716 : =2,/18 
F(2)=f()+5 = 24+ 5 = 25. F(5) = f(4) + 5 = 2.716 + 755 = 2.718... 


We see that f rapidly converges towards e = 2.718 281 828459... By f (6), we have e correct 
to three decimal places. Lovely. 


1 n 
Similarly, to “verify” Theorem 6, define g:R->R by g(n)= (1 + - | ; 
n 





Then write 

il 1 1 10 

g(1) = (145) =o. g(10).= (1+ = = 0503747 2: 
1 2 i] 100 

g(2) =(1+5] 2995. g (100) = (1+) = 2.704813... 
if 3 _ il 1000 

9(3) = (145) =(9.6. (1000) = (1+) 22,716 923 ie: 
1\4 _ 1 \10° 

g(4) = (145) =2.7083 g (10°) = (1+ 5] = 2118 280 xs 

5 _ 1 \10° 
9(5) = (1+ 5] =2:716: g (10°) = (1+ <5] = 2-718 281.4: 


We see that g also converges towards e = 2.718 281828 459..., though much less rapidly 
than f—for example, g (100) gets e correct only to two decimal places, while even g (10°) 
gets e correct to only five decimal places. 
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29. Trigonometry: Arcs (of a Circle) 


This and the next few chapters will cover trigonometry. You’ll already have seen a lot of 
this in secondary (and even primary) school. But as usual, we’ll go into slightly greater 
depth. 


Example 446. Graphed below is a circle containing the points A and B. (Recall that a 
circle is simply a set of points.) 


205 


Informally, an arc of a circle is any connecte subset of that circle. 


But suppose we simply said that the arc AB is the subset of the circle connecting the 
points A and B. Then we’d be faced with an ambiguity: Are we referring to the red arc 
or the blue arc? 


The arc AB 


The arc BA 


To resolve this ambiguity, we adopt the convention that we “go anticlockwise” from the 
first point to the second. So, 


e Arc AB “goes anticlockwise” from A to B. 


e Arc BA “goes anticlockwise” from B to A. 


Definition 87 (informal). Given two points A and B on a circle, the arc AB (of the cir- 
cle) is the set of points on the circle when we “go anticlockwise” from A and B (including 
the two points A and B). 








205The term connected can actually be formally defined but as usual, an intuitive understanding will suffice. 
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Remark 62. Some writers use the word arc to mean any “smooth” curve. 


But we'll use arc only in the context of circles. So, in this textbook, arc will always refer 
to a connected subset of a circle. 


Remark 63. ‘The usual western or European convention in most matters is to go clockwise 


(when viewed from above), be it with clocks (which began as sundials in the northern 
ae 


hemisphere or card games. (In contrast, mahjong is usually played anticlockwise. ) 


It may thus seem a bit puzzling that here we (or rather European mathematicians from 
centuries ago) go anticlockwise. Later on, when we learn about the unit circle definitions 
of sine and cosine, we’ll see how this convention might have come about (see p. 350). 


In any case, this is to some extent an arbitrary convention that doesn’t really matter. 
Just like which side of the road we drive on, what matters is that we are all aware of, 
agree to, and stick to the given convention (Remark 23). 


Exercise 136. Name the labelled arcs using the labelled points. | (Answer on p. 323.) 


Figure to be 
inserted here. 





A136. 





206Sundials in the southern hemisphere go anticlockwise. 
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29.1. The Length of an Arc 


Example 447. By Fact 29, a circle with radius r has circumference (or length) 27r. 


|AB| = mr/2 


Circumference= 27tr 


Note that like a line, an arc is a one-dimensional object. (In contrast, a point, an 
area, and a volume are, respectively, a zero-, two-, and three object.) And similar to line 
segments, given an arc AB, we'll denote by |AB] its length. 


Here we may repeat the warning given earlier in Remark 34: So far, we’ve neither learnt 
how to compute nor formally define the length of any curve (other than the special case 
where the curve is a line segment). So here, we don’t (yet) know how to compute |C'D]. 


We can however tell that, “clearly”, |AB| = 7r/2 and |AC| =r. 
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30. ‘Trigonometry: Angles 


You’ve known about angles since primary school. But it turns out that formally and 
precisely defining angle is hard and will be done only in Part IV (Vectors) (Definition 145). 


One informal and intuitive way to understand angles is to think of an angle as the anti- 
clockwise rotation that a ray?’’ must undergo to coincide with another: 


Example 448. Let A, B, C, and D be points on a circle with centre O. 


The angle a is the “amount” the ray OB must rotate anticlockwise to coincide with 
the ray OA. 


We'll also call a the angle subtended by the arc AB.? 


Similarly, the angle @ is the “amount” the ray OD must rotate anticlockwise to coincide 
with OC. 


We'll also call 6 the angle subtended by the arc CD. 








"Recall that a ray is, informally, a “half-infinite line” (see Ch. 8.6). 

208“ Amount” is in scare quotes because it’s an imprecise term. So too is the notion of “anticlockwise 
rotation”. 

209For this textbook’s formal definition of the angle subtended by an arc, see Definition 291 (Appendices). 


325, Contents www.EconsPhDTutor.com 


30.1. An Informal Definition of Angles 


Definition 88 (informal). Suppose a circle of radius r with centre O contains the points 
A and B, so that the length of the arc AB is |AB|. Then the angle between the rays OA 
and OB or the angle subtended by the arc AB is this number: 


|AB| 


tf 


Figure to be 
inserted here. 


Example 450. Suppose r = 2, |AB| = 2, and |CD| = 3. Then by Definition 88, the angles 
a and 6 are 


AB| 2 
ye AB 2 Ly B 


r r iN 


_ICD| _3 


Figure to be 
inserted here. 





By the above definition, an angle is defined as the ratio of two lengths. And so, 
an angle is a unitless, dimensionless, or “pure” number. 


Hence, strictly speaking, angles should not have units. Nonetheless and perhaps slightly 
confusingly, angles are given units. 


For example, in primary school, you learnt to use degrees (°), following the Babylonians, 
who assigned to the full angle the value of 360°. 





20 This definition is labelled “informal” because it makes use of |AB|, a number we don’t yet know how to 
compute. 
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Figure to be 


inserted here. 





In secondary school, you also learnt a second unit called the radian (rad). This unit is 
what we’re using in the above definition and is the SI unit for angles. It is also what we'll 
be using in this textbook. So, in the last example, we could also have written 








AB| 2 D 
we eae and gale aa 
i. 2 , 2 


But again, angles are unitless. So, it’s actually entirely optional whether we want to include 
the “rad” following the value of the angle. Indeed, going forward, we’ll almost always omit 
ile 


To repeat, the Babylonians assigned to the full angle the value of 360°. Let’s now work 
out what the full angle is in radians. 


By Definition 88, it is the ratio of the circle’s circumference to the circle’s radius, which by 
Fact 29 is 


Circumference 27r 9 
—_—______— = — =27. 
Radius r 


Hence, the full angle is 27t (radians). Equivalently, the full angle is the number 27t. 
Let’s now find the conversion rate between radians and degrees. Since 


Full angle = 360° = 27 (rad) , 


we must have 


360° 
27 





1 (rad) = w 57.3°. 


By Definition 88, 1 (rad) » 57.3° is the angle subtended by an arc whose length is equal to 
the circle’s radius r: 
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Going forward in this textbook, 
we'll avoid using degrees and try to use only radians. 


(Indeed and to repeat, since angles are actually unitless, we'll also almost always not even 
bother to write “radians” or “rad”.) 


One reason for this is that radians work out much more nicely in calculus: 


Example 451. If x is in radians, then 


——sin? = cos %. 
dx 


But if x is in degrees, then 





Of course, in the everyday world, laypersons are still more likely to use the degree, probably 
because it’s about four millennia older.?7!° So, you shouldn’t forget about the degree! 





210The Babylonians have been using degrees since around 2000 BC. In contrast, the term radian was 
invented only in the late 19th century. 
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30.2. Names of Angles 


Definition 89. We call the angle A 


(a) The zero angle if A = 0. 

(b) An acute angle if A € (0,7/2) = (0°, 90°). 
(c) The right angle if A = 7/2 = 90°. 

(d) An obtuse angle if A € (7/2,70) = (90°, 180°). 
(e) The straight angle if A = 7 = 180°. 

(f) A reflex angle if A € (7, 27) = (180°, 360°). 
(g) The full angle if A = 27t = 360°. 


Example 452. xxx 


Figure to be 
inserted here. 


Definition 90. Let A and B be angles. 
(a) IfA+B=7/2, then Aand B are complementary 


(and are each other’s complements). 
(b) IfA+Be=n, then Aand B are supplementary 


(and are each other’s supplements). 
(c) IfA+B=2n, then A and B are explementary or conjugate 


(and are each other’s explements or conjugates) 


Example 453. xxx 


Figure to be 
inserted here. 
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This primary-school result is sometimes called the “Z” Rule (because you can see the angles 
a and £ in the letter “Z”). 


In more formal and precise jargon from geometry, the “Z” Rule says that “alternate angles 
formed by a transversal through parallel lines are equal”.?"' 





211For a formal statement and proof of the “Z” Rule, see Fact 257 (Appendices) 
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31. Trigonometry: Triangles 


PSLE review: A triangle has three vertices (singular: vertex), sides (whose lengths are 
strictly positive), and angles (also strictly positive).?!” 


Example 454. A ABC has vertices A, B, and C; sides AB, AC, and BC (whose lengths 
are, respectively, |AB|, |AC|, and |BC]); and angles a, 3, and y, which, respectively, face 
(or are opposite to) the sides BC, AC, and AB. 


|AB| 


A vertex is simply a point. A side is simply a line segment. 


Definition 91. Let a be a triangle’s largest angle. We call the triangle 


(a) acute if a is acute; 
(b) right if a is right; 
(c) obtuse if a is obtuse; 


(d) equilateral if all three of its sides have equal length; 


(e) isosceles if two of its sides have equal length; and 
(f) scalene if no two of its sides have equal length. 








212For a more complete and formal treatment of triangles, see Definition 292 (Appendices) and what 
follows. 
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Example 455. Triangles (a), (b), and (c) are also scalene triangles. 


(b) Right 


TU 
ee 
eg 


(e) Isosceles 


(d) Equilateral 


=o; 
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31.1. PSLE Review: Some Results About Triangles 


Fact 65. The sum of any triangle’s three angles is 7. 


Proof. Informal proof-by-picture:?° 

















By Fact 64 (“Z” Rule), a=A and 6=B. So,a+6+y=A+B+ Y=. 


Definition 92. Two triangles are similar if they share the same three angles. 








Example 456. A; and Ao are similar. 4; and A3 are not. A» and A3 are not. 


ne eoastles 


Fact 66. If two triangles share the same two angles, then they are similar. 











Proof. Suppose two triangles share the same two angles a and 3. Then by Fact 65, both 
triangles’ third angle must be 7—a-—. So, they share all three angles and are similar. 














Fact 67. Two triangles are similar <=> Suppose one triangle’s sides have lengths a, b, 


and c. Then there exists k > 0 such that the other’s sides’ lengths’ are ka, kb, and kc. 

















Proof. See p. 1591 (Appendices). 





213For a formal proof, see p. 1589 (Appendices). 
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The next result says that a = 6 in these triangles: 


ka 
i ke 
C | € 
7) 
b 


Corollary 9. Let k > 0. Suppose one triangle has sides of lengths a, b, and c, while 


another has sides of lengths ka, kb, and kc. Then the angle in the first triangle facing 
the side of length c is equal to the angle in the second triangle facing the side of length c. 








Proof. See the proof of Fact 67( <= ) on p. 1591 (Appendices). 


The next result says that if a/b = c/d, 
then these two triangles are similar: d 
| 
a 6 


Corollary 10. Suppose two triangles share an angle. If both triangles also share the ratio 
of the lengths of the sides adjacent to that angle, then they are similar. 

















Proof. See p. 1592 (Appendices). 

















Fact 68. In an isosceles triangle, the angles facing the sides of equal length are equal. 


Proof. Let A ABC be an isosceles triangle with |AC| = 
|BC|. Let D be the midpoint of AB. 


By Fact 67, AADC and ABDC are similar (indeed 
identical) because they share the side CD, |AD| =|DBI, 
and |AC| =|BC]. Hence, by Corollary 9, a = £. 

















Fact 69. The three angles of an equilateral triangle are equal. 


Proof. Apply Fact 68 twice. 


Corollary 11. Each of the three angles of an equilateral triangle equals 7/3 


Proof. By Facts 65 and 69. 
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32. ‘Trigonometry: Sine and Cosine 


The two basic trigonometric (or circular) functions are sine and cosine. 


In secondary school, you probably learnt these right-triangle definitions of sine and 
cosine: 


Definition 93 (informal). Let a € (0,7t/2) (i.e. let @ be an acute angle). Construct a 
right triangle with angles a, 7/2, and 1/2-a. Let the lengths of the three sides opposite 
these three angles be o, h, and a (for “Opposite”, “Adjacent”, and “Hypotenuse”). 


(a) For we (0,71/2), we define the sine and cosine functions (denoted sin and cos) by 


: 1 
sn a@ = 


= 
h 
(b) For a =0 or a= 7/2, we define 


sin0 20, cos0 #1, 





We'll discuss 2 through © in the next subchapter. But first, let’s illustrate + and 2 with 
some simple examples: 


Example 457. XXX 


Example 458. XXX 





Example 459. XXX 
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32.1. The Values of Sine and Cosine at 0 and 1 


In Definition 93, we defined 
sin0 20, cos0 = 1, oinG 


These definitions seemed a bit arbitrary. Let’s see why they might make sense: 


Observe that as a > 0,744 030 and ah. 


a 
Since sina = o/h for all a € (0,7t/2), it makes sense to define 
0 3 
inf=— =. 
sin0=— 


And since cosa = a/h for all a € (0,71/2), it makes sense to define 


ha 


Q=—=1. 
COs i 


Similarly, observe that as a > 71/2,0>handa-0. 


a 


Since sina = o/h for all a € (0,7t/2), it makes sense to define 


h 
ins 1, 


2 2 


And since cosa = a/h for all a € (0,71/2), it makes sense to define 


~=—£9, 
COS 5 i, 





214 As in our discussion on limits, the right arrow “+” here may be read aloud as “approaches”. 
? 
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Fun Fact 


According to Katz (2009), sine means bosom or breast: 


The English word “sine” comes from a series of mistranslations of the Sanskrit 
jya-ardha (chord-half). Aryabhata frequently abbreviated this term to jya or its 
synonym jiva. When some of the Hindu works were later translated into Ara- 
bic, the word was simply transcribed phonetically into an otherwise meaningless 
Arabic word jiba. But since Arabic is written without vowels, later writers in- 
terpreted the consonants jb as jaib, which means bosom or breast. In the twelfth 
century, when an Arabic trigonometry work was translated into Latin, the trans- 
lator used the equivalent Latin word sinus, which also meant bosom, and by 
extension, fold (as in a toga over a breast), or a bay or gulf. This Latin word 
has now become our English “sine”. 
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32.2. Some Values of Sine and Cosine 


From Definition 93(b), we already know that 


v0 v4 
3° cosQ = 1 = —, 


in0=0= 
sin 5 


We now also give the values of sine and cosine at 7/6, 7/4, and 7/3: 
































Note: The following proof of Fact 70 is informal because it relies on geometry and our 


right-triangle definitions of sine and cosine: 


Proof. In the right isosceles triangle, a = 71/4 and o=a. By Pythagoras’ 


Theorem, h = Vo? + a? = V20 = V2a. 


P _ 7 
So, sina = sin — = 
4 


1 
And, cos @ = cos - 


Next, AABC is an equilateral triangle (below) 
Each of its sides has length 1. 
Let D be the midpoint of BC. 


(Proof continues below ...) 
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(... Proof continued from above.) 
A By Pythagoras’ Theorem, 
|AD| = V1? - 0.52 = V0.75 = V3/2. 


_m |AD| V3/2 V3 
50; sin — = 9—__ = = _, 








3 |AB) 1 p 
nm |BD| 0.5 JI 
A aS SS . = —, 
nd, cos 3 TAB] 1 5 
_m |BD| 0.5 V1 
Al = = = 0.5 = —. 
ee AB a 2 
AD 2 3 











6 |AB) 1 > 





& a a se ey eee Re ee eee ee ea te 


B 0.5 


Exercise 137. This Exercise will prove that 


mi 1+V5 


COs 5 i 





(a) Any polygon with n sides can be parti- 
tioned into n - 2 triangles. For example, 
a polygon with 7 sides (called a heptagon) 
can be partitioned into 5 triangles: 


(i) What’s the sum of a triangle’s three 
angles? 

(ii) Hence, what’s the sum of the angles 
of a polygon with n sides? 





(Exercise continues below ...) 
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(... Exercise continued from above.) 


(b) Now consider the regular?’? pentagon ABCDE. Each of its five sides has length 1. 


A 





(i) What is the sum of the pentagon’s five angles? 
(ii) Find 2 BAE. 
(iii) Find 2 ABE. 
(iv) The aa of a line segment BE is R and AARB is a right triangle. Show 
jm ele 


that —= 
at COB = 5 


(c) We now find |BE] and hence prove =: 
(i) Find 2 PAQ. (Hint: What sort of triangle is AABP?) 
(ii) Explain why the triangles AABQ and AAPQ are similar. 
(iii) Find |BQ|. 
i 
(iv) Let x =|AQ]. Show that — 
13 
(v) Solve for x. 
(vi) Find |BE]. 
(vii) Complete the proof of z. 
10 - 2/5 
ie 


l= 


(Answer on p. 1798.) 


(d) Bonus: Show that sinc = 








215 A polygon is regular if its sides are of equal length. 
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32.3. Confusing Notation 


In Ch. 22, we learnt that 


« f? denotes the composite function f o f; 
« f° denotes the composite function f o f?; 
e Etc. 


We’d thus expect that sin? denotes the composite function sino sin. That is, we’d expect 
“sin? ¢ = sin (sinx).” x 
But this is not the case! Very confusingly, sin? denotes the product function sin: sin: 


sin? x = (sinz)” = (sinz) (sinz). J 


Example 460. sin? : + sin (sin =| because 


p 
sin? = = (sin =) =(1)’ =1, but sin (sin | = sin 1 » 0.845. 





And in general, for any positive integer n, sin” does not denote sinosino---osin. That is, 
sin” x # sin (sin (sin (...(sinz)))). 
Instead, sin” denotes sin-sin----- sin: 
sin” x = (sinr)” = (sinz) (sinz)...(sinz). J 


This is yet another annoying and confusing bit of notation you’ll have to learn to live with. 


The above remarks also apply to cosine and the other four trigonometric functions we'll be 
looking at shortly (tangent, cosecant, secant, and cotangent). 
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32.4. The Area of a Triangle, Law of Sines, and Law of Cosines 


Here’s an identity you should find familiar from secondary school: 


Fact 71. (First or The Pythagorean Identity) Suppose 0¢R. Then 


sin? 6 + cos? 6 = 1. 





Proof. Informal proof-by-picture:7° 


Apply Definition 93 and Pythagoras’ Theorem: 


2 2 9, 9 2 
3 2n_(0 a\Ve ao t+ar he 
sin’ 6 + cos 9-(2] +(5] Bag ap 




















Proposition 7. Suppose a triangle has sides of lengths a, b, and c with angles A, B, and 
C' facing those sides, respectively. Then 


1 
(a) The triangle’s area is 500 sinc. 


a a ee 
(b) The Law of Sines: Maelo = = sinC 
a @ 


(c) The Law of Cosines: c =a’ +b? -2abcosC 





Proof. Informal proof-by-picture:7!" 


Denote the triangle’s vertices by A, B, and C’. Let D be 
the point on AC’ such that BD 1 AC’. Then 







|BD| 


D 
sin C' = —— and eo aac | 
a a 






So, |BD| = asinC, |CD| = acosC, 
and |AD| = b-acosC. 


(Proof continues below ...) 


A b-acosC D  acosC C 


b 





216This proof is also incomplete because it covers only the case where A is acute. For a formal and general 
proof, see p. 1013 in Part V (Calculus). 

217For formal proofs of (a) and (c), see pp. 1592 and 1588 (Appendices). The proof of (b) given here is 
perfectly complete and formal. 
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(... Proof continued from above.) 
(a) The triangle has base b and height asinC. Hence, its area is 0.5absin C. 


(b) We can similarly show that the triangle’s area is also 0.5bcsin A or 0.5acsin B. So, 


besinA acsinB absinCd x2+abe sinA sinB  sinC 


—_> 
2 2 2 a b Cc 


(c) Consider AABD. Its hypotenuse has length c. Its legs have lengths asinC and b - 
acosC’. Now use Pythagoras’ Theorem and the First Pythagorean Identity: 




















c= (asinC)’ + (b-acosC)’ = a? sin? C +b? — 2abcosC + a? cos? C 


= «7? (sin? C + cos? C) +b? —2abcosC = a? + b? — 2abcosC. 














We can use the Law of Cosines to prove the Triangle Inequality: 


Corollary 12. (Triangle Inequality) The length of one side of a triangle is less than 
the sum of the lengths of the other two sides. 
Proof. Consider a triangle with sides of lengths a,b,c. Let C be the angle opposite the side 


of length c. Below, é, 2 and 5 use the Law of Cosines, Plus Zero Trick, and a,b, cosC < 1: 


2 40248 -2abcosC 2 a2 + b — 2ab + 2ab — 2abcosC 


= (ab) +2ab(1-cosC) S (a-b)’. 


Now, c? > (a—)* implies c> a-b or a<b+e. 











We can similarly prove that b<a+candc<at+b. 





Fact 72. (Heron’s Formula) Suppose a triangle has sides of lengths a, b, and c. Let 





s=(a+b+c) /2.7!8 Then the triangle’s area ts /s (s—a)(s—b)(s-c). 














Proof. See Exercise 138. 





Exercise 138. Prove Heron’s Formula (Fact 72) using the steps below. 


Let C’ be the angle opposite the side of length c. (Answer on p. 1799.) 


(a) By rearranging the Law of Cosines, express cosC' in terms of a, b, and c. 


ee 9 2 = 2 
ay as Sie = = (aly, 


(b) Now show that 1+cosC = Oa Dab 


(c) Show that (a+b)? -c = 4s (s—a) and c?- (a-b)* =4(s-6) (s-c). 
(d) Use the First Pythagorean Identity to express sinC’ in terms of cosC, 


Here you'll get a “+”. Explain why you can discard the negative value. 
(Hint: What can C' be? Hence, what can sinC be?) 
(e) Now use (b)—(d) and Area = absin C/2 to find Heron’s Formula. 








218We call s the triangle’s semiperimeter. 
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32.5. Some Formulae Involving Sine and Cosine 
Happily, List MF26 (p.3) contains the following formulae, so no need to mug: 


Fact 73. Suppose A,B eR. Then 


Addition and Subtraction Formulae for Sine and Cosine 
(a) sin(A+B) =sin AcosB+cos Asin B 
(b) cos(A+B8) =cosAcosB ¥sin Asin B 


Double Angle Formulae for Sine and Cosine 





(c) sin2A =2sin AcosA 
(d) cos2A = cos? A-sin? A = 2cos* A-1=1-2sin? A 
Proof. Informal proofs-by-picture:?!9 Q cos (A + B) sin A sin g 





In the figure””’, |PR| = 1.7”! 
Observe that |PT| = cos B. So, 


|PU| = cos Acos B; cos Asin B 
|TU| = sin Acos B. 
sin (A+ B) 
Next, 2PRQ and 2 RPU are “Z” ss 
angles. So, 2PRQ = 2RPU = 
A+B. Hence, Aca 
|PQ| =sin (A+B); 
|QR| = cos (A+ B). 
P cos Acos B U 


Next, 2 RTS is complementary?” to 2 PTU, which is in turn complementary to 2T'PU. 
So, RTS =A. Since |RT| = sin B, we have |ST| = cos Asin B and |RS| = sin Asin B. 


Now, |PQ| = |TU|+|7'S| or sin(A + B) = sin Acos B + cos Asin B. 
Similarly, |QR| =|PU|-|RS| or cos(A +B) =cos Acos B - sin Asin B. 


Exercise 139 guides you through a similar (informal and incomplete) proof of the Subtrac- 
tion Formulae for Sine and Cosine. 











Exercise 140 asks you to prove a Double Angle Formulae. 








219This proof is incomplete because it applies only in the special case where A, B, and A+ B are acute. 
For a formal and complete proof, see p. 1593 (Appendices). 

220Credit to Blue. a 

*21 Construction details. Let A,B, A+B € (0,7/2). Construct the horizontal line segment PU. Rotate Ray 
PU anticlockwise by A to produce Ray PT’. Then rotate Ray PT anticlockwise by B to produce Ray 
PR. Pick R to be the point on Ray PR such that |PR| = 1. Pick T to be the point on Ray PT such 
that RT 1 PT. Now construct the rectangle PQSU so that R and T are on the line segments QS and 
SU, respectively. 

222See Definition 90. 
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Exercise 139. Use the figure below”? to prove the Subtraction Formulae for Sine 
and Cosine (in the special case where A is acute and B < A). (Answer on p. 1799.) 


sin(A- B) = sin AcosB - cos Asin B, 
cos(A- B)=cosAcos B + sin Asin B. 


Q 5 
wi 


ag 
ie U 


Exercise 140. Use the Addition and Subtraction Formulae for Sine and Cosine to prove 
the Double-Angle Formulae for Sine and Cosine. (Answers on p. 1801.) 








223The construction of this figure is very similar to before, except now we start with Ray PQ, rotate it 
clockwise by A- B to get Ray PR, then rotate Ray PR clockwise by B to get Ray PT. 
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Fact 74. (Triple-Angle Formulae for Sine and Cosine) Suppose Ae R. Then 


(a) sin3A =3sin A-4sin’® A; 
(b) cos3A =4cos* A -3cos A. 








Proof. See Exercise 141. 











Exercise 141. Prove Fact 74(a) and (b). (Answers on p. 1801.) 
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Happily, List MF26 (p.3) contains the following formulae, so no need to mug: 


Fact 75. Suppose P,Q eR. Then 
Sum-to-Product (S2P) or Product-to-Sum (P2S) Formulae 


Me = 
(a) sinP+sinQ = 2sin me COS a 


P-@Q 





Ie 
(b) sin P-sinQ = 2cos ae 





(c) cosP+cosQ= 2cos 





i 
(d) cosP-cosQ=-2sin 

















Proof. See Exercise 142. 


The P25 Formulae will be useful when we do integration (they let us rewrite a difficult-to- 
integrate product into an easy-to-integrate sum). 


Fun Fact 


The above S2P or P2S Formulae are also known as the Prosthaphaeresis Formulae. 
Sounds cheem, but that’s just the combination of the Greek words for addition and 
subtraction—prosthesis and aphaeresis. So yea, something you can totally use to impress 
your friends and family. 


Exercise 142. Prove Fact 75(a)—(d).?”4 (Answers on p. 1801.) 


Exercise 143. Rewrite each expression using the P25 Formulae: (Answer on p. 1801.) 


(a) sin2xcos5a  (b)cos2rsin5z = (c) cos2xcos54~—s (d) sin2xsin5z 








24 int: Write P = 
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32.6. The Unit-Circle Definitions 


So far, we’ve been using the right-triangle definitions of sine and cosine (Definition 93). 
Unfortunately, these define sin and cos only for a € [0,71]. 


To define sin and cos more generally, that is, for all a € R, we turn to the unit-circle 
definitions (which supersede Definition 93) 


Definition 94 (informal). Consider the ray that is the positive x-axis (it begins at the 
origin O and extends eastwards). Call this ray x*. 


Let a be any angle. Rotate «* anticlockwise by a to produce the ray a. 


Let A =(A,,A,) be the point at which a intersects the unit circle centred on the origin. 


NN 
y The ray a_-’ 


7 


"A= (Az A,) 








The sine and cosine functions, sin: R > R and cos: R > R, are defined by 


sina =A, and Cosa = A. 





Equivalently, sina and cosa are the x- and y-coordinates of the point A. 


And so, here’s an entirely equivalent definition that gives us a slightly different way of 
thinking about sin and cos (and hopefully also a better understanding): 
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Definition 95 (informal). Consider the unit circle centred on the origin. A particle A 
travels anticlockwise around the circle at the constant speed of 1 unit per second. At 
time a = 0s, A is at the point (1,0). 


Figure to be 
inserted here. 


We define sina and cosa to be the y- and x-coordinates of A at time a. 


Example 461. If a =0, then A = (0,1) = (sin0,cos0). 


Figure to be 
inserted here. 


Example 462. If a = = then by the reasoning given in the proof of Fact. 70, 


A=(5 2) = (sin F, cos), 


7 a 6 6 


Figure to be 
inserted here. 
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Example 463. If a = = then by the reasoning given in the proof of Fact. 70, 


A= (=. S = (sin ®.cos 7). 


Figure to be 
inserted here. 


Remark 71. We can now explain how the anticlockwise convention might have arisen. 
We already adopt this convention: 

1. The x-axis points rightwards and the y-axis upwards. 

Let’s also adopt these conventions: 


2. The zero angle corresponds to the positive x-axis. 


3. Small but positive angles are in the quadrant (of the cartesian plane) where x and y 
are positive. 


Figure to be 
inserted here. 


Adopting the above conventions means also adopting the anticlockwise convention. 
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Example 464. If a = = then by the reasoning given in the proof of Fact 70, 


Figure to be 
inserted here. 


Example 465. If a = = then 


A= (1,0) = (sin F.cos). 


Figure to be 
inserted here. 





The above five examples show that the following values for sine and cosine (from Fact 70) 
also hold under the unit-circle definitions of sine and cosine: 














Tu | %@W | iT TU 

al! 0 at = = = 
6 A 3 2 

sina| V2 | Vi | v2 | v3) v4 
2 2 2 2 2 
cosy| VE | VB | v2 | VI] vo 
2 2 2 2 2 




















Proof. Continue with the figure in Definition 94. Let A = (A;,0). 


351, Contents www.EconsPhDTutor.com 


oN oa 


7 


y The ray a7" 











Consider the right triangle OAA. We have “Hypotenuse” of length |OA| = 1, “Opposite” 
of length |AA| = A,, and “Adjacent” of length JOA] = A,. And so, by Definition 93(a), for 
a € (0,7/2), 


which indeed coincide with Definition 94. 


For a = 0, by Definition 93(b), sinO = 0 and cosO = 1, which coincides with Definition 
94—-see Example 461. 

For a = 7/2, by Definition 93(b), sin (7/2) = 1 and cos(7/2) = 0, which coincides with 
Definition 94—-see Example 465. 














If a > 7/2, then our right-triangle definitions (Definition 93) are silent. Let’s see what our 
unit-circle definitions say: 
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2 
Example 466. If a = = = 1 = then 


a4 
2 


Figure to be 
inserted here. 


3 
Example 467. If a = = = 7 then 


Figure to be 
inserted here. 


5 
Example 468. If a = = = = then 


(si 570 =) 
= | sin —,cos — }. 
6? 6 


Figure to be 
inserted here. 
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Example 469. If a =7, then 


A = (0,-1) = (sin7, cos7t). 


Figure to be 
inserted here. 


i 
Example 470. If a = — = 


Figure to be 
inserted here. 


Figure to be 
inserted here. 
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4 
Example 472. If a= = =™U+ = then 


Figure to be 
inserted here. 


3 
Example 473. If a= ~ then 


Figure to be 
inserted here. 


Figure to be 
inserted here. 
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Example 475. If a= a = 27 - 7 then 


Figure to be 
inserted here. 


i 
Example 476. If a = — 


=) 
s—]. 


6 


Figure to be 
inserted here. 


Example 477. If a = 271, then we will have gone one full circle, with 


A = (0,1) = (sin 27, cos 27t) = (sin 0, cos 0). 


Figure to be 
inserted here. 





(As we’ll see in the next subchapter, each of sin and cos is periodic with period 271.) 


Let’s summarise our findings from the last 17 examples (Examples 461-477) in a single 
table: 
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Fact 77. 
a 











sin a 


















































Remark 73. To repeat, henceforth, we’ll abandon the right-triangle definitions (Definition 
93) and use only the unit-circle definitions (Definition 94 or equivalently, Definition 95). 


But note that even the unit-circle definitions are considered informal, because they rely 

on geometry. 

To formally and rigorously define sine and cosine, there are (at least) three common 

approaches: 

1. Use power series—sce Definitions 219 and 220 (in Part V (Calculus), Ch. 102.3). 
These will serve as this textbook’s official definitions of sine and cosine. 

2. Use Euler’s formula—sce Remark 123 (in Part III (Complex Numbers), Ch. 84.2). 

3. Use differential equations—see Part V (Calculus), Ch. 110.5. 


But don’t worry about these formal definitions of sine and cosine. For A-Level Maths, 
the unit-circle definitions (Definition 94 or 95) will be more than good enough. 
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32.7. Graphs of Sine and Cosine 


Fact 77 gave 17 points for each of sin and cos. Let’s graph these points: 





y » 
@ e @ 
e e e sin COS 
e @ © 
e @ c) e 
Tm Qt OOo im 4m 3m 71 
2 3 6 6 3. 2 4 “ 
od = 6 1 = @ tS © > 
mma an 571 om Illa 
a ee Fe — ase — — 27 
i a 2B 4 4 3 6 
@ e e 
@ e e 
@ @ S @ 
@ @ 





Hm, it looks a lot like we can simply “connect the dots” to get these graphs of sin and cos 
on {0 27) 
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But is it proper for us to simply “connect the dots”? All we’ve done is find 17 points for 
each of sin and cos. How do we know that their actual graphs can be obtained by smoothly 
“connecting the dots”? Given the 17 points, why couldn’t the actual graphs be something 
crazy like the following? 


Figure to be 


inserted here. 





Here’s an informal argument for why this last figure is wrong (and the previous one is 
correct). We’ll use our alternative “travelling-particle definition” (Definition 95) of sin and 
Cos: 


1. At time a =0, our particle A is at (1,0). So, sina =0 and cosa =1. 


2. During a € (0,71/2), A travels northwest. 


(a) Travel upwards is initially fast, but then slows down. So, throughout a «€ (0,71/2), 
sin is increasing at a decreasing rate. 
(b) Travel leftwards is initially slow, but then speeds up. So, throughout a € (0,7t/2),cos 
is decreasing at an increasing rate. 
3. At time a = 7/2, our particle A is at (0,1). So, sina =1 and cosa =0. 
4. During a € (7/2,71), A travels southwest. 
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(a) Travel downwards is initially slow, but then speeds up. So, throughout a € (0, 7/2), 
sin is decreasing at an increasing rate. 


(b) Travel leftwards is initially fast, but then slows down. So, throughout a € (0,71/2), 
cos is decreasing at a decreasing rate. 
5. At time a = 7, our particle A is at (-1,0). So, sina =0 and cosa = -1l. 
6. During a € (1, 371/2), A travels southeast. 
(a) Travel downwards is initially fast, but then slows down. So, throughout a € 
(0,7t/2),sin is decreasing at a decreasing rate. 
(b) Travel rightwards is initially slow, but then speeds up. So, throughout a € (0, 71/2), 
cos is increasing at an increasing rate. 
7. At time a = 37/2, our particle A is at (0,-1). So, sina = -1 and cosa = 0. 
8. During a € (37/2, 271), A travels northeast. 
(a) Travel upwards is initially slow, but then speeds down. So, throughout a € (0,71/2), 
sin is increasing at an increasing rate. 


(b) Travel rightwards is initially fast, but then slows down. So, throughout a € (0, 71/2), 
cos is increasing at a decreasing rate. 


Figure to be 


inserted here. 





Lovely. So, we’re fairly confident about what the graphs of sin and cos look like on [0, 271]. 
Let’s now consider what the graphs of sin and cos look like on [271, 47]. 


As mentioned in the last example of the previous subchapter (Example 477), we know that 
at 27t, we’ve gone a full circle. Thereafter, everything should exactly repeat. And so, the 
graphs of sin and cos on [271,47] must be exactly the same as those on [0, 27]: © 


Figure to be 


inserted here. 





Indeed, repeating the same reasoning for each k € Z*, the graphs of sin and cos on 
[2k7t, (2k +2) 7] must be exactly the same as those on [0, 271]: 
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Figure to be 


inserted here. 





Next, let’s consider what the graphs of sin and cos look like on [-271, 0]. 


Well, what does sina or cosa for negative @ even mean? Well, we’ve adopted the con- 
vention that rotating the positive z-axis x* anticlockwise is the positive direction. So, 
negative angles should correspond to rotating 7* clockwise. 


Or equivalently and using our alternative definition (Definition 95), negative a corresponds 
to our particle A travelling clockwise. 


And whether we’re travelling anticlockwise or clockwise, by reasoning similar to ©, after 
every 27 seconds, we will have gone one full circle. Hence, the graphs of sin and cos on 
[-271,0] must be exactly the same as those on [0, 27]: 


Figure to be 
inserted here. 





Indeed, repeating the same reasoning for each k € Z, the graphs of sin and cos on 
[(2k — 2) m,2k7t] must be exactly the same as those on [0, 271]: 


Figure to be 


inserted here. 





With this last figure, we’ve completed our construction of the graphs of sin and cos. 


361, Contents www.EconsPhDTutor.com 


To repeat, we say that each of sin and cos is periodic with period 27. Informally, this 
means that each graph “repeats” every 27.72? A bit more formally, 


Fact 78. For alla €R, (a) sin(x +27) =sinz; and (b) cos (x + 27) = cosa. 





Proof. (a) By the Addition Formula for Sine (Fact 73) and Fact 77, 
sin (x + 27) = sinxcos27+ cos xsin 27 = sin x. 


(b) By the Addition Formula for Cosine (Fact 73) and Fact 77, 














cos (x + 27t) = cos x cos 27 — sin x sin 271 = cos x. 


Equivalently, the translation of sin or cos by any integer multiple of 271 is itself: 


Figure to be 


inserted here. 








22°For a more formal definition of periodicity, see Definition 294 (Appendices). 
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32.8. Graphs of Sine and Cosine: Some Observations 


Figure to be 


inserted here. 





“Clearly”, sin is symmetric in the vertical line x = 1/2. More generally, for any odd integer 
k, sin is symmetric in x = kn/2. 


Similarly, cos is symmetric in the vertical line x = 7. More generally, for any integer k, cos 
is symmetric in x = kT. 


For future reference, let’s jot the above observations down as a formal result: 


Fact 79. The lines of symmetry for 


(a) sin are x = kn/2, for odd integers k; 


(b) cos are x =kmn, for integers k. 

















Proof. See p. 1594 (Appendices). 


Cosine is symmetric in x = 0 or the y-axis; equivalently, cos is an even function): 


Fact 80. For all x €R, cosx = cos(-2). 


Proof. By Fact 79, cos is symmetric in x = 0. By Cor. 6, for all x € R, cosa = cos (-2). 














The reflection of sin in the y-axis is the reflection of sin in the x-axis. Equivalently, sin is 
symmetric about the origin; also equivalently, sin is an odd function: 


Fact 81. For allxzéR, -sinz =sin(-z). 


Proof. By Fact 75 (S2P or P2S Formulae), for all x € R, 


L+2z ; 
= 2sin0cosz = 0. 





XL 
COs 





sin x + sin(-2) = 2sin —— 














Rearranging, sin = -sin (-2). 
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The translation of sin or cos by any odd multiple of 7t is its own reflection in the x-axis: 


Fact 82. For all x €R and all odd integers k, 


(a) sin(#+km) =-sing; (b) cos(x+km) =-cosz. 


Figure to be 
inserted here. 





Proof. First, observe that for all odd integers k, cos (k7t/2) = 0. 
(a) By Fact 75 (S2P or P2S Formulae), for all xe R, 








kn — 
sin (x + k7t) +sinax = ig = 2sin (x +k) coske = 0). 
2 2 2 2 
Rearranging, sin (xv + k7t) =-singz. 
(b) By Fact 75 (S2P or P2S Formulae), for all x € R, 
Crk e mehr z 7 ™ 4 
cos (x + k7t) + cos x = 2.cos 5 COs 5 = 2cos (x +k) cos ke = 0; 











Rearranging, cos (1 + k7t) = —cosz. 





Figure to be 
inserted here. 





The next result says that sin is cos translated right by 71/2 (equivalently, cos is sin translated 
left by 7/2): 


Fact 83. For allxeéR, 





(a) sin = cos (x7), (b) cos =sin (+7), 


™ nm . , 
Proof. (a) cos zs = costcos > +sinasin > =0+sing = sing. 














; 7 . TT _ 7 
(b) sin r+> = sin cos 5 + coszsin > = 0 + cosa = cos x. 
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32.9. About the Prefix Co- 
You will probably have noticed that 
cosine = CO + sine. 


In the context of trigonometric functions, the prefix co- has this meaning: 


If A+ B=7/2 (ie. A and B are complementary), then 
sin A = cos B. 
Or equivalently, for any A€R, 
sin A = cos (; 7 A) 
2 

Also equivalently, using what we learnt in Ch. 26 (Transformations), 

Sine is cosine shifted left by 7/2, then reflected in the y-axis. 
And symmetrically, 

Cosine is sine shifted left by 7/2, then reflected in the y-axis. 


As we'll see in the coming chapters, the above will also be true of the functions tangent 
and cotangent, and the functions secant and cosecant. 


Figure to be 


inserted here. 
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33. Trigonometry: Tangent 


Definition 96. The tangent function, denoted tan, is defined by 





Example 478. XXX 


Observe that cosa = 0 <=> ai is an odd-integer multiple of 7/2. So, tan = sin/cos is 
undefined at odd-integer multiples of 7/2: 


Example 479. XXX 


Exercise 144. What are the domain and codomain of tan? (Hint: Review Ch. 20.) 


(Answer on p. 1802.) 





By the right-triangle definitions, 
_ O 4 A 
sin = — an cos = =. 
H H 
So, tan = — = —— =-. 


In the past, Singaporean students were taught the Hokkien mnemonic “big leg woman”: 


K 3 Ved 


tua kha 80 
TOA CAH SOH 
O ; A O 
Tangent = A Cosine = i Sine = a 


But these days, we probably prefer a more angmoh mnemonic like 


Soccer tour—SOH-CAH-TOA.?” 


Example 480. XXX 
Example 481. XXX 





226Credit to Tom J at TheStudentRoom.co.uk (warning: that page may contain many other NSFW 
mnemonics). 
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33.1. Some Formulae Involving Tangent 
Happily, List MF26 (p.3) has the following formulae, so no need to mug: 


Fact 84. Let A, Be R. Suppose A and B are not odd integer multiples of 7/2. 


(a) If A+ B is not an odd-integer multiple of 7/2,??" then 


Addition and Subtraction tan A+tanB 
tan (A+B) = ——————_.. 
Formula for Tangent 1+ tan Atan B 


(b) If 2A is not an odd-integer multiple of m/2, then 


Double Angle ee 2tanA 


Formula for Tangent eno 





Proof. (a) See Exercise 145. 
(b) See Exercise 146. 














Exercise 145. Prove Fact 84(a). (Answer on p. 1802.) 
Exercise 146. Prove Fact 84(b). (Answer on p. 1802.) 





The remaining formulae in this subchapter aren’t very useful, but are conceivably things 
you could be asked to derive or use on an exam, and so are worth a mention. 


Rearrange the above Addition and Subtraction Formulae to get the Sum-to-Product 
(S2P) Formula for Tangent: 


Fact 85. (S2P Formula for Tangent) Let A,B eR. Suppose A, B, and A+B are not 
odd-integer multiples of m/2, then 


tanA+tan B=tan(A+B)(1FtanAtanB). 








27Tn each of Fact 84(a) and (b), there is the worrying possibility that the denominator is zero, in which 
case we’d be committing the Cardinal Sin of Dividing by Zero (see Ch. 2.2). It turns out that happily, 
in each case, our additional condition that A+B or 2A is not an odd integer multiple of 7/2, as we now 
show (via the contrapositives): 


(a) Suppose 1# tan Atan B =0. This is equivalent to 
sin A sin B 


tanAtanB=+1 <> =+1 <> 0=sin Asin BF cos AcosB =¥cos(A+ B), 
cos A cos B 








so that A+ B is not an odd integer multiple of 7/2. 


(b) Suppose 1 - tan?A = 0. This is equivalent to tan?A = 1 <— > tanA =41 <— Ae 
{(k+40.25)7:keZ} <> 2A € {0.5k7:k is an odd integer}, so that 2A is not an odd integer 
multiple of 7/2. 
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Fact 86. (Triple Angle Formula for Tangent) let A,B ¢ R \ 
{kn/2:k is an odd integer}. If 3A is not an odd-integer multiple of /2,?" then 


3tan A—tan? A 


tan 3A = 
= = eeme 4 

















Proof. See Exercise 147. 


Exercise 147. Prove Fact 86. (Answer on p. 1802.) 
Earlier, we had the Laws of Sines and Cosines (Proposition 7). Similarly, 


Fact 87. Suppose a triangle has angles A and B facing sides of lengths a and b. If 
a+#b,? then 


arb _ tan 42 


ee tan 42 (Law of Tangents) 
a— an 

















Proof. See Exercise 148. 


Exercise 148. We'll prove Fact 87. By the Law of Sines, 
snA sinB 

a oo 
a+b . sin Asin B 
a-b sinAsinB 


bsin A = asinB. 


(a) By considering (Times One Trick) and using =, show that 





a+b a sin A+sin B 
a—-b snA—sinB 


er 
(b) Now”° show that = eyes (Answer on p. 1802.) 








228Similar to n. 227, we can show that this condition ensures 1 - 3tan? A + 0: 
Suppose 1- 3tan?A = 0. This is equivalent to tan?A = 1/3 <> tanA = +V3/3 <> 
Ae {(k+1/6)m:keZ} = > 3AC {kn/2:k is an odd integer}, so that tan3A is not defined. 
229Suppose a ¢ b. Then A + B and each of the denominators of LHS and RHS is non-zero, so that both 
LHS and RHS are well-defined. 
Observe also that the angles of a triangle are strictly positive and add up to m. So, A+ B = 1-C € (0,7) 
and A- B« (-7,7). Hence, neither (A+B) /2 € (0,7/2) nor (A+B) /2 € (-7/2,7/2) is ever an 
odd-integer multiple of 7/2. Thus, both tan[(A + B) /2] and tan[(A- B) /2] are always well-defined. 
?30Hint: Whenever you see a trigonometry question, what should you do? 
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33.2. Graph of Tangent 


Use Fact 77 (17 values for each of sin and cos) to produce these values of tan: 








An |3m| 5a | 7r| lla 
Sle 3 | als 
1 1V31@ ~V3| =) 0 
¥s|@ |-v3| a] 


an 








We already said that tan is undefined at each odd-integer multiple of 71/2. We can say 
more: 


Fact 89. For odd integers k, x = krt/2 is a vertical asymptote for tan. 


Let’s graph the points in Fact 88: 


Figure to be 
inserted here. 





Hm, it looks a lot like we can simply “connect the dots” to get this graph of tan on [0, 271]: 


Figure to be 
inserted here. 





As before, we can make an informal argument as to why the above graph is correct and 
something crazy like the below is wrong. But we shall omit this. 


Figure to be 


inserted here. 





Like sine and cosine, tangent is periodic. But unlike sine and cosine, tangent has period 
m (instead of 27t). Informally, this means that the graph of tan “repeats” every 7. More 
formally, 
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Fact 90. For all x éR, tan(x+7) =tanz. 


Proof. By the Addition Formula for Tangent (Fact 84) and Fact 88, 


tan © + tan 71 _ tang 7 
l-tanztannxn 1-0 














tan (x + 71) = tan x. 





Equivalently, the translation of tan by any integer multiple of 7t is itself. And so, the 
complete graph of tan looks like this: 


Figure to be 


inserted here. 
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33.3. Graph of Tangent: Some Observations 


The reflection of tan in the y-axis is the reflection of tan in the x-axis: 


Fact 91. Jf x €R is not an odd integer multiple of 1/2, then 


—tanxz =tan(-2). 





Proof. Note that -x + x = 0, which is not an odd integer multiple of 7/2. So, by Fact 85 
(S2P Formula for Tangent), 


tan (-2) + tanx = tan (-a2 + x) [1-tan (-x) tan] = 0. 











Rearranging, —tan zx = tan (-2). 


Fact 92. The graph of tan has no lines of symmetry. 


Proof. See p. 1595 (Appendices). 

















Figure to be 
inserted here. 





You may be wondering whether any single “branch” of tan might have a line of symmetry. 


Figure to be 


inserted here. 





For example, might the graph of tan| = have a line of symmetry (perhaps some 
—7/2,7 


downward-sloping line)? Answer—no: 


Fact 93. The graph of tan| has no lines of symmetry. 


(-1/2,77/2) 











Proof. See p. 1595 (Appendices). 
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34. ‘Trigonometry: Summary and a Few New Things 
Graphs of sin, cos, and tan: 


tan 


COs 


I 
l 
I 
I 
I 
I 
I 
I 
| 
| ! 
sin 
! 1 
I 
I 
I 
I 
I 
! 
| 
I 
! 
I 


IT 2Tt 





| 
eH 


To remember these three graphs, here are the key things to remember: 
e sin and cos both have 


— the same shapes; 

— period 271; 

— max value 1; and 

— min value -1. 
e sin and tan both go through the origin. 
¢ cos has y-intercept (1,0). 


e tan goes from —oo to oo and has period 7. 
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34.1. The Signs of Sine, Cosine, and Tangent 


Fact 94. (The Signs of Sine, Cosine, and Tangent) 


for x € [0,7], 
jay seine for a <(0, 70), 
for x € [7, 271], 


for x € (7,27). 


for x € [-7/2, 7/2], 
for x € (-7/2,7/2), 
for x €[m/2, 37/2], 
for x € (7/2, 37/2). 


for x €[0,7/2), 
for 4 € (0) a/2),, 
for x € (-7/2,0], 
for x € (-7/2,0). 


Figure to be 
inserted here. 





Of course, since sin and cos are periodic with period 271, while tan is periodic with period 
7m, we can generalise the above result as 
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Corollary 13. (The Signs of Sine, Cosine, and Tangent) For every k € Z, 
for x € [2k7, (2k +1) 71], 
for x € (2kn, (2k +1)7), 


(a) sing 


for x €[(2k +1) 7, (2k +2) a], 
for x € ((2k +1) 7, (2k +2)7); 
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34.2. Half-Angle Formulae 


Fact 95. (Half-Angle Formulae for Sine, Cosine, and Tangent) 


1-—cosx 


Mm 
Es 
<=) 
4 
= 


for 


(a) 


mols wls 


a + cosx for 
2 2) 


x 
COSt—a 


mol] S wls 


2 7 /1+cosx for 
2 7 


(c) (i) tan 5 = — for all - ER. 


l-—cosx x 
il f liek. 
(c) (ii) smg ? tor all 5 § 


l-—cosz 
a V 1+cosz’ i! 
(c)(iii) tan = 
2 l-—cosz ah 
-\ / ——_, for 
1+cosz 


Proof. By the Double Angle Formulae, for all x € R, 








cost = cos (5+ =) = 2eos? 5-1 = 1-2sin? =. 


So ein? © = LT OOS sail sg a Pees 
2 2 2 2 


Or eee econ and soa Se Enos! 
> 2 a z °° 


To figure out what the correct sign is for each specific x/2, use Corollary 13: 








(a) By Corollary 13, sin 5 > 0 if 5 e [0,7] and sin 5 <0 if 5 € [7, 271]. 


1- 
coer for [0,7], 
_ 2 2 Z 
Hence, sin — = 
° —— for ~ ¢ [ 7, 27] 
Z Z 


(Proof continues below ...) 
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(... Proof continued from above.) 


(b) Similarly and again by Corollary 13, 


cos 5 > 0 if 5 € [-7/2, 7/2] and 


ee 

3? 
1+cosxz 

= a 


Exercise 149. Prove Fact 95(c). 


ic 
Hence, cos a 


(c) See Exercise 149. 


cos = <0 if 5 € [n/2, 3n/2]. 














(Answer on p. 1803.) 


Of course, since sin and cos are periodic with period 271, while tan is periodic with period 
7m, we can generalise (a), (b), and (c)(iii) in the above result: 


Corollary 14. For every k € Z, 


l-—cosz 


(a) 


for 7 € (2k, (2k +1)7], 


for - e[(2k +1), (2k+2)7]; 


/1+cosx 
y) b) 


/1+cosx 
2 3) 


(c)(iii) tan = = 


PY mel for Se [kr (2k +1) 7), 
1+cosz Z 2 


2 l1-—cosx £ A 
——————————— ior 2) Sle 
V 1+cosx’ or 5 (( )o n 
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35. Trigonometry: Three More Functions 


The cosecant and secant functions are defined as the reciprocals of sin and cos: 


Definition 97. The secant and cosecant functions, denoted sec and cosec, are defined by 





Example 482. XXX 





Example 483. XXX 








Example 484. XXX 
Example 485. XXX 


Exercise 150. XXX (Answer on p. 377.) 


A150. 





231Qn p. 18 of your H2 Maths syllabus, these six functions are simply called the circular functions. 
However, the term trigonometric functions is probably more common. 
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The cotangent function cot is “nearly” but not quite the same as the function 1/ tan: 


Figure to be 


inserted here. 





Fact 96. cot # 1/tan. 


Proof. Since tan is not defined at 7t/2, neither is the function 1/ tan. 
In contrast, cot is defined at 7/2 (with cot 7/2 = 0). 
Hence, cot # 1/ tan. 














Recall the First Pythagorean Identity (Fact 71): 
sin? A + cos? A = 1 (for all A€R). 


We can use the above identity to prove two related identities (with equally inspiring names): 


Fact 97. Suppose AER. 


























(a) IfcosA #0, then 1+ tan? A = sec? A. (Second Pythagorean Identity) 
(b) Ifsin A #0, then 1+ cot? A = cosec?A. (Third Pythagorean Identity) 
2,_,,sin"A cos*At+sin°A 1 4g 
Proof. (a) 1+tan*° A=1+ ath Sala nr aa A. 
2A sin? A 2A 1 
(b) 1+cot? A=1+ se apa: = =z = cosec?A. 
sin* A sin’ A sin’ A 


In Ch. 26.8, we learnt to use the graph of f to graph 1/f. Here’s more practice: 
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Exercise 151. Graph cosec, sec, and cot. For your convenience, here are the graphs of 
sin, cos, and tan again: (Answer on the next page.) 


tan 
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Graphs of cosec, sec, and cot: 


cosec sec cot 


—~2r _3n : - = a aa a : fs ~\ 27 
2 2 





As discussed in Ch. 32.9, the prefix co- has this meaning: 

Suppose A+ B =717/2 (i.e. A and B are complementary). Then, 

* sin A =cos Bor sin A = cos (7/2 — A) = cos(A — 71/2).2” So, cos translated 11/2 rightwards 
is sin. 

* Similarly, sec A = cosec B or cosec B = sec (7/2- B) 2 sec (B - 71/2).233 So, sec trans- 
lated 7/2 rightwards is cosec. 


¢ Also similarly, tan A = cot B or tan A = cot (1/2 - A) = cot [-(A-7/2)]. So, cot trans- 
lated 7/2 rightwards and then reflected in the z-axis is tan. 





7821 uses Fact 80: cosx = cos (-7) (equivalently, cos is symmetric in the y-axis). 


2337 ikewise, 2 uses the fact that cos and hence also sec = 1 /cos is symmetric in the y-axis. 
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36. ‘Trigonometry: Three Inverse Trigonometric Functions 


Suppose we want to find the inverses of sin, cos, and tan. 
Unfortunately, none of them is one-to-one. For example, the horizontal line y = 1 intersects 
each graph more than once—so, by the Horizontal Line Test (HLT), none is one-to-one. 


\ 
y tan 


KV AK 7 
OU SOU 


(Indeed, for any c€ [-1,1], the horizontal line y = c intersects each graph more than once. 
And any horizontal line intersects the graph of tan more than once.) 


So, sin, cos, and tan are not one-to-one. But as we learnt in Ch. 21, by restricting the 
domain of each function, we can create a brand new function that is one-to-one. 





As usual, there are many (in fact infinitely many) ways we can do the domain restriction. 


For example, the restriction of sin to [2,4] is one-to-one. So too are the restrictions of cos 
to [-3,-2] and tan to [4,5]: 


Figure to be 


inserted here. 





But of course, as usual, we want to pick restrictions that are as “large” as possible. So, we 
might for example instead pick the restrictions of sin to [7/2,37t/2], cos to [-27,0], and 
tan to (7/2, 37/2)—these new functions are one-to-one: 
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Figure to be 


inserted here. 





Indeed, any translation (or shift) of the above restrictions by any integer multiple of 27 
would also work. 


But the standard convention is to use these particular restrictions: 


sin | , cos | , tan | 


Exercise 152. Graph sin | ete j , (Answer on the next page.) 
™ 7 0 == 





For each, explain why it is one-to-one, then write down its inverse. 
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No horizontal line intersects each of sin| ; cos| 


a 


; tan more than once. So, 


0,7] = 


LL 
272 


gas 
2 


each is one-to-one. 


sin 





COS 








We now define the three inverse trigonometric functions—arcsine, arccosine and 
arctangent: 


Definition 99. The inverse trigonometric functions arcsine, arccosine, and arctangent— 
denoted sin™', cos, and tan-!—are the inverses of 


and tan 














So, Domain | Codomain Mapping rule 
m 1 
sin’ | [-1,1] -=.5| y =sing =— > sin ly =2 
cos! | [-1,1] [0, 7] y=costr — > cos ty =2 
am 1 
tan R (-2, *] y=tang — > tanly=2 
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7 


2 





It can be shown that every inverse function is onto;* equivalently, every inverse function’s 


range and codomain coincide. And so here indeed, each of sin™!, cos", and tan™! is onto; 
equivalently, the range and codomain of each coincide. 


Observe that 


¢ sin’ has endpoints (—1,-7t/2) and (1,7t/2), and is strictly increasing; 
¢ cos! has endpoints (—1,7t) and (1,0), and is strictly decreasing; and 


¢ tan has no endpoints and is strictly increasing. 





Exercise 153. XXX (Answer on p. 384.) 
A153. 





*34act 254 (Appendices). 
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36.1. Confusing Notation 


There are two separate and big points of potential confusion: 


1. 


As noted in Ch. 32.3, sin? = sin-sin—this is confusing and contrary to our earlier use of 
f? to denote the composite function f o f. 


Here, to add to our a sin“! doesn’t mean 1/sin (as would be logical given that 


sin? = sin-sin). Instead, sin”! x denotes the inverse of the function sin ates 


“98 | 


dil) 


1 


. The notation (sin, cos"!, and tan") and names (inverse trigonometric functions) may 


incorrectly suggest that 


sin, cos-!, and tan! are the inverses of sin, cos, and tan. x 


But the above statement is actually false! Instead, 


sin, cos}, and tan7! are the inverses of sin | nq? €O8 f , tan| ae J 
a8 0,7 ed 
2°92 2°72 


= 


Because of the above two points of confusion, other writers prefer to denote the arcsine, 
arccosine, and arctangent functions by 


or 


or 


arcsin, arccos, and arctan; 
ol =| 4 
Sin, Cos, and Tan -; 


asin, acos, and atan. 


(The last is commonly used in computer programs.) 


However and unfortunately, your A-Level exams and syllabus insist on using the confusing 
and misleading notation sin™', cost, and tan™'. And so that’s what we shall do too. 
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36.2. Inverse Cancellation Laws, Revisited 


We reproduce Proposition 5 from Ch. 24.4: 


Proposition 5. (Inverse Cancellation Laws) /f the function f has inverse f~', then 


(a) f+ (f (2)) =2 for all x ¢ Domain f; and 
(b) f Cm (y)) =y for all y € Domain f. 





Given the above cancellation laws, the following result is perhaps not surprising: 


Fact 98. (a) sin(sin' x) =x for all x € Domain sin 
(b) cos(cos"! x) = x for all x € Domain cos“! = [-1, 1]. 


(c) tan(tan"! x) = 2 for all x ¢ Domain tan”! =R. 








Proof. See p. 1596 (Appendices). 











Example 489. For x = 1, we have 
a) sin (sin! 1) = sin — =a 

(a) sin (sin“!1) = sin 2 

(b) cos (cos! 1) = cos0 = 1. 

(@) tan( tans 1) = tan = 

Example 490. For x = 0, we have 
a) sin (sin! 0) = sin0 = 0. 

(a) sin ( 
b) cos (cos! 0) = cos is 0. 

(b) c0s( 


(c) tan (tan! 0) = tan0 =0. 





1 


Of course, if 2 ¢ [-1,1], then sin"'z and cos are undefined, so that sin (sin 2) and 


cos (cos x) are also undefined: 


Example 491. For x = 3 * 1.73, we have 
(a) sin (sin? v3) is undefined. 


(b) cos (cos v3) is undefined. 


(c) tan (tan! v3) San = 





1 


The composite functions sinosin™!, cosocos!, and tanotan! exist (why?)?% and their 


graphs are these: 








1 








23°The composite function sino sin™ 





: oo ™ 7 md 
exists because Range sin”! = |-z | CR = Domain sin. 
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Figure to be 


inserted here. 





Perhaps surprisingly, these three claims are false: 


(a) “sin ‘(sinz)=2 for all x ¢€ Domain sin =R.” x 
(b) “cos *(cosz) =a for all 2 € Domain cos = R” x 
(c) “tan (tanz) =a for all ee Domain tan =R\ {(2k+1)n/2:keZ}” Xx 


A simple counterexample to show that the above three claims are false: 


Example 492. For x = 471, we have 
(alsin (sin4da) =sin 0S 04 4q . 
(b) cos” (cos 47) = cos"! 1 = ; #47. 


(c) tan‘ (tan 47) = tan’'0 = 0 2 4n. 





Here are the correct versions of the above three false claims: 


Fact 99. (4) sin! (sinz) =2 


(b) cos (cosxz) =z 


(cle tanes (ang 








Proof. See p. 1596 (Appendices). 


Example 493. XXX 
Example 494. XXX 


236 











The composite functions sin"! osin, cost ocos, and tan! otan exist (why?). 


And their graphs are these:?°" 








al 








Similarly, cos o cos” exists because Range cos! = [0,7] ¢ R = Domain cos. 

















TT 
And tanotan™' exists because Range tan7* = (-5. 5) £ R\ {(2k +1) 7/2: k € Z} = Domain tan. 
8°The composite function sin’ osin exists because Range sin = [-1,1] ¢ [-1,1] = Domain sin™!. 


‘ ocos exists because Range cos = [-1,1] ¢ [-1,1] = Domain cos". 


And tan! otan exists because Range tan = R € R = Domain tant. 


*87See Fact 260 (Appendices). 


Similarly, cos~ 
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Figure to be 
inserted here. 


Exercise 154. Here’s the graph of cos”: 








Clearly, the following claim is false: 
“Tf # €[-1,1), then cos’ = 0” x 

Nonetheless, below is a seven-step “proof” of the above false claim. Identify any error(s). 
(Answer on p. 1803.) 

. Suppose x € [-1,1). 

. Let A+ cos? z. 

. 90, © =cos A. 

. But by Fact 80 (Cosine Is Symmetric in y-Axis), cos A = cos (-A). 

. So, z= cos (A). 


ste 
. Hence, cos"! x = —-A. 


. Taking 5 + 2 we get 2cos/ a =0 and thus cos! x = 0. 
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36.3. More Compositions 





Proof. Informal and incomplete proof-by-picture:?°> 


Consider a right triangle with legs of lengths x > 0 and 1. By Pythagoras’ Theorem, its 
hypotenuse has length V1+2?. Let a be the angle opposite the leg of length zx. 


Figure to be 
inserted here. 











O 
Since tana = 7 ; =x, we have a= tan! x. Hence, 
O a 
sin (tan7 x) =singd === ; 
H 1+<2? 
cos (tan7 x) = COSa@ = = = : : 
H V14+2? 


Next, consider a right triangle with hypotenuse of length 1 and a leg of length x. By 
Pythagoras’ Theorem, its other leg has length V1 —- 2?. 


Figure to be 


inserted here. 








38Informal because we rely on geometry and incomplete because we cover only the special case where 
x >0. For a formal and complete proof, see p. 1598 (Appendices). 
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Let @ and ¥ be the angles opposite the legs of lengths x and V1 - z?, respectively. 
Opposite x 








Since sin 6 = = — =z, we have = sin! x. Hence, 
Hypotenuse 1 
eens -wre an 1-2? 
7 HH il 
O Z 
tan (sin ‘z) = tan = — = ; 
( ) p A 1 = 2 


Adjacent ue | 
=—=2, we have y=cos x. Hence, 


Similarly, since cos y = aoe A 
ypotenuse 

















O 1- a es 
. -1 zs . -_ = _ 9) 
- = = 1 i 
sin (cos x) sin y H 1 fae 
tan (cos! x) tan i = a : 
= fy — _ 


1 1 


The four composite functions sinocos!, sinotan™!, cososin™!, and cosotan are well- 


defined. Here are their graphs: 


Figure to be 


inserted here. 





Unfortunately, tanosin™' and tanocos"! are not well-defined. (Why?)?*? 


Nonetheless, we may graph the equations y = tan (sin! x) for x € (-1,1) andy = tan (cos x) 
for x €[-1,1] \ {0}: 


Figure to be 


inserted here. 








7 
2 


1 


1 
239Range sin! = |-z. is not well- 


defined. ” 
Range cos = [0,7t] ¢ Domain tan. In particular, 3 ¢ Domain tan. Hence, tan o cos” 


1s 
¢ Domain tan. In particular, a ¢ Domain tan. Hence, tanosin™ 


lis not well-defined. 
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36.4. Addition Formulae for Arcsine and Arccosine 


Fact 101. (Addition Formulae for Arcsine and Arccosine) Let x €[-1,1]. 


(a) costx+ sin'x =n/2. 


(b) sins + sin"! (-x) =0. 
1 


(c) cos! + cos! (-2) =7. 





Proof. Informal and incomplete proofs-by-picture:?”” 


(a) Construct a right triangle with base x € (0,1) and hypotenuse 1. 











Let a and ( be the labelled angles. Then 


ie O « 
cOSs@=—=-=2 and snp — =7, 
H 1 H 1 
So, cos =a and sin! = 8. But of course, a+ G = 1/2. Hence, cos) x+sin! x = n/2. 


(b) Let x € (0,1). Consider the unit circle centred on the origin. Let A and B be the 
points on its right half with y-coordinates 7 and —2x. 











Let 7 be the angle corresponding to the point A. Then the angle corresponding to the 
point B is -+. 





40For the formal and complete proofs, see p. 1596 (Appendices). 


391, Contents www.EconsPhDTutor.com 


By the unit-circle definition of sine, 
siny=2z and sin (-y) = -2. 
Hence, sin“! x = y, sin"! (-2) = —y, and 


1 


sin} 2 +sin!(-2) = y+(-y) =0. 


(c) Let x € (0,1). Consider the unit circle centred on the origin. Let C and D be the points 
on its top half with z-coordinates x and -z. 














Let 6 be the angle corresponding to the point C’. Then the angle corresponding to the point 
Disn-o. 


By the unit-circle definition of cosine, 
cosd =x and cos (7-6) =—a. 


Hence, cos"! x = 6, cos! (-x) = 1-6, and 





cos’ x + cos" (-2) =6+(m-6)=7. 


Example 495. XXX 
Example 496. XXX 
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36.5. Addition Formulae for Arctangent 
Fact 102. (Addition Formulae for Arctangent) Let x €R. 
(a) tan t2+tan!(-2) =0. 


| 2s jor a 0, 
-7/2,  forx<0. 


if 
(b) tan? 2+ tan? — = 
at 


u+rYy 


; jor vg <r 
San 


tem 


+ 
(c) tan !a+tan ly =4 tan! = + for zy>1ANDz > 0, 


ge eee for zy>1ANDz <0. 
1) 





Proof. Informal and incomplete proofs-by-picture for (a) and (b):2"1 


(a) This is very similar to the earlier proof-by-picture of Fact 101(b) (sin7' x+sin7! (—x) = 0): 


Let x € (0,1). Consider the circle with radius V1+ 2? centred on the origin. On this circle 
are the points A = (1,z) and B= (1,-2). 











Let y be the angle corresponding to the point A. Then the angle corresponding to the 
point B is -7+. 


By the unit-circle definitions, 
tany=2 and tan (-y) = -2. 


Thus, tan} 2 =~, tan’! (-x) = -y, and 





*41For the formal and complete proofs, see p. 1601 (Appendices). 
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tan ‘a2 +tan!(-«) = 7+(-y) =0. 


(b) This is very similar to the earlier proof-by-picture of Fact 101(a) (cos"! #+sin7! x = 71/2): 
Construct a right triangle with legs of lengths x € (0,1) and 1. 














B 
iz 
ao [| 
1 
Let a and ( be the labelled angles. Then 
ne Opposite = xv _ : sad ini Opposite — 1 
~ Adjacent 1 ~ Adjacent 2’ 


1 i! 

So, tan! z =a and tan! — = 8. But of course, a + 8 = 7/2. Hence, tan 2 + tan! — = 7/2. 
x t 

(c) By the Addition Formula for Tangent and Fact 98(c), 


1 uf 
tan(tan"!2+tan!y) = ee eee = ") a »Y) ae 
1=tan (tan x) tan (tan x) Ley 





vy 


So, tan7! = = tan (tan (tan! a+tan! y)). 


We now consider the three cases in turn: 

Lose 1. ay <1, 

Then by Lemma 1(b) (below), tan™! x + tan”! y € (-7/2, 77/2). 

Hence, by Fact 99(c), tan7! (tan (tan! a+tan y)) =tan'x+tan ly. Thus, 


aa L+y 
l-azy 


ta =tanbe+tan? ue 











The remainder of the proof of (c) continues on p. 1602 (Appendices). 





Lemma 1. Suppose x,y¢R. Then 


(a) tan ’a+tan ly =47/2 ou aa; 


(b) € (-7/2, 7/2) Tal es Ale 
(c) € (m/2,7) zy >1 AND x>0; 
(d) € (-7, -71/2) zy>1 AND «<0. 

















Proof. See p. 1602 (Appendices). 
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36.6. Harmonic Addition 


You are supposed to have mastered the following in secondary school: 
¢ “expression of acos@ + bsin@ in the forms Rsin (@ +a) and Rcos(@+a)”.” 


So, let’s review how this is done. 


Example 497. Q: Express 2cos (71/3) + 3sin (7/3) in the form Rsin (6 + a). 
A: Write Rsin (6 + a) = 2cos (7/3) + 3sin (71/3). We will find R and a. 
By the Addition Formulae for Sine, 


3 2 
— 


is 
Rsin (6+ 7/3) = Reosasind + Rsinacos 6. 


2) isin (0) 


S = 
Po) Reo. (a) 





2 
= tana and a =tan! = 





Z 
Next, since 3 = Rcosa = Rcos (tan 





More generally, 


Theorem 7. Let 0¢€R. Suppose a,b #0. Then 

(a) acosé+bsind =sgnbVa? + b? sin (0 + fan | 

(b) = sgenaVa? + b?cos{@+ tan —). 
a 

a 


= senbV a? + b?sin (0 Sane = 


b 
= senav a? + b? cos (0 Steme- °), 
a 





Proof. (a) Write Rsin(@+ a) =acos@+ bsind. We will find R and a. 
By the Addition Formulae for Sine, 





42112 Maths syllabus (p. 14). 
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b a 


a —“_ 
Rsin(@+a) = Reosasiné + Rsinacosé. 


a Rsina 
Now, == 


= = tana. 
6 Rceosa 





a 
So, a = tan 7 


And since Rcosa = b, by Fact 100(d), we have 


— b b 7 b s. , 
"cost eos(tan™§) ES Aa) 
+(a 


= ovis + (a/b) VP +a? =senbvb? + a?. 


(b) Write Rcos (6 +a) =acos6é+bsin@. We will find R and a. 
By the Addition Formulae for Cosine, 





b 
a an —_—_ 
Rcos(@+a) = Rceosacosé-Rsin asin 6. 
b —Rsina 
Now, —= = —tana. 





b 
So, a =tan (-}. 
a 
And since Rcosa =a, by Fact 100(d), we have 


a a a b2 


- COS 7 cos (tan! (-4)) Jirbfay 
+(-b/a 

2 
ag 14 = OV = gnaVar +B. 


lal a’ al 


(c) Take (a) and apply Fact 102(a). 
(d) Take (b) and apply Fact 102(a). 














Exercise 155. XXX (Answer on p. 396.) 
A155. 
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36.7. Even More Compositions (optional) 


The four composite functions sin-!ocos, cos"hosin, tan-hosin, and tan™!ocos are well- 
defined. Here are their graphs: 


Figure to be 


inserted here. 





Unfortunately, sin“! otan and cos"! otan are not well-defined. (Why?)?”° 


Nonetheless, we may graph the equations y = sin’ (tanx) and y = cos"'(tanz), for x € 
[-k7t/4, k7t/4] (for k € Z): 


Figure to be 
inserted here. 





The composite functions sin7! ocos : R > R and cos"! osin: R > R do exist. Their mapping 


rules are messy and so relegated to the Appendices (Fact 261). Here are their graphs: 


Figure to be 


inserted here. 





However and somewhat interestingly, it’s impossible to write down algebraic expressions 
for tan™' (sinx), tan7'(cosz), sin"! (tanz), and cos”! (tan 2). 








*43Range tan = R ¢ [-1,1] = Domain sin™!. Hence, sin”! otan is not well-defined. 
1 














Range tan = R ¢ [-1,1] = Domain cos~'. Hence, cos”! tan is not. well-defined. 
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37. Elementary Functions 


This brief chapter serves as a quick review of some of the functions we’ve encountered so 
far. We'll also learn a new term: elementary functions. 


Definition 100. A polynomial function is any nice function defined by 7% ag + a,x + 
age += o,2", whiere ay,01,...0, © R and 2 © Za. 


Example 501. The function f : R > R defined by f (x) = 52° - 2 +3 is a polynomial 
function. (What is f (1)? f (0)?)?* 


Example 502. The function g: [1,2] > R defined by g(x) = 54? — x + 3 is a polynomial 
function. (What is g(1)? g(0)?)*” 





Definition 101. An identity function is any function defined by x + «.7“6 


Example 503. The function f : R > R defined by f(x) = x is an identity function. 
(What is f (1)? f (0)?)?4" 
By the way, any nice identity function is also a polynomial function. (Why?)™* Here for 


example, f is also a polynomial function. 


Example 504. The function g : [1,2] > R defined by g(x) = = is an identity function. 
(What is g(1)? g(0)?)?° 


Example 505. The function h : {H, w = {PH, w defined by h(x) = x is an 
identity function. (What is h (4)? h(@)? h(1)?)? 


Definition 102. A constant function is any function defined by x + c, where c is any 
object. 








244 F (1) = 7, f (0) =3. 

2459 (1) = 7, g (0) is undefined. 

46 Strictly speaking, we should distinguish between an identity function and an inclusion function. Strictly 
speaking, Definition 101 given here is not of an identity function but of an inclusion function. See 
Definitions 284 and 285 (Appendices). 

This is a subtle distinction that shall not matter at all for A-Level Maths. Indeed, the term identity 
function is never mentioned in your A-Level Maths syllabus or exams. But it is sufficiently convenient 
that we’ll sometimes use Definition 101 in the main text anyway. 

7 F(1)=1, f (0) =0. 

*48Tts mapping rule may be written as 7 ag + a,X + dou" +-+++G,2", where ao, d1,...d, € R and ne Z 
(in particular, ag = 0, a; = 1, and n= 1). 

*499 (1) = 1, g (0) is undefined. 


250 p) (PR) = ae h (w) = yy, h(1) is undefined. 
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Example 506. The function f : R > R defined by f(x) = 5 is a constant function. 
(What is f (1)? f (0)?)?™ 


By the way, any nice constant function is also a polynomial function. (Why?) Here for 
example, f is also a polynomial function. 


Example 507. The function g: [1,2] > R defined by g(x) = 5 is a constant function. 
(What is g(1)? g(0)?)?” 


Example 508. The function h : {Hw = {H, w defined by h(x) = 9PR is a 
constant function. (What is h (9M)? h(W)? A(1)?)?8 


Example 509. The function 7: R > { PH, w defined by i(x) = & is a constant 
function. (What is i(#W)? 1(@W)? i(1)? 1(0)?)?* 





A special case of a constant function is a zero function: 


Definition 103. A zero function is any function defined by x + 0. 


Example 510. The function f : IR > R defined by f (x) =0 is a zero function. (What is 
TOC 

By the way, any zero function is also a constant function. (Why?)?°° So here for example, 
f is also a constant function. 


Example 511. The function g: [1,2] > R defined by f (x) = 0 is a zero function. (What 
is g (1)? g(0)?)"" 


Example 512. The function h : {9R\, @} > {0, 4h, W} defined by h(x) = 0 is a zero 
function. (What is h (PW)? h(@W)? h(1)?)?8 


Definition 104. A power function is any nice function defined by «+ x", where k€ R. 


Example 513. The function f : R > R defined by f(x) = x'"? is a power function. 








(What is f (1)? f (0)?)?° 





2° f (1) =5, f (0) = 5. 

*°lIts mapping rule may be written as x ag + a2 + Gov? +++: + Gnx", where ao, a1,...dn € R and ne Lis 
(in particular, ao is the constant to which each x is mapped and n = 0). 

2529 (1) = 5, g (0) is undefined. 


253 p, (PR) = FR n(w) = SPR, 2 (1) is undefined. 
2545 (=) and 7 (w) are undefined, 7 (1) = wy, and 7 (0) = wv. 
*° f (1) = 0, f (0) = 0. 


2°°Tt maps each x in its domain to the same object (in this case, 0). 
2°79 (1) = 0, g (0) is undefined. 


258 p, (-R) =O0,h (w) = 0, h(1) is undefined. 
*9F(1) =1, f (0) =0. 
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Example 514. The function g : [1,2] > R defined by g(x) = 2” is a power function. 
(What is g (1)? g(0)?)? 


Definition 105. A trigonometric (or circular) function is any nice function with the 
same mapping rule as sin, cos, tan, cosec, sec, or tan. 


Example 515. Of course, sin is a trigonometric function. 
But so too is the function f : [1,2] > R defined by f (x) =sinz. 


Figure to be 
inserted here. 


Example 516. Of course, cos is a trigonometric function. 


But so too is the function g : [2,3] > R defined by g(x) = cosa. 


Figure to be 
inserted here. 








2609 (1) = 1, g (0) is undefined. 
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Example 517. Of course. sin7! is an inverse trigonometric function. 
9 


But so too is the function f : [0,1] + R defined by f (x) = sin 2. 


Figure to be 
inserted here. 


Example 518. Of course, cos is an inverse trigonometric function. 


But so too is the function g:[0,0.5] > R defined by g(x) = cos! z. 


Figure to be 
inserted here. 


In Ch. 28, we defined 





¢ The natural logarithm function In: R* > R; and 


¢ Its inverse the exponential function exp: R > R*. 


We can also define 


Definition 107. A natural logarithm function is any nice function with the mapping 
cr ing. 


Definition 108. An exponential function is any nice function with the mapping 7 & 
exp x. 
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Example 519. The function f : [1,2] — R defined by f (x) =Inz is a natural logarithm 
function. 


Figure to be 


inserted here. 


The function g:[1,2]—> R defined by g (x) = exp is an exponential function. 





All of the functions discussed in this brief chapter are elementary functions. Also, any 
arithmetic combination (Ch. 20) or composition (Ch. 22) of two elementary functions is 
also an elementary function. Formally, 


Definition 109. An elementary function is 


a polynomial function, a trigonometric function, 

an inverse trigonometric function, a natural logarithm function, 
an exponential function, a power function, 

any arithmetic combination of two elementary functions, 


or any composition of two elementary functions. 





Most functions you’ll encounter in H2 Maths are elementary. Through arithmetic combi- 
nations and compositions, we can build ever functions that “look” ever more complicated 
but are nonetheless elementary: 


Example 520. Define f:R* > R by 


f (w) = 14+ 22 + 3sin [cos (1 + 2° -Inx) |, 





While f may look complicated, it is nonetheless elementary. 
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Example 521. The absolute value function |-|: IR > R is defined by 


Te for x > 0, 
Jz] = 


-2, for x <0. 


At first glance, the absolute value function doesn’t seem to be an elementary function. 


But in fact, it is. To see why, recall Fact 12: 
le| = a2, tor alla eR, 


So, define f : IR} > R by f (z) = Va and g:R—R by g(a) = 2’. 


Both f and g are elementary. Moreover, |-| is the composition of f and g: 


In| = f (9 (w)) = f (2?) = V2. 


Hence, by Definition 109, |-| is an elementary function. 





To repeat, most functions we’ll encounter in H2 Maths are elementary. 


Probably the only non-elementary function we'll spend significant time on is the cumula- 
tive density function of the normal distribution (Part VI, Probability and Statistics). 


It is possible to prove that the derivative (if it exists) of any elementary function is also 
elementary. However and importantly, the integrals of many elementary functions are not. 
We'll learn a little about this in Part V (Calculus). 








6lDefinition 109 is merely this textbook’s and also ProofWiki’s. 
e Some writers consider the inverse of any elementary function to also be elementary (we do not). 


¢ Most (including me) consider the composition of any two elementary functions to also be elementary. 
However, at least one person does not (davidlowryduda). 
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38. Factorising Polynomials 


In this chapter, we’ll learn methods for factorising polynomials. But first, we must learn 
to long-divide polynomials: 


38.1. Long Division of Polynomials 


We'll continue using the terms dividend, divisor, quotient, and remainder (Ch. 2): 


Example 522. Consider (27 +1) +2. We have 


P(t) g(x) a(x) (2) 
— » atic, Cay —— 


Zee le= 2s ee +o 


The four polynomials labelled above have the same four names as before: 


The dividend The divisor The quotient The remainder 
Oe) = 2741 d(2) =2 siz \=2 pie el 





In the above example, it was kinda obvious that the quotient had to be q(x) = 2. In the 
next example, it’s a little less obvious and long division will be helpful: 
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Example 523. Consider (x? + 3)+(x- 1). The dividend is p(x) = 2?+3 and the divisor 
is d(x) =x-1. This time, it’s not so obvious what the quotient q(x) should be. But it 
turns out that just like with (simple) division (of two numbers), long division can help 
us here. 


With (simple) long division, going from right to left, the columns were 1s, 10s, 100s, etc. 
Here with polynomial long division, going from right to left, they’re the constant x° term, 
the linear z' term, the squared x” term, the cubed term 2°, etc. 

Terms: 1” 


Explanation 





c-1| x 

q a-(x-1)=2*-2 

(7? 23)— (2? =a) =243 

1-(a-1)=a2-1 
(x+3)-(4%-1) =4 


The quotient is q(x) = x +1, while the remainder is r (x) = 4. Altogether, 


p(x) d(x) q(x) AGE) 
a a ed 
a?+3=(r¢-1)-(x+1)+ 4 
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Example 524. Consider (3x? + x - 4) + (27-3). The dividend is p(x) = 32?+2-4 and 


the divisor is d(x) = 2x7 -3. 


Long division: 


Terms: 7° 


Explanation 





3 9 
gu (2x - 3) = 32° — 52 





(327 +0 - 4) — (30° - 5°) 


u 
=—7-4 
sie 





——— 
3a7+2-4= (2x -3)- 
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p(x) 


—_ 
3r2+0-4 3 


11 





(3-0) 


d(a) 
ns 3 


2 
p(x) 
(p 

+ 
4 





r+ 
24-3 2 
——— 
d(x) q(«) 
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Example 525. Consider (423 + Qa? + 1)+ Ue = 1). The dividend is p(x) = 4x? + 
2x7 +1 and the divisor is d(x) = 227-2 -1. 


Long division: 


Terms: 


Explanation 





De A a? 
Ax 20° (227 = 1) =47 = 9, 
(42° + 2a? +1) - os — 2x? -2x) = 407 +2r4+1 








) 
2-(2a°-x-1) =42?-22-2 
(42? + 2x + 1) - (4a? - 22 - 2) = 


47 +3 
So, the quotient and remainder are 


Wiej=274+2 and riaj=4e438. 
p(x) d(x) q(x) a) 
ee 


ne ——— 
4c? + 20? + 1 = (22? -a-1)-(2e +2) +4043. 
p(x) A) 
————\ 
Hop ibe ll i 4x +3 
7 20? -x2-1 
te SS 
d(x) 








The above examples suggest the following theorem and definitions: 


Theorem 8. (Euclidean Division Theorem for Polynomials.) Let p(x) and d(x) 
be P- and D-degree polynomials in x with D< P. Then there exists a unique polynomial 
q(x) of degree P—D such that r(x) = p(a)-d(x)q(a) is a polynomial with degree less 
than D. 

















Proof. See 1605 (Appendices). 


Definition 110. Given polynomials p(x) and d(a) and the expression p(x) +d(x), we 
call p(x) the dividend, d(x) the divisor, the unique polynomial q(x) given in the above 
theorem the quotient, and r(x) = p(x) -—d(x)q(«) the remainder. 


Exercise 156. For each expression, do the long division and identify the dividend, divisor, 
quotient, and remainder. (Answer on p. 1804.) 


16x +3 = 3a = | Viens a) 


Drege ras O [eps 
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38.2. Factorising Polynomials 
Example 526. Consider the 2nd-degree (or quadratic) polynomial x? + 4x +3. 
It can be factorised (i.e. written as the product of lower-degree polynomials): 
x? +42 +3=(£+1)(r+3). 


We say that each of x + 1 and x +3 is a factor of the given polynomial (or equivalently, 
divides it). 


Note that each of x + 1 and x +3 is itself a 1st-degree (or linear) polynomial. 


Example 527. The quadratic polynomial 6x? + x - 2 can be factorised: 


627+ 2-2 =(2r-1) (3r+2). 


Each of 27-1 and 32 +2 is a factor of (or divides) the given polynomial. 


By the way, any non-zero multiple of a factor is itself also a factor. So here, 


e Since 27-1 is a factor of the given polynomial, so too are 4x - 2, -6x+ 3, x -0.5, and 
any k (2x -1) (for k #0): 


6a? + -2= (a1) (Bn +2) = 5 (4-2) (80 +2) 


= —5 (Gr +3) (30 +2) = 2(0- 0.5) (Bir + 2) = Tk (2n- 1) (Bx +2). 


e Similarly, since 3x +2 is a factor of the given polynomial, so too are6x + 4, —9x - 6, 
1.57 +1, and any k (3x +2) (for k #0): 


6a? +2 -2 = (2n-1) (82 +2) = 5 (2a -1) (60+4) 


= 5 (2m - 1) (9x - 6) = 2 (2x ~ 1) (1.541) = = (20-1) k(80+2). 





Formal definitions of the terms factor of (or divides) and factorise: 


Definition 111. Let p(x) and d(x) be polynomials. We say that d(a) is a factor of (or 
divides) p(x) if there exists some polynomial q(x) such that p(x) = d(x) q(x). 


To factorise a polynomial is to express it as the product of polynomials, none of which 
is of higher degree and at least one of which is of strictly lower degree. 
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38.3. Compare Coefficients (plus Guess and Check) 


We'll call our first method for factorising polynomials Compare Coefficients (plus 
Guess and Check): 


Example 528. To factorise 3x7 + 5a + 2, write 
3a? +52+2 = (ax +b) (cx +d) = acz’ + (bc + ad)x + bd, 


where a, b, c, and d are unknown constants to be found. 


Comparing coefficients on the x”, x, and constant terms, we have, respectively, 
1 2 3 
ae =. bc+ad=5, and bd.= 2. 


Next, we guess and check: 


Of course, a, b, c, and d could be any numbers. But if the person who wrote this problem 
is nice,©? they’ll probably be integers. 


So, given Z let’s guess/try a = 3, c=1. 


And given 3 let’s guess/try b= 1 and d= 2. 


Unfortunately, with this set of guesses, 2 doesn’t hold: 
tesa =lto= 7 = 3. 
Well, let’s instead try b = 2 and d=1. With this second set of guesses, 2 now holds: 
be ed = 243 = 5, 
Yay! We’re done: 


3x7 + bx +2 = (84+2)(x+1). 





In the above example, we explicitly wrote out the unknown constants a, b, c, and d. 
When we look later at higher-degree polynomials, this can help reduce mistakes. But 
with quadratic polynomials, we can usually skip doing this and save ourselves some time: 





?62We can confirm that he is. 
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Example 529. To factorise 6x? — 11x - 35, first try 


Crier >) 67 =7= 3 


Nope doesn’t work. Second try: 
(Ge =n = on = ile — 35. 
Nope still doesn’t work. Third try: 


(3x —7) (22 + 5) = 6x? - 11x - 35. 





Yay works! We’re done! 
The Compare Coefficients (plus Guess and Check) method works well enough with 


quadratic polynomials. With higher-degree polynomials, we can sometimes get lucky and 
quickly arrive at the correct guess: 


Example 530. To factorise x? — 7x — 6, write 
a® — 72-6 =(ar+b)(cr+d)(ex+f). 


Comparing coefficients, we have ace = 1 and bdf = -6. 
So, let’s guess a=c=e=1 and b=2, d=-3, and f =1: 


(a + 2) (w@-3)(e+1) = (a? -2-6)(a+1) =22-2?-6r4+2?-2-6=23-Txr-6. 


Wah! So “lucky”! Success on the very first try! We’re done! 





But often, it will take more time (and pain), even with some lucky and intelligent guesses: 


410, Contents www.EconsPhDTutor.com 


Example 531. To factorise 152° — 8x7 — 9x + 2, write 


Ve or -97 = (a (ee ier 


= acex® + (acf + ade + bce) x” + (adf + bcf + bde) x + bdf. 
Comparing coefficients on the «°, x”, x, and constant terms, we have, respectively, 
ace + 15, acf + ade + bce 2 -8, adf +bcf +bde 2 -9, and bdf =2. 


Given z. let's try ¢ =5, c= 3, and e= 1. 


Given =. let’s try b= 2, d=1, and f = 1. Hm wait no, observe that some coefficients are 
negative—so, it can’t be that a—f are all positive. 


So, let’s instead try b = -2, d=-1, and f =1: 

acf +ade + bce =15-5-6=4#-8. 
Hm too big. Let’s try switching b and d around—i.e. try b= -1, d= -—-2, and f =1: 

acf + ade + bce =15-10-3=2#-8. x 
A little closer but still wrong. But ah, we now notice that the numbers 15, 10, and 3 
look a lot like they can be made to add up to —-8. Indeed, let’s see what happens if we 
try b=-1, d=2, and f =-1: 


acf + ade + bce = -15+10-3=-8. J 


Yay! 2 holds! Cross our fingers and check/hope that 3 also holds: 


adf + bcf + bde = -10+3-2=-9. 
It does! And so we’re done: 


15a? — 8a? —- 9x2 +2 = (52 -1) (34 +2)(x-1). 





Shortly we’ll also learn of other tools to factorise higher-degree polynomials (e.g. Factor 
Theorem and Intermediate Value Theorem in Chs. in 38.6 and 38.8). 


Exercise 157. XXX (Answer on p. 411.) 
A157. 
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38.4. The Quadratic Formula 


For quadratic polynomials, there’s actually no need to use the Compare Coefficients method 
at all. Instead, we can simply use the quadratic formula (Ch. 14), which involves no 
guess and check. 


Consider a quadratic polynomial ax? + br +c. Its discriminant is b? - 4ac. By Fact 34(b), 


(i) If b? - 4ac > 0, then 


—b- Vb? =) («- —-b+ V6 ) 


2 
+ba+c= — 
az’ +br+c=a (« on Da 


(ii) If b? — 4ac = 0, then 
» \2 
aa? + br +c=a(x+>-) 


(iti) If b? —4ac < 0, then ax? + bx +c cannot be factorised (unless we use complex numbers, 
as we'll learn in Part III). 


Example 532. The quadratic polynomial x? — 3x +2 has discriminant b? — 4ac = eas - 
AG (2) =I So. 


=(x-1)(4-2). 


i 3042= (2-244) (2-249) 


2 Z 


Example 533. The quadratic polynomial 32? + 52 +2 has discriminant b? — 4ac = 5? - 
AG) = iy so. 


2°3 2°3 


30? 5049-3 (2- 


Sa) (e- S54) 3+ (e+ 9) - +n Gera) 


Example 534. The quadratic polynomial 4%?-127+9 has discriminant b?-4ac = ( E12). — 
4(4) (9) =0, So: 


da? - 122 +9=4(0+ — 


Example 535. The quadratic polynomial 327-22 +1 has discriminant b? -4ac = ‘ey - 
4(3) (1) <0. So, 32? - 2x2 +1 cannot be factorised. 





Exercise 158. XXX (Answer on p. 412.) 
A158. 
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38.5. The Remainder Theorem 


You’re supposed to to have learnt this result in O-Level Additional Maths: 





Proof. By Definition 110, the remainder is r(x) = p(x) -(a-a)q(2). 


Since x-a is a 1st-degree polynomial, by Theorem 8, r(x) is a Oth-degree polynomial, i.e. 
a constant. 


So, we may simply write r(x) = R= p(x) —(z-a)q(z), where R doesn’t depend on z. 














Now plug x =a into = to get R=p(a)-(a-a)q(a) =p(a). 


Example 536. Consider p(x) = 2?-5xr+1. By the Remainder Theorem (RT), p(x) 
divided by x — 3 leaves a remainder of p(3) = 3? -5(3)+1=-5. t 


We could’ve figured this out using long division (below), but clearly the RT is a lot 
quicker. 
‘Terms 


Squared Linear Constant 
-2 











Example 537. Consider the quintic polynomial p(x) = 17x? - be a + By the An 
p(x) divided by x — 1 leaves a remainder of p(1) =17-1°-5-14+1?+1=14. a 


We could’ve figured this out using long division (I didn’t bother but you can try this as 
an exercise), but clearly the RT is a lot quicker. 





Historically, the Remainder Theorem rarely featured on the A-Level exams ... which means, 
of course, that it made a sudden appearance in 2017 just to screw students over.7°? So yea, 
it’s another thing you’ll want to remember. An exercise to help with that: 


Exercise 159. For each, find the remainder. (Answer on p. 1805.) 


(a) (207+ 72? - 32 +5) +(«-3) (b) (-2a4+32?- 70-1) +(x +2). 





For A-Level Maths, the Remainder Theorem will have little use (except to screw over 
unsuspecting students). Instead, its corollary—the Factor Theorem—will be more useful 
for factorising polynomials. 





?63See Exercise 534(a) (9758 N2017/1/5). 
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38.6. The Factor Theorem 


Another result you’re supposed to have learnt in O-Level Additional Maths: 


Theorem 10. (Factor Theorem) [f p(x) is a polynomial and a is a constant, then 


x-a is a factor for p(x) pia) =0, 





Proof. By the Remainder Theorem (Theorem 9), p (a) +(#- a) leaves a remainder of p (a). 
By Definition 111, «-a is a factor for p(x) if and only if the remainder is zero, i.e. 


played. 














Example 538. To factorise p(x) = 2? — 3x +2, we can use the Factor Theorem (plus 
Guess and Check) method: 


First try plugging in the number 1: 
p(1)=1°-3-1+2=0. J 


Wah! So “lucky”! Success on the very first try! Since p(1) =0, by the Factor Theorem, 
x —1 is a factor for p(z). 


Since p(x) is quadratic (or degree 2), its other factor must be a linear (or degree-1) 
polynomial, i.e. of the form ax+b. To find this other factor, we could continue trying 
our luck with the Factor Theorem (i.e. try plugging other numbers into p(x)). But we 
won't do that. It’s easier to write 


x? — 32 +2=(x-1)(ar+b) Zax? + (b-a)z-, 


where a and b are unknown constants to be found. Comparing coefficients, we have a = 1 
and b = —-2 (no guess and check needed). And now, we’re done: 


p(x) =a”? -32+2=(x-1)(x-2). 


Note: Above, we can actually skip writing out 2 because from 4 alone, we can easily tell 
that a= land) -=—2, 
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Example 539. Factorise p(x) = 3x? + 5a +2 
Try plugging in the number 1: 


p(1)=3-17+5-1+2=100. x 


Aiyah, sian, doesn’t work: Since p(1) #0, by the Factor Theorem, x - 1 is not a factor 
for DCE). 

By the way, here’s a tip to save you time: Above we found the exact value of p(1) to be 
10. But actually, we can skip finding the exact value. All we want to know is whether 
p(1) equals zero (or not). And by observing that each of the three terms is obviously 
positive, we can tell that p(1) is also obviously positive and, in particular, not equal to 
Zero. 


Next try -1: 


p(-1) =3-(-1)°+5-(-1)4+2=0. J 


Yay, works! Since p(-1) = 0, by the Factor Theorem, x-(-1) = 2+1 is a factor for p(2). 


As in the last example, to find the other factor, write 
p(x) = 3a7+52+2=(x+1)(ax+b). 
Comparing coefficients (no guess and check needed), a = 3 and b = 2. So, 


p(x) = 347 +52+2=(x+1)(32+2). 
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We now try using the Factor Theorem to factorise higher-degree polynomials: 
Example 540. To factorise p(x) = 152° — 17x” — 22a + 24, try plugging in 1: 
p(1)=15-1°-17-17-22-1+24=0. J 


Wah! So “lucky”! Success on the very first try! Since p(1) =0, by the Factor Theorem, 
x —1 is a factor for p(x). 


Since p(x) is cubic (degree 3), p(x) divided by x - 1 gives us a quadratic (degree-2) 
polynomial ax? + ba + c that we now want to find. 


Write 15a - 17a? - 227 + 24 = (x - 1) (ax? + br +c). 


Comparing coefficients on x° and the constants, a = 15 and c= —24. 


Comparing coefficients on x”, -17 = -a+b=-15+b. So, b=—-2. 
Hence, az’ + ba +c = 15x? - Qe - 24. 


We next factorise 1522-27-24. To do so, we can use any of the three methods introduced 
so far (Compare Coefficients, Quadratic Formula, and Factor Theorem). Here we'll just 
use the quadratic formula (which is the only one of the three that doesn’t involve any 
guesswork): 


The discriminant is b? - dac = (-2)? - 4(15)(-24) = 4+ 1440 = 1444 > 0. Moreover, 
V 1444 = 38. Hence, 


150? — 20-24 = 15 (2-==*)} («- =") = 15 (a+ 2 
30 30 5 


Altogethet,— 17x? - 227 + 24 = 15(2-1) (z 4 =) (z - 5) =(x-1)(54+6)(3a-4). 


Exercise 160. Factorise each polynomial. (Answer on p. 1805.) 


(a) 22?-52-3 (b) 7x?-197-6 (c) 6a27+z7-1 (dd) 222-2? -172r-14. 
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38.7. The Integer and Rational Root Theorems 


The Integer Root Theorem (IRT) says that given a polynomial p(x) with integer 
coefficients, every integer solution to p(x) = 0 divides the constant term: 


1hetpyatpo with 


Corollary 15. (Integer Root Theorem) Let p(x) = ppx"+pp-1u"™ 


0; P1,---;Pn€Z. Supposeae Z. If p(a) =0, then po/a€ Z. 














Proof. See p. 419. 





The IRT may seem a bit abstract. But its significance is this—given a polynomial with 
integer coefficients, we can now find every factor of the form x + a (where a € Z): 


Example 541. Factorise p(x) = 2° + 627+ 11lz+6. 
The coefficients are all integers. So, the IRT says this: 


If a€ Z solves p(x) =0, then a divides the constant term 6—i.e., a must be 1, -1, 2, -2, 
3, —3, 6, or -6. 


So, let’s check if the value of p at any of these values is zero: 

¢ Clearly, if x >0, then p(x) >0. So, p(1), p(2), p(3), and p(6) are not zero. 
«p-l)= (Gl) 46C1) £11 C-1)+6=-146—1146=0 

© p(-2) = (-2)* +6 (-2)* +11 (-2) +6 =-8 +24-22+6=0 

(28) ee 26-8) 11-3) 6s 274 94 3 4G a0 


« p(-6) = (-6)* +6 (-6)* +11 (-6) +6 = -216 + 216 - 66 + 6 = -60 


Altogether, the IRT says that -1, -2, and -3 are the only integer solutions to p(x) = 0. 
And so, by the Factor Theorem (FT), 2 +1, 7 +2, and x +3 are factors of p(z). 


Of course, since p(x) is a 3rd-degree polynomial, these are also the only factors and 
eae one Vile Ore er 2) ie 
(Indeed, once we found three values of a for which p(a) = 0—as we did above with -1, 


—2, and -3—we needn’t have bothered checking if p(-—6) = 0 because we should already 
have known that p(-6) #0.) 
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Example 542. Factorise p(x) = 42° + 4x? - 52 - 3. 
The coefficients are all integers. So, the IRT says this: 


If aé Z solves p(x) =0, then a divides the constant term -3—i.e., a must be 1, -1, 3, or 
—3. 


So, let’s check if the value of p at any of these values is zero: 

e p(1) =4+4-5-3=0 

e p(-1) =-4+4+5-3=2 

¢ p(3) =108+36-15-3#0 

¢ p(-3) =-108+36+15-3+#0 

Altogether, the IRT says that 1 is the only integer solution to p(x) = 0. And so, by the 


Factor Theorem (FT), x-1 is the only factor of p(x) that is of the form x +a, where a 
is an integer. 


Note though that the IRT does not rule out the possibility of other non-integer solutions 
to p(x) = 0. So, there is the possibility p(x) has other factors of the form x +a, but 
where a is not an integer. This turns out to be the case: 


Write p(a) =42° +42" 52-3 = (2-1) (427 + br +c). 


Comparing coefficients on x? and the constant term, we have -4+b=4 and -c=-3. So, 
b=8 and c= 3. Hence, 


p(x) = 42° + 42” - 52-3 = (x - 1) (42? + 8x +3) 
To factorise 47? + 82 + 3, I’ll simply use the quadratic formula: 


_ -8+/8?-4(4)(3) 18 
_ 2(4) eo 








1 a 
Thus, 477+ 87+3=k (z + 5) (z + 5) where clearly k = 4. Altogether, 


2 
p(o) = (w= 1) (40? +80 +3) = 4(e-1) («+ 5) («+5). 


So, p(x) does have other factors of the form x +a, but where a is not an integer and so 
could not be detected by the IRT. 





418, Contents www.EconsPhDTutor.com 


The Rational Root Theorem (RRT) is a more general version of the Integer Root 
Theorem. Let’s first do a very quick primary-school review of what it means for a fraction 
to be fully reduced: 


128 
Example 543. The fraction ai is not fully reduced because we can further reduce it 
128 64 32 16 
Mo 12 6 3 





(i.e. cancel out common factors in the numerator and denominator): 


1 64 ae 
The fraction = is fully reduced. The fractions 1p and i are not. 





More precisely and formally, 


Definition 112. Let N,D«Z. Suppose there is no integer f greater than 1 such that 
N/f and D/f are both integers. Then we say that the fraction N/D is fully reduced (or 
fully simplified); we also say that N and D share no common factors greater than one. 





Now, the RRT says that given a polynomial p(x) with integer coefficients, every (fully 
reduced) rational solution to p(x) = 0 is such that its 


e numerator divides the constant term; and 
e denominator divides the leading coefficient. 


More precisely and formally, 


Theorem 11. (Rational Root Theorem) Let p(x) = ppx" + ppv” | +--+ + pix + po 


with po,P1,---,Pn€Z and also N, De Z. Suppose N/D is fully reduced. 
Po Pn 


N’ D 





N 
itr(5)=0. then —, — € Z. 














Proof. See p. 1606. 


(By the way, given the RRT, we can now easily prove the IRT: Simply plug in D = 1.) 


Again, the IRT may seem a bit abstract. But its significance is this—given a polynomial 
with integer coefficients, we can now find every factor of the form x + a (where a € Q): 
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Example 544. Let p(x) = 24x? + 2627+ 9x +1. 
The coefficients are all integers. So, the RRT says this: 


N 
If D (fully reduced) solves p(a) = 0, then N divides the constant term 1 and D divides 


the leading coefficient 24—i.e. N ee be 1 or -1, while D must be 1, 2, 3, 4, 6, 12, 24 
Le SIA SA lessee 41 al 


h ive of th l SEAS) ee Geeta ee 
(or the negative of these values). So, — ~ must bez “7 5? 5? 3-3 2? BB? 1 


1 1 
-—, —, or -—. 
12° 24 24 
So, let’s check if the value of p at any of these values is zero: 


¢ Clearly, if x > 0, then p(x) > 0. So, none of p(1), p(1/2), p(1/3), p(1/4), ps0), 
p(1/12), and p (1/24) equals 0. 


° p(-1) = -24+26-94+1#0 


1 13 9 
e i — Pee l= 
r| 5) 5 a 0 


8 26 
Beenie ee, 
Tl oa 


Lp eee er 
1616 4 


1 1 1 
At this point, we’ve already found three factors—(a + 5) (z + 5) and (z + ;) So, we 


1 
needn’t bother checking p (=), D ( 3): and r| 
can go ahead and conclude 


p(v)=k(e+5)(e+5)(0+3), 


=) (we know they’ll all be non-zero). We 


where clearly k = 24. 
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Example 545. Let p(x) = 22° + 4x? - x - 2. 
The coefficients are all integers. So, the RRT says this: 


N 
If D (fully reduced) solves p(x) = 0, then N divides the constant term —2 and D divides 
the leading coefficient 2—i.e. N must be 1, 2, -1, or -2; and D must be 1, 2, -1, or -2. 
g N a DOP AS ali? 
—m en See 
A eee gl 
course just repetitions of the first two.) 


Oe = —1. (The last two possibilities are of 


So, let’s check if the value of p at any of these values is zero: 
« p(1)=2x1?+4xV-1-2=3 

© Ole? (1) S421) 221 

© p(2)=2x23+4x2?-2-2=28 

ap (2) =D x (2) 44-2) (2) = 2 =0 


“o()-()()-G-e 


3 2 
Cle) ae ; 
2 2 2 D 4 
Altogether, the RRT says that —2 is the only rational solution to p(x) =0. So, x +2 is 


the only factor of p(x) that is of the form x+a with ae Q. 


Note though that the RRT does not rule out the possibility of other irrational solutions 
to p(x) = 0. So, there is the possibility p(a) has other factors of the form x +a with 
a¢éQ. This turns out to be the case: 


Write P(e) = 2n° 4a? =a -2= (7 42) (227 + br +c). 


Comparing coefficients on x” and the constant term, we have 4+b = 4 and 2c = -2. So, 
b=0 and c=-1. Hence, 


p(x) = 2x°+4a?-x-2 = (x +2) (2a? - 1) = 2(a+ 2) (2-5) =2(x+2) («+ =} (“- =} 


So, besides «+2, p(x) does have two other factors of the form «+a, but where a ¢ Q and 
so could not be uncovered by the RRT. 





Exercise 161. Factorise each polynomial. (Answer on p. 421.) 
XXX 
A161. 
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38.8. The Intermediate Value Theorem (IVT) 


The Intermediate Value Theorem (IVT) is another useful tool for factorising polyno- 
mials. We first describe it informally with an example: 


Example 546. Below is part of the graph of some continuous function f:R— R. 


We are missing the graph of f for the interval (1,3). 


Say we know that f (1) = -2 and f (3) = 2. What then can we say about the missing 
portion of the graph? 


IN 
Y 


(1,=2) 


We have most of the 
graph of f, but are 
missing the interval (1,3). 





Since f is continuous, it must be that we can draw its entire graph without lifting our 
pencil. In particular, we can connect the dots (1,-2) and (3,2) without lifting our 
pencil. But obviously, the only way to do so is to have our pencil “go through” every 
value between —2 and 2. That is (and this is what the IVT says), 


f must attain (or “hit”) every value between —2 and 2. 





And yup, that’s all the IVT says—if f is continuous on the interval [a,b], then f must 
attain (or “hit”) every value between f(a) and f(b) in the interval (a,b). A bit more 
formally, 


Theorem 12. (Intermediate Value Theorem) /f f is continuous on the interval 





[a,b], then for every y€ (f (a), f (b)), there exists x € (a,b) such that y = f (x). 
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Proof. Omitted.?°+ 











Let’s now see how the IVT can help us factorise polynomials. 


Example 547. To factorise p(x) = 2x7 + 9x —5, it’s probably quicker to use either the 
Compare Coefficients (plus Guess and Check) method or the quadratic formula. 


But here, just to illustrate how the IVT can be used, we’ll start instead by using the 
Factor Theorem (FT). We try plugging in 1: 


p(1) =2-17+9-1-5=6+#0. 


Aiyah, sian, doesn’t work: Since p(1) #0, by the FT, 2-1 is not a factor for p(z). 


Here we could immediately try another guess with the FT. But instead, here we can first 
make use of the IVT: 


Observe that p(0) =—5. What good is this observation? 


Well, since 0 is between p(0) = —5 and p(1) = 6, the IVT says there must be some 
aé (0,1) such that p(a) =0. Cool! 


So, guided by this knowledge, let’s try the FT again by guessing 1/2: 


2 
) +9-5-5=0 
2 


Yay, works! Since p(1/2) =0, by the FT, x -1/2 or 2x-1 is a factor for p(z). 
Write Qn? + 92 -5 = (2x -1) (ax+b). 
Comparing coefficients, a = 1 and b= 5. Hence, 


p(a) = 247 +92-5 = (24-1) (x+5). 





Example 548. XXX 


Example 549. XXX 


Exercise 162. XXX (Answer on p. 423.) 
A162. 





264See e.g. O’Connor (2001). 
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38.9. Factorising a Quartic Polynomial 
For the A-Levels, you’ll often have to factorise quadratic polynomials and, sometimes, cubic 


polynomials. 


It’s unusual that they ask you to factorise polynomials of a higher degree. And if they do, 
the nice MOE folks will usually give you a little help.?°° 


In the next example, we factorise a quartic (degree-4) polynomial using what we’ve learnt. 
It’s long and tedious, but conceptually, it’s not any harder than what we’ve already done: 


Example 550. Factorise p(x) = 6x* + 13° — 29a? — 52x + 20. 
First try plugging in 1: 


pl) 6-17 418-19 = 29-17 = 52-1420 <0. 


(Again, we don’t need to compute the exact value of p(1) to see that p(1) <0.) 


Aiyah, sian, doesn’t work: Since p(1) #0, by the Factor Theorem (FT), 2-1 is not a 
factor for p(2). 


But now, observe that 0 is between p(0) = 20 and p(1) <0. And so, the IVT says there 
must be some a € (0,1) such that p(a) = 0. So, let’s next try 1/2: 


p(4)=6-(4) +13-(4) -20.(4) -s2-(2)+20<0 


(Again, we don’t need to compute the exact value of p(1/2) to see that p(1/2) < 0.) 


Aiyah, sian, still doesn’t work: Since p(1/2) #0, by the FT, 7-1/2 is not a factor for 
p(a). 

But again, observe that 0 is between p(0) = 20 and p(1/2) < 0. And so, the IVT says 
that there must be some b € (0, 1/2) such that p(b) = 0. So, let’s next try 1/3: 


il i iy lag 1 
2) 26 3| =) 9484] =| 2205) | = a> | =| 20 
(5) (5) i (5) (5) ik 
G@ ie oon eae 213 29 8 15 5_ 


zee ae ea 
81 27 Te 97" 97,9 °3° 27 «9 


Yay, works! Since p(1/3) = 0, by the FT, x - 1/3 or 3a —1 is a factor for 6x* + 132° — 
29a* — 52a + 20. 


(Example continues on the next page ...) 








26>See e.g. 9740 N2010/II/1(b) (Exercise 651). 
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(... Example continued from the previous page.) 


Dividing p(x) by 32-1 should yield some cubic polynomial. So, write 


P@yaC2 + 13n°= 207° 527420 = (r= 1)2r sar +brs e). 


Comparing coefficients on the constant, x°, and x” terms, we have —c = 20, 3a - 2 = 13, 
and 3c-—b=-52. So, c= -20, a=5, and b=-8. 


To factorise 27? + 5x? — 8x — 20, we try plugging in 2: 
p(2) =2-23+5-2?-8-2-20=16+20-16-20=0 


Yay, works! Since p(2) = 0, by the FT, x —2 is a factor for 27? + 5x? — 8x — 20. 
Dividing 22° + 5x? — 82 - 20 by x — 2 should yield some quadratic polynomial. So, write 


2x° + 5a? — 8x - 20 = (x - 2) (227 + dr +e). 


Comparing coefficients on the constant and x? terms, we have —2e = —20 and d-4 = 5. 
po, €= 0 and d=9. 


To factorise 2x7+92+10, we use the Compare Coefficients (plus Guess and Check) method 
and try 


(2x +5) (x +2) = 22? + 9x +10. J 
Wah! So “lucky”! Success on the very first try! And now, at long last, we’re done: 


Oi ta Or S167 = 907 = 522 20 = ep a ta) 
Exercise 163. Let p(x) = ax’ + ba? — 31x? + 3x +3, where a and b are constants. You are 
told that (i) p(a) divided by x - 1 leaves a remainder of 5; and (ii) p(0.5) =0. 
(a) Find a and b. 


You are now also told that (iii) p(-1/3) < 0. 
(b) Factorise p(x). (Answer on p. 1806.) 
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38.10. Two Warnings 


All of our examples have so far involved a n-degree polynomial that can be fully factorised 
into n 1s-degree (or linear) factors. But this need not always be the case: 


Example 551. Consider the quartic polynomial «4 - 1. We can write 
vt -1=(2?+1)(e+1)(e-1). 
Unfortunately, x? + 1 has negative discriminant and so cannot be further factorised.?© 


Example 552. Consider the quartic polynomial «* + 52?+4. We can write 


Bor toate (ge ted), 


Both x? +1 and x? +4 have negative discriminants and so cannot be further factorised.?°’ 


Example 553. The quartic polynomial «* + x + 1 cannot be factorised at all.?° 


The above three examples illustrate these two important warnings: 





1. Not every n-degree polynomial can be fully factorised into n linear factors. 


2. Indeed, an n-degree polynomial may not even have a single linear factor! 


Exercise 164. XXX (Answer on p. 426.) 
A164. 





66Unless we use complex numbers, which we’ll learn about in Part III. With complex numbers, we can 
write 27+ 1=(x+i)(2-i) and thus 2*-1=(x+i)(x-i)(r+1)(x-1). 

267 Avain, with complex numbers, we can write x? +1 = (x+i)(2-i), v7+4 = (w+ 2i)(#-2i), and thus 
xi + 5x? +2=(x+i)(x-i) (a + 2i) (wv -2i). 

268 Again, with complex numbers, it’s actually possible to factorise x* +x +1 into four linear factors. 
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39. Solving Systems of Equations 


Let’s start with two warm-up questions (PSLE and O-Level style): 


Exercise 165. Today is Apu, Beng, and Caleb’s birthday. When Apu turned 40 some 
years ago, Beng was twice as old as Caleb. Today, Apu is twice as old as Beng, while 
Caleb turns 28. What ages do Apu and Beng turn today?” (Answer on p. 1807.) 


Exercise 166. Planes A and B leave the same point at the same time. Plane A travels 
(precisely) northeast at a constant speed of 100kmh7!. Plane B travels (precisely) south 
at a constant speed of 200kmh'. After (precisely) 3 hours, both planes instantly turn to 
start flying directly towards each other (at the same speeds as before). How many hours 
after making this instant turn will they collide? (Answer on p. 1807.) 


at 


v4 A travels northeast 
at 100kmh7! 


B travels south 
at 200kmh7! 





v 


+ 


In Ch. 13, we learnt the terms solution and solution set, in the context of a single 
equation involving only one variable. ‘These terms also carry the same meaning in a system 
of equations, which is any set of m equations in n variables: 








269 Just to be clear, we use the standard western convention where one’s age on any day is an integer and 
increases by one on each birthday. (We do not for example use the East Asian convention.) 
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Example 554. Consider this system of two equations in two variables: 


yartl (x,y €R). 


In secondary school, we already learnt to solve such systems of equations (it’s just simple 
algebra): 


Plug 2 into = to get -r+3=2+4+1 or 2= 22 or x=1. Now from either + or 2 we can also 
get y = 2. Conclude, 


« “The system of equations has one solution: (2, y) = (1,2).” 
¢ Or more simply, “The system of equations has one solution (1, 2).” 
e Or, “The system of equations has solution set {(1,2)}.” 











428, Contents www.EconsPhDTutor.com 


Example 555. Solve this system of two equations in two variables: 


(x,y €R). 


2s 1 ae ee 2 
Plug = into = to get @ = Peo 023° -—22—-3 = (@—3) (e+ 1). So,a=3 or x = -1, 
And correspondingly, y = 3 or y = -1. Conclude, 


« “The system of equations has two solutions: (x,y) = (3,3) ,(-1,-1).” 


¢ Or more simply, “The system of equations has two solutions: (3,3) and (-1,-1).” 
e Or, “The system of equations has solution set {(3,3) ,(—1,-1)}.” 


IN 
y 
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Example 556. Here’s a system of three equations in three variables: 


y = 3a -2, z27-y, ciytz (x,y,z €R). 


Plug + and 2 into 2 to get c= 32 -2+7-y=324+5-y or y 220 +5. 

Plug 4 into = to get 22 +5=37-2 or 7=2. 

Pluie a= 7 into to get y= 19. Then plug y = 19 into 2 to get z = -12. Conclude 

« “The system of equations has one solution: (x,y,z) = (7,19, -12).” 

¢ Or more simply, “The system of equations has one solution (7,19, -12).” 

e Or, “The system of equations has solution set {(7,19,-12)}.” 

By the way, (7,19,-12) is this textbook’s first example of an ordered triple. This is 
exactly analogous to an ordered pair, except that now there are three coordinates. As 


you can imagine, we also have ordered quadruples, ordered quintuples, and more 
generally ordered n-tuples.?” 


When we study vectors in Part III, we’ll learn that it’s actually possible to graph the 
above three equations. (Spoiler: Each of the three graphs will be a plane in 3-dimensional 
space. ) 





A system of equations can have no solutions: 





2"0For the formal definition of an n-tuple, see Definition 267 (Appendices). 
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Example 557. Consider this system of two equations in two variables: 


2 


y tina and y 


a (z,y €R). 
Observe that the graph of + lies below that of 2 everywhere. Hence, no ordered pair 
(x,y) satisfies the above system of equations. Conclude, 

e« “The system of equations has no solutions.” 


e Or, “The system of equations has solution set @.” 


IN 
Y 
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Example 558. Consider this system of two equations in two variables: 


L 2 


y=-1 (x,y eR). 


Observe that the graph of + lies above that of 2 everywhere. Hence, no ordered pair 
(x,y) satisfies the above system of equations. Conclude, 


e« “The system of equations has no solutions.” 


¢ Or, “The system of equations has solution set 2.77 











Example 559. Consider this system of four equations in three variables: 


l 2 


Zs, 223, ry +0 (x,y, 2 €R). at 


(ele y 


There is no (x,y,z) that satisfies the above four equations. So there are zero solutions 
to the above system of equations. Equivalently, 


« “No (2,y,z) is a solution.” 


e Or, “The system of equations has solution set @.” 





At the other extreme, a system of equations can have infinitely many solutions: 














2"IBut as we'll learn in Part IV, if we rewrite the system of equations so that “x,y ¢ R” is replaced by 
“x,y €C”, then we’d instead conclude, 





« “The system of equations has two solutions: (x,y) = (-i,-1), (i,-1).” 
¢ Or more simply, “The system of equations has two solutions: (—-i,-1) and (i,-1).” 


e Or, “The system of equations has solution set i, -1.” 
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Example 560. Consider this system of two equations in two variables: 


2Qy = Qa (z,y € R). 


Observe that + and = are really the same. And so, this system of equations has infinitely 
many solutions. Specifically, 


e “Any (x,y) for which y = x is a solution.” 

e Or, “The system of equations has solution set {(x,y): x = y}.” 
Alternatively and equivalently, 

¢ “For any c€ R, (c,c) is a solution.” 


e Or, “The system of equations has solution set {(c,c): ce R}.” 


IX 
Y 








433, Contents www.EconsPhDTutor.com 


Example 561. Consider this system of two equations in two variables: 


2 


y + cosx and y= ll 


This system of equations has infinitely many solutions. Specifically, 

e “Any (2, y) for which x = 2k7 (for any k € Z)and y = 0 is a solution.” 

e Or, “The system of equations has solution set {(x,y): x = 2k7,y =0,k € Z}.” 
Alternatively and equivalently, 

e “For any k € Z, (2k7,0) is a solution.” 

e Or, “The system of equations has solution set {(2k7,0):keZ}.” 








Example 562. Consider this system of two equations in three variables: 


(x,y,z €R). 


By 2 any solution must have z = 0. Plug 2 into + to also get x = 0), 
restrictions on what y can be. 


There are 


So, there are infinitely many solutions. Specifically, 

e “Any (x,y,z) for which x =0 and z =0 is a solution.” 

e Or, “The system of equations has solution set {(x, y, z): x = 0, z =0}.” 
Alternatively and equivalently, 

e “For any c€R, (0,c,0) is a solution.” 


e Or, “The system of equations has solution set {(0,c,0):ceR}.” 
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In the context of a system of equations, here are the formal definitions of a solution and 
the solution set:?” 


Definition 113. Given a system of equations (and/or inequalities) in n > 2 variables, we 


call any ordered n-tuple that satisfies the system a solution (or root). And the set of all 
such solutions is called the solution set of that system. 





Exercise 167. XXX (Answer on p. 435.) 
A167. 

Example 563. Let a,b,c,z,y ¢ R. Suppose the equation y = ax? + bx +c has solutions 
(0,1), (2,3), and (4,5). Then what are a, b, and c? 


Here things seem a little strange: We speak of the equation y = ax? + bx +c and its 
solutions, with x and y being the variables. 


Yet what we really want to do is to solve the following system of equations, where a, b, 
and c are the variables and we’ve plugged in the given values of x and y: 
12a-02+b-0+c=c, 
320-2 4b-24+e=4a42%b+e, 


534@-424b-44+c=1l6a+4b+c. 


2 


As usual, this is just simple algebra: = - i yields 4a + 2b =2 and 2-4 yields 16a + 4b 24. 
Now, 29x = yields 8a = 0 or a= 0. Now from =, we also have b= 1. Finally, from 2 we 


also have c= 1. Altogether, 


(a,0,c) =(0, 1.1). 
Below are two similar exercises. (See also N2011/I/2—Exercise 554.) 


Exercise 168. Let a,b,c,z,y€R. (Answer on p. 1808.) 


(a) If y = ax? +bx+c has solutions (x,y) = (1,2), (3,5), (6,9), then what are a, b, and c? 


(b) If y = az? + bx +c has the solution (x,y) = (-1,2) and the strict global minimum 
point (0,0), then what are a, b, and c? 








2"2Note that these are exactly analogous to our earlier Definition 59. 
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39.1. Solving Systems of Equations with the TI84 
You need to use your graphing calculator to solve systems of equations: 
Example 564. Consider this system of equations: 


4 3 5 


y=u°-2°- and Y= ne, 


We'll solve this system using two methods. 


Method 1. Graph both equations, then find the intersection points: 


1. Enter the two equations (precise instructions omitted). 
2. Press to graph the two equations. 
3. Press and to bring up the CALC (CALCULATE) menu. 
4. Now press [5] to select the “intersect” function. 
Your TI84 now asks, “First curve?” So, 
. Press to tell the TI84 that “y, = «4 - «?-5” is indeed our first curve. 


The TI84 now asks you, “Second curve?” This time, 


. Use the left and right arrow keys |(j and || to move the cursor to roughly where you 
think there is an intersection point. For me, I’ve moved it to (x,y) » (1.702, 0.532). 


7. Now hit (NWR. The TI84 will now ask you “Guess?”. 
. So hit WINN) again. After working furiously for a moment, the T184 will inform you 


that the intersection point is (x, y) » (1.87, 0.624). 


It looks though like there might be another intersection point near x = 0. And so, 
repeating the above steps but now moving the cursor left (instructions and screenshots 
omitted), you should find another intersection point at (x,y) * (0.00674, -5). 


Thus, the solutions are (1.87... ,0.623...) and (0.00674...,-5). 


Step 1. Step 2. Step 3. Step 4. 


Step 5. Step 6. Step 7. Step 8. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Method 2. Combine the two equations into the single equation y = x* - 2° -5-Inz. 
Graph it, then find the x-intercepts. Brief instructions: 


1. Graph y = 2*- 2? —5-Inc. 


It looks like there’s an x-intercept near 2. So, 
. Use the “zero” function to find that there’s indeed an x-intercept at x » 1.87. 


It’s not obvious, but it looks like there might be another x-intercept near the origin. 
So, 


3. Use the “zero” function to find that there’s indeed another x-intercept at x » 0.00674. 


With Method 2, we must plug in these two values of x into either of the original two 
equations to get the corresponding values of y. Then as before, we'll conclude that the 
solutions are (1.87...,0.623...) and (0.00674...,-5). 


Step 1. Step 2. Step 3. 
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40. Partial Fractions Decomposition 


This is yet another topic you’re supposed to have mastered in secondary school. 


As per p. 14 of your A-Level syllabus, you need only know how to decompose “partial 
fractions with cases where the denominator is no more complicated than 


° (ax +b) (cx+d) 
© (ax +b) (cx +d) 
© (ax +b) (2? +0*)” 


The lovely folks at MOE have kindly included the following on your List MF26 (p. 2): 


Partial fractions decomposition 


Non-repeated linear factors: 


px+q A B 


(ax+b)(cx+d) (ax+b) . (cx+d) 





Repeated linear factors: 


px? +qxtr A B Cc 


(ax + b)(cx +d)? 7 (ax +b) a (cx +d) . (cx+d)* 





Non-repeated quadratic factor: 


px? +qx+r A Bx+C 


(axt+b)(x2 402) (ax+b) (x? +c?) 








Let’s go through these three “categories” of partial fractions decomposition in turn: 
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40.1. Non-Repeated Linear Factors 


Example 565. Rewrite or decompose the expression into partial frac- 


; a? +3042 
tions. 


Observe that 27+ 32 +2 = (x+1) (x +2). So, write 
ihe 1 AB 
qe So ae) ere 


Be Ce) Ce re 
Se ee Gee 








From the numerator, 1 = (A+ B)x+2A+B. Comparing coefficients, A+B = 0 and 
9A+B?1. Taking Z_i yields A =1 and then B =-1. Hence, 


eae 1 
e2+3et+2 etl +2) 





, observe x7 + 34 +2 =(x+1)(x+2) and write 


5 
Example 566. ‘To decompose eee 
xu? + 3x42 


50 A ‘i BO Ate t 2 eel) Areata 
Corea 2 erie oy Ge tia) 


Comparing coefficients, A+ B=5 and 2A+ B=-3. So, A=-8 and B=13. Hence, 


bo-3 8 18 
e+3r+2 xvt+1l +2) 


9x -5 


er observe —z? + 5a -6 = -(x-3)(x-2) and 


Example 567. To decompose 


write 


oo Oy A P BAG 2) ia 3) (a= )\a ase 
0 Se ey ee) 


Comparing coefficients, A- B =9 and -2A+3B=-5. So, A= 22 and B=13. Hence, 
9x -5 22 13 22 13 


= 5p 56 9 Gs) F203 


Exercise 170. Decompose each expression into partial fractions. (Answer on p. 1810.) 
8 Gps 
b) ———.. 
(@) (>) 32-853 





2 +2-6' 
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40.2. Repeated Linear Factors 


Q 1 
Example 568. To decompose eas 5, write 
(x+1)(2-1) 





jee cage al i B F C DAG =) 8G Nao CoG | 
(e+1)(e-1)? #+1 2-1 (¢-1)9 (x+1)(x-1Y 
ee) ee ee 
; (2+1)(a-1)° i: (+1) (a-1) } 








Comparing coefficients, A+ B=1, -24+C=1, A- B+C =1. Summing these three 
equations, we get 2C = 3 or C = 3/2 and then also A=1/4 and B = 3/4. 


fe ae | 1 3 3 


So, 


Gei@-iy 26) 4@=1) 9@-17 


o6 
Example 569. ‘To decompose ———__—__,, write 
(4a - 1) (4 +2) 


oa alee : eB C 
(4-1) (42) 4m—1 0 we 2 (p49) 


: A(r+2)? + B(4a-1)(x+2)+C (42-1) 

7 (4c - 1) (x + 2)° 

_ A(x? +4a+4)+ B (4a? + 7x —-2)+C (4x -1) 

7 (4c - 1) (x + 2)° 

(A+4B)2?+(4A+7B+4C)r+4A-2B-C 
(4x - 1) (x +2)" 





Comparing coefficients, A + 4B 2 3, 4A+7B+4C 2 -1, and 4A-2B-C 21. 


Rearrange = to get A23-4B. Next, 2 minus 2 yields 9B +5C = -2 or C2 - (2+ 9B) /5. 
Plug = and 2 into 2 to get 4(3-4B) -2B+(2+9B) /5=1 or B = 19/27 and then also 
A=5/27 and C =-5/3. 


327 -x2+1 ) 19 5 


oe (4n—-1)(m+2)? 274e-1) 27(e+2) 3(a+2)" 


=a 7 


a eee into partial fractions. (Answer on p. 1810.) 
NG 


Exercise 171. Decompose 
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40.3. Non-Repeated Quadratic Factors 


Das | 
Example 570. To decompose (Gx i ial +)’ write 





Mee tesa A Tak , Ber _ Ae? +1) + (Be +C) (54-1) 
Ce eee (G72 1) Ges) 

MA ee OC fae MA ee Fe 8 aA} 

2 (5a —1) (x? +1) , (5a —1) (a? +1) 





Comparing coefficients, A+ 5B 2 2,5C -B 2 1, and A-C 21. 


Rearrange 2 to get A42-5B. Rearrange 2 to get have C 2 (B+1)/5. Plug # and 2 
into 2 to get 2-5B-(B+1)/5=1 or B=2/13 and then also A = 16/13 and C = 3/13. 


2 16 . Diges y 
G1) Fil) Bis) 13 G2 i) 


So, at 





Ag? — 37 +2 
Example 571. To decompose om 7) or 9)’ write 





Ay? 3E2 A , BurC _ A(x? +9)+ (Br + C) 2x +7) 
Cr a7 2 = * (22 +7) (x2 + 9) 
Ae OA Be Be ei NA ee) es ie 2C a Oe ne. 
7 (2x + 7) (22 +9) . (2x + 7) (x? +9) 








Comparing coefficients, A + 2B < 4,7B+2C 2 -3, and 9A+7C 39. 


Rearrange + to get A 44-2B. Rearrange = to get C 2 —(7B+3)/2. Plug = and 2 
into 2 to get 9(4-2B) + 7(-7B +3) /2 = 2 or B = 47/85 and then also A = 246/85 and 
C = -292/85. 


g Ag? —-37+2 246 F A7x — 292 

O A Fa SI pe, 

(22+7)(a2+9) 85(22+7) 85(22+9) 
—3x7 +5 


Exercise 172. Decompose a eae 


into partial fractions. (Answer on p. 1810.) 
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41. Solving Inequalities 


41.1. Solutions and Solution Sets 


Recall (Ch. 13) that to solve an inequality is to find all its solutions (or roots). 
Equivalently, to solve an inequality is to find its solution set. An inequality usually has 
infinitely many solutions: 


Example 572. Solve x- 1 $0 (a eR). 


1 
Three perfectly good (and equivalent) solutions to >: 
6 Cm > ile 

a Cae (lice). 


1 
¢ “The solution set of > is (1, 00).” 


1 
Example 573. Solve 1+5>8 (xe R). 


1 
Three perfectly good (and equivalent) solutions to >: 
e “ae > 3” 


a are Even 


1 
¢ “The solution set of > is [3,00).” 
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41.2. Sign Diagrams 


Example 574. Solve (« - 1) (4-2) $0, 
Since LHS is the product of two terms, we can use Fact 14 to make these five observations: 


If x<1, thenx-1<0 and x-2<0, so that (x-1)(%-2)>0. 
If g=1, then g—1=0, so that (4 - 1) (4-2) =0. 
Ifl<a2<2, then r-1>0 and x-2<0, so that (x-1) (4-2) <0. 
If g=2, - then x -2=0, so that (x -1)(#-2) =0. 
If x>2, then x-1>0 and r-2>0, so that (x-1)(%-2)>0. 


The above observations can be concisely summarised in a single sign diagram: 





1 
So, here are three perfectly good (and equivalent) solutions to >: 
=o LOR a >a,” 

e “x € (-00,1) U (2, 0).” 


¢ “The solution set is (—00, 1) U (2, 00).” 





443, Contents www.EconsPhDTutor.com 


Example 575. Solve a = 50 


-3 il 
Again, LHS is the product of two terms: = (x - 3) ——. So, 
a — 


1 
1. If x<3, thenz-3<0,x2-4<0, and q <9) 80 that 
Ye 


Go 
Vv 
o 


8 


If g=3, then g—3=0, so that 


| | 
eo 


A 
= 


If3<a<4, then x-3>0,27-4<0, and — <0, so that 
5G = 


1 = 
4. If x>4, thenx-3>0,2-4>0, and 7 > 0, so that — SS, 


= x4 
(Note: If x = 4, then > has an undefined LHS and so can’t possibly hold.) 


Again, we can summarise the above observations in a single sign diagram: 





1 
So, here are three perfectly good (and equivalent) solutions to >: 
«953 OR o> 47 

e “x € (-00,3) U (4, 00).” 


» “The solution set of > is (—00, 3) U (4, c0).” 





Now that we understand how sign diagrams work, we’ll skip the step of explicitly listing 
the observations and instead jump straight to drawing the sign diagram: 


Example 576. Solve (1 +1)(x-5)>0. Sign diagram: 








Solution: x € [-2,1]. 
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Example 578. Solve (37 +5) (#-8) >0. Sign diagram: 


aa = 


5/3 





Solution: x € (—o0, -5/3] U[8, ov). 


32+5 
Example 579. Solve = > 0. Sign diagram: 
% 


a | 


-5/3 





Solution: x € (—00, —5/3] U (8, 00). (How and why does the solution here differ from the 
last example?)? 


Exercise 173. Solve each inequality. (Answer on p. 1812.) 


24+3 —4r +2 
: 3, 
(a) Sey ©») c+1 a 





Above we looked at inequalities involving the product of two terms. Let’s now look at 
inequalities involving the product of three or more terms: 


Example 580. Solve (x - 1) (~- 2) (a - 3) > 0. 

We use Fact 14 to make these seven observations: 
If x<l1, thenx-1<0,x2-2<0, and x-3<0, so that (7-1) (w@- 2) (4-3) <0. 
i ig= 1, then g—1=0, so that (1-1) (a - 2) (2-3) =0. 
Ifl<a2<2,then x-1>0 x-2<0, and x-3<0, so that (7-1) (w- 2) (4-3) > 0. 
If g= 2,0 then t—2=(, so that (« - 1) (@- 2) (2-3) =0. 
If2<a2<3, then x-1>0 x-2>0, and x-3<0, so that (4-1) (a - 2) (4-3) <0. 
if g=3, then x-3=0, so that (x-1)(x%-2)(x-3) =0. 
If x>3, thenx-1>0 x-2>0, and x-3>0, so that (1-1) (4-2) (4-3) >0. 


The above observations can be summarised in a single sign diagram: 








Solution: x € (1,2) U (3, 00). 


Again, in the next few examples, we’ll omit explicitly listing out the observations and 
instead jump straight to drawing the sign diagram. You should nonetheless figure out for 
yourself how the sign diagram is constructed: 





2"3Tn Example 579, 8 is excluded as a solution to the inequality. This is because if x = 8, then LHS of the 
inequality is undefined so that the inequality cannot hold. So, 8 cannot be a solution to the inequality. 
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Example 581. Solve (a +4) (~-5) (#-6) >0. Sign diagram: 














More generally, consider the expression (x — a;) (a - az)...(&@ — Gn), where ay < dg < +++ < dn. 


If n is even, then the sign diagram is this: 














—1)(4%@-2 
Example 584. Solve a > 0. Sign diagram: 
4b _— 





Solution: x € (1,2) U (3, 00). 
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(2-4) (w-5) 
(x - 6) 


Example 585. Solve >0. Sign diagram: 





<0. Sign diagram: 


Example 586. Solve tee) 
DV —_ 


(x -9) 





(oe Wir 2) 
(x - 3) (4-4) 


Example 587. Solve > 0. Sign diagram: 





(x1) («-3) 


(x —2) (a -4) (z@-5) < 0. Sign diagram: 


Example 588. Solve 





Solution: x € (—o0, 1] U (2,3] U (4,5). (Note that 2, 4, and 5 are excluded.) 





(x — by) (a -— bp)... (a - bm) 


ne ins 
(fbi) Ody) =n) aie 


More generally, consider the expression 


b,, are distinct. 








Rearrange by, bo, ..., bp, in ascending order as ay < dy <-+** < Gy.24 
If n is even, then the sign diagram is this: 
+ = + = + = + = 
| | | ite fe | | | 
I | I I | | 
ay a2 a3 An-2  OAn-1 An 


If n is odd, then the sign diagram is this: 





48 That is, {bi bas< 214 ba} = 404,09, 250m) 
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41.3. Quadratic Inequalities 


Let a#0 and b,ceR. Recall that we call 


¢ The expression ax? + bx + c a quadratic polynomial: 


« Any equation that can be rewritten as ax” + bx +c =0 a quadratic equation. 


We shall now also call any inequality that can be rewritten as 


1 2 
ax +batc>0 or ax +ba +c>0 


a quadratic inequality. 
How do we solve quadratic inequalities? 


It turns out that we already learnt to do so in the previous subchapter, for those cases where 
b? — 4dac > 0 (positive discriminant). In such cases, the corresponding quadratic equation 
has two distinct roots and we may write ax? + ba +c as the product of two distinct linear 
terms: 


Example 589. Solve x? - 3%+2> 0. 


Observe that x? — 3x2 +2 = (x-1)(x-2). So, as usual, by considering the five possible 
cases (1 <1, 2=1,1<a”<2,2=2, and x > 2), we construct this sign diagram: 


Figure to be 
inserted here. 


Solution: x € (—00, 1) U (2, 00). 


There’s actually another way to “see” where the above sign diagram comes from. Observe 
that in x? - 3x + 2, the coefficient on 2? is positive. So, the graph of y = 2? — 3x2 +2 is 
U-shaped. Hence, the given inequality holds on the left of the smaller root and on the 
right of the larger root. 
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Example 590. Solve x? -27+3<0. 


Since x? - 24 -3 = (x + 1) (x —3), we have this sign diagram: 


Figure to be 
inserted here. 


Solution: x € [-1,3]. 


Example 591. Solve -2? - 77+ 4>0. 


Since —x? — 77+ 4=-(x+4) (22-1), we have this sign diagram: 


Figure to be 
inserted here. 


Solution: x € [-4, 1/2]. 


Example 592. Solve 5x7 + 27 — 3 > 0. 


Since 527 + 27 —3 = (x +1) (52-3), we have this sign diagram: 


Figure to be 
inserted here. 


Solution: «x € (—0c0, -1] U[3/500). 





If b? — 4dac < 0, then the graph of y = ax? + br +c does not intersect the x-axis at all: 


450, Contents www.EconsPhDTutor.com 


Example 593. Solve x7 +2+1>0. 


Figure to be 
inserted here. 


The discriminant is negative (17-4-1-1=-3<0). So, the graph of y = ax? + br + c does 
not intersect the x-axis at all. 


The coefficient on x” is positive. So, the graph is U-shaped and is entirely above the 
z-axis. Equivalently, 2? + 2+1> 0 for all z. 


Solution: x¢€R. 


Example 594. Solve 32? - 27+1< 0. 


Figure to be 
inserted here. 


The discriminant is negative ((-2)* -4-3-1=-8 <0). So, the graph of y = az? + bx +c 
does not intersect the x-axis at all. 


The coefficient on x” is positive. So, the graph is U-shaped and is entirely above the 
z-axis. Equivalently, x? + 2+1> 0 for all x. 


Solution: No real number z satisfies the given inequality.?” 





If b? —4ac = 0, then y = ax? + bx +¢ intersects the x-axis exactly once (at its turning point): 


Example 595. Solve x? -2x+1> 0. 


The discriminant is zero ((-2)? -4-1-1=0). So, the graph of y = ax? + bx + c intersects 
the x-axis exactly once. 


Indeed, we have x? - 2x +1 = (x - ee So, the graph intersects the z-axis at x = 1. 


The coefficient on x” is positive. So, the graph is U-shaped and is entirely above the 
x-axis except at x = 1 (where it touches the z-axis). 


Solution: «¢ Rv {1}. 
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Example 596. Solve 47? + 47 + 1> 0. 


The discriminant is zero (2?-4-1-1=0). So, the graph of y = ax” + bx + c intersects the 
x-axis exactly once. 


Indeed, we have 4a? + 4x + 1 = (22 - We So, the graph intersects the x-axis at x = 1/2. 


The coefficient on x” is positive. So, the graph is U-shaped and is entirely above the 
x-axis except at x = 1/2 (where it touches the x-axis). 


Solution: x¢€ R. 


Example 597. Solve -x? + 6x -9>0. 


The discriminant is zero (6”—4-(—1)-(—9) = 0). So, the graph of y = ax? +bx +c intersects 
the x-axis exactly once. 


Indeed, we have —x? + 62 -9 = - (x - Be So, the graph intersects the x-axis at x = 3. 


The coefficient on x? is negative. So, the graph is n-shaped and is entirely below the 
x-axis except at x = 3 (where it touches the z-axis). 


Solution: x = 3.2 (This is this textbook’s first example of an inequality that has exactly 
one real solution.) 





For future reference, we provide the following result, which summarises our above findings 
and gives the general solution for the quadratic inequalities. (Do not try to mug this result! 
Instead, seek to understand how we solved the quadratic inequalities in the above examples. 
And of course, do the exercises below.) 


Fact 103. Let a,b,c,x¢R with a#0. Consider the inequalities 


1 2 
ax? +br+c>0 and ax? +bxat+c>0. 


—b-— Vb? -—4ac a _ =b++/ 6? —dac 
— = —— 


a 


Let ry = dT 


(a) Suppose a> 0. 
2 
(i) Ifb?-4ac > 0, then the solution set for> is R\[11, 12] and that for > is R\(141, 12). 
2 
(ii) If b? - 4ac=0, then the solution set for > is R\ {—b/2a} and that for > is R. 
2 
(iii) If b? - 4ac < 0, then the solution set for both > and > is R. 
(b) Suppose a <0. 
2 
(i) If b? -4ac > 0, then the solution set for > is (71,72) and that for > is [r1,r2]. 


(ii) If b? -4ac =0, then the solution set for > is @ and that for > is {-b/2a}. 


2 
(iii) [f b? - 4ac < 0, then the solution set for both > and > is @. 


Exercise 175. Solve each inequality. (Answer on p. 1813.) 
(a) -327?+2-5>0. (b) «2?-2r-1>0. 
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41.4. The Cardinal Sin of Dividing by Zero, Revisited 


1 
Example 598. Solve ze” > xe (x € R). 


1 
Beng reasons, “Divide > by x to get e” > e. Which holds for all x > 1. So, solution: 
oe loo)! x 


1 
Beng’s mistake is to divide > by x and assume the inequality will be preserved. 
1 
Correct solution: xe” > xe —> xe” — xe > 0 — xe (er! — 1) > 0 
=— z(e™!-1)>0. 


In the last step, we multiplied both sides by the positive constant 1/e—by Fact 15, the 
inequality is preserved. 


Observe that 





ee t_1=0 —> lal — gel: 
ee t_1>0 —> 11 — r>ml; 


x 


eo tip Q ae tel eS ri. 


So, sign diagram for x (e"? - ie 





Solution: x € (-00,0] U[1, 0c). 
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41.5. Inequalities Involving the Absolute Value Function 


Fact 104. Suppose x€R and b>0. Then 
(a) |z|<b —b<a<b. 


(b) |a| <b -b<a<ob. 
(c) |a|>b x<—-bOR «>b. 
(d) |z|>b z<-bOR «2b. 





Proof. If x > 0, then |z| =x. If x <0, then |a| = -2. 

(a) Suppose x >0. Then x > -b. Also, |z]<b <== 2<b => -b<a<b?™ 
Suppose x <0. Then x <b. Also, |27]<b => -x<b = > x>-b —> -b<x<b. 
(b) Similar, omitted. 


(c) |zl|<b <=> NOT-(je|/<b) “4 NOT-(-b<2<b) <> (2<-bORe>8) 
(d) Similar, omitted. 














Example 599. |z7|/<5 <> -5<a<5 
Example 600. |z|<3 <=> -3<2<3 


Example 601. |z|>7 =< > (4#>7OR2<-7) 


Example 602. |z|}>1 <— > («>1OR2<-1) 
Here’s a more general version of the above result: 


Fact 105. Suppose a,xéR, b>0. Then 


(a) |x-al<b a-b<a<at+b. 


(b) |z-al<b a-b<a<ar+b. 
(c) |x-al>b (x>at+bORa<a-bd). 
(d) |x-al>b (x>a+bORa<a-b). 





Proof. (a) By Fact 104(a), |jra-al<b <=> -b<a-a<b = a-b<au<atbd. 
(b) Similar, omitted. 

(c) By Fact 104(c), |r-al>b — > (a@-a>bORa2-a<-b) <= (4>a+bORa<a-D). 
(d) Similar, omitted. 














Fact 105 has a nice geometric interpretation: 


(a) |x -a| <b means that the distance between x and a is less than b. 





2""The last <=> follows because any statement (here x < b) is equivalent to the conjunction of that 
statement with any true statement (here x <b AND x >-b). 
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a—b a £ a+ b 
(b) |x -a| <b means that the distance between x and a is no greater than b. 


lx -—al<b 


o> 


a.—o a z ao 
(c) |x —a|>b means that the distance between x and a is more than b. 


|x — al > b 
—_—_—— ST wVoqccee—- 0—> 
a —b a a+b ie 


(d) |x -a|>b means that the distance between x and a is no less than b. 


|x -al>b 


Example 603. Solve |x - 1| <5. 
1 
jr -1] <5 
Solution: x € (-4,6). 


2 
Example 604. Solve |x + 4| < 3. 


2 
jn +4| <3 -3<7%+4<3 


Solution: x € [-1,-7]. 
Example 605. Solve | - 2| a7 


ln-2|57 <> a2-2>7OR2-2<-7 <> 2x>9OR2<-5. 


Solution: x € (00, -5] U [9, 00). 
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4 
Example 606. Solve |x + 1| > 1. 


4 
jo+1[>1 —» w+121O0Ra+1<-1 — r200R2<0 <— «eR. 


Solution: x€R. 





It will often be easier to solve inequalities with the aid of a graph: 


1 
Example 607. Solve |x — 4| < 2a. 


1 

Let’s first solve < by graphing y = |x - 4| and y = 2x. For the former, recall (Ch. 26.6) 
that (i) above the z-axis, y = |f (~)| coincides with f; but (ii) below, y = |f (x)| is the 
reflection of f in the x-axis. 











We “see” that the two graphs intersect at P. We also “see” that 


1 
< holds — x is to the right of P (inclusive). 


So, let’s find P. We “see” that at P, x -4<0. And so, at P, 


lal = 2a <= Aa = 27 — GA) 3. 


Solution (to <): ipa ae 


This should be good enough for the A-Levels. But others might consider it a little less 
than rigorous (due to the two “sees” above). 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


1 
So, let’s now solve < again, but this time more rigorously (and without graphs). 
We consider two possible (and mutually exhaustive) cases—ax -4<0 and x-4>0: 


Case 1. Suppose x -4<0 or x <4. Then 
la — 4) < 2a —> 4A-x< 2x —> Aa 3e — ZN ee 
So, in Case 1, < holds <=> (<4 AND 4/3< 2) — we [4/3,4). 
Case 2. Suppose x -4>0 or x 5 4. Then 
la — 4) < Qe = r-4< 2x =A 7, 


which, of course, holds for all x 5 ay 


Altogether, < holds <=> (z €[4/3,4) OR x > 1) 7 24/3, 


(Of course, this second solution is the same as the first. We’d be worried otherwise. ) 


Example 608. Solve |2z — 4| ex. 


IN 
y 








We “see” that the graphs of y = |2x — 4| and y = x intersect at P and Q. Also, 


é holds — x is between P and Q (exclusive). 
So, let’s find P and Q. At P, 2x-4< 0. So, |2”- 4] = -(27-4) = 4- 2x. Hence, P is 
given by 4-22 = or x = 4/3. 
At Q, 2x -4>0. So, |2x - 4] = 27-4. Hence, Q is given by 24-4 = or x =4. 
Solution: x € (4/3, 4). 


(Example continues on the next page ...) 





457, Contents www.EconsPhDTutor.com 


(... Example continued from the previous page.) 


1 
Let’s now solve < again, but more rigorously (and without graphs). 
Again, we look at the two possible (and mutually exclusive) cases—2x-4 < 0 and 2x-4 > 0: 


Case 1. Suppose 2x -4 <0 or x <2. Then 
p= Alce A= ee ea Bee 


So, in Case 1, < holds <=> (x<2AND4/3<2) <=> xe (4/3,2). 
Case 2. Suppose 2x -4>0 or x >2. Then 


lQe-4|ca ee Ae 


So, in Case 2, < holds <=> (x>2AND2<4) =< ve[2,4). 
Altogether, < holds <=> 2 € (4/3,2) u[2,4) = (4/3, 4). 


Example 609. Solve |3 - 4| > Qa +2. 
We “see” that the graphs of y = [3x - 4| and y = 2x +2 intersect at P and Q. Also, 
> holds a x is to the left of P or right of Q (inclusive). 
So, let’s find P and Q. At P, 3r-4<0. So, |8a - 4] = 4-32. Hence, P is given by 
fe = 2 or aT, or = 6) 2: 
At Q, 34 -4>0. So, |3a - 4] = 32-4. Hence, Q is given by 37 - 4 = 27 +2 or x = 6. 
Solution: x € R \ [2/5, 6]. % 


aN 








(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


1 
Let’s now solve > again, but more rigorously (and without graphs). 


Again, we look at the two possible (and mutually exclusive) cases—3x-4 < 0 and 3x-4 > 0: 


Case 1. Suppose 3x -4 <0 or x < 4/3. Then 
1 1 
|3a — 4| > 27+ 2 —> 4-37 > 27 +2 = 225m — 2 ae 


So, in Case 1, > holds <=> (a <4/3 AND 2/5<2) <> re [2/5,4/3). 
Case 2. Suppose 3x -4 20 or x > 4/3. Then 


1 
|3a — 4| > 2x +2 — 32-4<24+2 — ae: 


So, in Case 2, > holds <=> (x >4/3 AND x <6) <> ve [4/3,6]. 





Altogether, < holds <=> 2x € [2/5,4/3) u[4/3, 6] = [2/5, 6]. 


And now for an example with two absolute value functions in the inequality: 


1 
Example 610. Solve |x + 1| > |22 - 1]. 


y = |2z - 1 





We “see” that the graphs of y = |x +1| and y = |2x -1| intersect at P and Q. Also, 


1 
> holds — x is between P and Q (inclusive). 


So, let’s find P and Q. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 

At P,x+1>0and 2x-1<0. So, |x +1)=2+1 and |2x - 1] = 1-22. Hence, P is given by 
e+1l=1-22 or 0), 

At Q, x+1>0 and 2x-1>0. So, |x+1]/=2+1 and |2x- 1] = 27-1. Hence, Q is given by 
all or TO, 

Solution: x € [0,2]. x 

Let’s now solve : again, but more rigorously (and without graphs). 


This time, there are four possible cases: 
Case 1. Suppose x + 1< 0 AND 22 - 1 < 0—this is equivalent to x < -1 AND x < 1/2 or 
x<-l. Then 


1 
la +1] > |2x -1| — -~%-12>1-2¢ a ad, 


1 
which contradicts x < -1. So, in Case 1, > never holds. 
Case 2. Suppose x + 1 >0 AND 22 - 1 < 0—this is equivalent to x > -1 AND x < 1/2 or 
x €[-1,1/2). Then 


1 
|x + 1] > |2a - 1 —— tere Sy = 20. 


1 
So, in Case 2, > holds = > xe [0,1/2). 

Case 3. Suppose x + 1 <0 AND 22 - 1 > 0—this is equivalent to x < -1 AND x > 1/2 or 
LES. 

So, in Case 3, S never holds. 


Case 4. Suppose x + 1 > 0 AND 22 - 1 > 0—this is equivalent to x > -1 AND x > 1/2 or 
x >1/2. Then 


1 
la +1] > |2x -1| —> x+1>2r-1 2) 


So, in Case 4, > holds <> we ay, 2) 

Altogether, < holds <=> x €[0,1/2)U[1/2,2] = [0,2]. 

Exercise 176. Solve each inequality. (Answer on p. 1813.) 
(a) |x -4|< 71. 

(b) |5-2| > 13. 

(c) |-3a+2|-4>a-1. 

(d) |x +6] >2|2x -1]. 
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41.6. The Triangle Inequality, Revisited 


Earlier in Corollary 12, we used the Law of Cosines to prove the Triangle Inequality (the 
length of one side of a triangle is no greater than the sum of the other two). We can now 
prove it again now that we’re better acquainted with inequalities involving the absolute 
value function: 


Fact 106. (Triangle Inequality) Suppose x,y¢R. Then 


Fee S Pale || a 





Proof. We have (a) —|z| < x < |z|; and (b) -|y| < y < ly. 
Add up (a) and (b): -(|z]+|y|) <a+y < |2|+|yI. 














Now, |x + y| < |le] + [yl] = [2] + [yl 


Some close relatives of the Triangle Inequality (indeed each of these results are often simply 
also called the Triangle Inequality): 


Corollary 16. Suppose x,y¢€R. Then 


oS eal |i): 

















Proof. By the Triangle Inequality (Fact 106), |a — y| = |x + (-y)| < |a| + |-y| = |x| + lyI. 


Corollary 17. Suppose a1,d2,...,d,¢€R. Then 


lay + Gg +--+ + Ap] < |aq| + lag] +--- + [ap]. 





Proof. Repeatedly apply the Triangle Inequality (Fact 106) and Corollary 16: 











lay + dg +++ £Gp| < lay|+|ag +--+ + Gn| < faq|+]a2|+]a3 +--+ + Gp| < +++ < |ay|+]ag|+---+an. 





Corollary 18. Suppose x,y¢€R. Then 


posed es eal (all. 














Proof. By Corollary 16, |x| = |a+y-y|<|x+y|+|y|. Rearranging, |x + y| > |x| -|y]. 
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See also the Reverse Triangle Inequality (Fact 266, Appendices). 


The Triangle Inequality looks simple and humble, but is often useful. In particular, it shows 
up surprisingly often in proofs of results in calculus. 


Example 611. Q. Show that if a,b,cé R, then |a - }| < |a-c| + |b-d. 


A. la—b| 2 |a—c+c—8| = |a—c—-(b-0)| S |a—cl + [b—e, 


2 
where = uses the Plus Zero Trick and < uses Corollary 16. 


Exercise 177. (Hard) Show that if x,y € R, then (Answer on p. 1814.) 








jz + y| P el yl . 
l+la+y|~ 14+ lz) 1+y| 
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ax? +bxr+c¢ 


41.7. —.———_. 
dx? +ex+ f 


Your syllabus says you need to solve the above inequality in those cases where both numer- 
ator and denominator are either always positive or factorisable. 


Let’s first look at those cases where either numerator or denominator is always positive: 


ax? +bx+c1 
dx?+ex+fo 


Fact 107. Consider the inequality 


1 
(a) Ifax*+bax+c is always positive, then > is equivalent to dx? + ex + f > 0. 


1 
(b) If dx? +ex+ f is always positive, then > is equivalent to ax? + br +c > 0. 





Proof. By Fact 15(a), we can 


(a) Multiply both sides of S by 1/ (ax? + bx + c) > 0 to get 1/ (dx? +ext+ f) > 0, which is 
equivalent to dx? + ex + f > 0; and 





(b) Multiply both sides of 5 by dz? +ex+ f >0 to get ax? +br+c>0. 











Recall (Ch. 14): A quadratic expression is always positive if and only if its coefficient on x” 


is positive AND its discriminant is positive (so that the corresponding graph is U-shaped 
AND everywhere above the x-axis). 


0. 





NO Sg rel pa 
Example 612. Solve D = 3g2- on 25 > 
First, N is always positive, because its coefficient on x” is positive and its discriminant 
1? 4(1)(1) = -3 is negative. So, by Fact 107(a), > is simply equivalent to D being 
positive: 327 - 27-5 ‘ 0. 


q278 


Next, D is U-shape and intersects the z-axis at 


ae 2+/(-2)°-4(3)(-5)_ 22. Ve 


10 | 
6 


+ 
2°3 23 


ek Ae 
6’ 


Altogether, > holds <=> > holds <> ze (—o00, -1) U (5/3, 00). 








2"8\ore correctly and pedantically, we should say that the graph of y = D is u-shaped. 
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N- -x? 1 
Example 613. Solve Dp ee ‘ 





207 -x+1 

First, D is always positive, because its coefficient on x” is positive and its discriminant 
(-1)? - 4(2)(1) = -7 is negative. And so by Fact 107(b), > is simply equivalent to N 
being positive: -x? + 7x +1 50. 


Next, NV is n-shaped and intersects the x-axis at 


—-74/P-4(-1I) (01) -74 V53_ 74 53 
a 2-(-1) cei. = 








Altogether, > holds <=» 5 holds <> ze ((7- V53) j2, (7+ V53) /2). 





We next look at the case where both ax? + bx +c and dx? + ex + f are factorisable. 


OP ae? 
E le 614. Solve — = ——————_ 
xample olve 5 eres 


Limite oe Selle eNed = 2 =the oe) 





Solution: x € (—00, —2) U (-1,3/2) U (2, «) 


Na? + bg 4 


i 0. 
aa 


Example 615. Solve 





First, N =-2?+52-4=-(x-1)(2-4). Next, D = 327-22 -5= (32-5) (x+1). 





Exercise 178. Solve each inequality. (Answers on pp. 1815, 1816, and 1817.) 


v2 4+2r4+1 g7= 3718 


(a) fT =23a 42 ee oy , (c) —72+9x-14 








464, Contents www.EconsPhDTutor.com 


41.8. Solving Inequalities by Graphical Methods 


Example 616. Solve x > sin (0.5717). 


Graph y = x-sin (0.577) on your TI84. Our goal is to find the roots of this equation, i.e. 
the values of x for which x - sin (0.57tx) = 0. 


Step 1. Step 2. Step 3. Step 4. 


Step 5. Step 6. Step 7. 


1. Press to turn on your TI84. 
2. Press to bring up the Y= editor. 


3. Press e (0) ~ 5). Next press and then @ to enter 7. Now press 
AEM Py and altogether you will have entered “x — sin (0.57x)”. 


4. Now press (@VNa5) and the T184 will graph y = x - sin (0.577). 


It looks there may be some x-intercepts close to the origin. Let’s zoom in to see better. 


5. Press the WOO) button to bring up a menu of ZOOM options. 


6. Press (2) to select the Zoom In option. Nothing seems to happen. But now press 
NEVE and the TI84 will zoom in a little for you. 


It looks like there are 3 horizontal intercepts. To find out what precisely they are, we'll 
use the T184’s “zero” option. 


7. Press and then @83VN@9) to bring up the CALC (CALCULATE) menu. 


8. Press (2) to select the zero function. This brings you back to the graph, with a cursor 
flashing at the origin. Also, the TI84 prompts you with the question: “Left Bound?” 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Step 8. Step 9. Step 10. 


Step 11. Step 12. Step 13. 


Recall?” that zero is another word for root. So what TI84’s zero function will do here is 
find the roots of the given equation (i.e. the values of x for which y = 0). Those of you 
accustomed to newfangled inventions like the world wide web and the wireless telephone 
will probably be expecting that the TI84 simply and immediately tells you what all the 
roots are. But alas, the TI84 is an ancient device, which means there’s plenty more work 
you must do to find the three roots. 


To find a root, you must first specify a “Left Bound” and a “Right Bound” for 7. The 


TI84 will then check to see if there are any values of x for which y = 0 between those 
two bounds. 


9. Using the [| and ))| arrow keys, move the blinking cursor until it is where you want 
your first “Left Bound” to be. For me, I have placed it a little to the left of where I 
believe the leftmost horizontal intercept to be. 


LO. Press (QQ0098) and you will have just entered your first “Left Bound”. 
TI84 now prompts you with the question: “Right Bound?”. 


. SO now just repeat. Using the |(j and ))| arrow keys, move the blinking cursor until it 
is where you want your first “Right Bound” to be. For me, I have placed it a little to 
the right of where I believe the leftmost horizontal is. 


. Again press US)NIN38) and you will have just entered your first “Right Bound”. 


TI84 now asks you: “Guess?” This is just asking if you want to proceed and get TI84 to 
work out where the horizontal intercept is. So go ahead and 


13. Press (BEDE. T184 now informs you that there is a “Zero” at “x =-1”, “y =0” and 
places the blinking cursor at that point. So, x =-1 is the first root we’ve found. 


To find the other two roots, “simply” repeat steps 7 through 13—two more times. You 
should find that the other two roots are x = 0 and x = 1. Altogether, the three roots are 
x =-1,0,1. Based on these and what the graph looks like, we conclude: 


2 > sin —> z-sin— >0 <=> ré(-1,0)U (1,00). 








279Remark 29. 
466, Contents www.EconsPhDTutor.com 


Example 617. Solve x >e+Inz. 


1. Graph y = x-e-Inz on your TI84 (precise instructions omitted). 


We see that there’s clearly an x-intercept at around x € (4,5). (Note that by default, 
each of the little tick marks shown on your TI84 marks 1 unit.) 


. Zoom in (precise instructions omitted). 


Now we see that there’s probably also an x-intercept near the origin. But unfortu- 
nately, now we can no longer see the other x-intercept. To fix this: 


. Press to bring up the WINDOW menu. 
We will adjust Xmin and Xmax: 
4. Press (0). We have adjusted Xmin to 0. Next, 
5. Press (5). We have adjusted Xmax to 5. 


. Now press (@iNiNS5). We can now see the portion of the graph between x = 0 and 
7 =o 


. To find the two roots, “simply” go through the steps described in the previous 
example—twice (precise instructions omitted). You should find that the two roots 
are x 8 0.708, 4.139. 


Based on these roots and what the graph looks like, we conclude that the inequality’s 
solution set is x € (0,0.708...) U(4.139...,00) = R* \ (0.708...,4.139...). % 


Step 1. Step 2. Step 3. Step 4. 


Step 5. Step 6. Step 7. 


Exercise 179. Use a GC to solve each inequality: (Answers on p. 1818.) 


(a) 2? -a?+2-1>e*. (b) Vx >cosz. (c) 


as >2>4+sin a. 
—-2x 
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42. Extraneous Solutions 


42.1. Squaring 


Example 618. To solve x = V2 — x (x < 2), we try these three steps: 


1. Square both sides: x? = 2-2. 


2. Rearrange and factorise: x? + —2 = (x-1)(x+2) =0. 


3. Conclude: «21 or x 2-2. 


Now, x 2 1 does indeed solve 4: 14V2-1=V1=1. J 


But x 2-2 does not: =2 #,/2- (-2) = /4=2. x 


So, the above steps are wrong because they produce the extraneous solution x 3-9. 


Where was this extraneous solution introduced? It turns out it was introduced in Step 
1, where we applied the squaring operation. 


To see this more clearly, let’s be more explicit about our above chain of reasoning. In 
particular, let us use the logical operators <> (“is equivalent to”) and = > (“implies”): 


(x —1)(x +2) =0 


a2] 


We now see clearly how Step I differs from Steps II and III. Step Tis a “ => ” statement, 
while Steps II and III are “ = > ” statements. Or in plainer English, the squaring 
operation in Step I is an irreversible operation. 


It is always true that =D => a’ = 6". 
But, the converse is false: og —> a=b. 


For example, Ciy - 1°, but -1 ¢ 1. So, squaring is an example of an irreversible 
operation; it produces an “ = > ” statement and not a “ = >” statement. 


So, our above chain of reasoning produces this (true) implication: 


Z 2-2 —> 21 0R e-— 


z 
We must now check, on a case-by-case basis, whether each of x 21 OR x 2-2 solves the 


original equation x + /2—zx. And when we do check, we find that x 21 does, while x 39 
does not and is thus an extraneous solution that must be discarded. 
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The following (very) informal Theorem is our moral of the story. It describes how, when, 
and why extraneous solutions may arise: 


Theorem 13 (informal). (Extraneous Solutions Theorem) /f our chain of reasoning 
contains only <= > ’s, then all is well. However, if it contains even one = > (i.e. an 


irreversible step), then extraneous solutions may arise and we must be careful to check 
for them. 





Note the emphasis on the word may. Extraneous solutions may arise but might not. 


The operation of squaring is merely one example of when extraneous solutions may be 
introduced. Two others are multiplying by zero and removing logarithms: 


469, Contents www.EconsPhDTutor.com 


42.2. Multiplying by Zero 


2_ 3 
Example 619. To solve a 5 7 +2+ i z 0, we try these three steps:7°? 
x? — z 


1. Multiply by x? - 1: 2? - 32 +2(2?-1)+(x+1)=0. 
2. Rearrange and factorise: 32? — 2x2 —1 = (x -1)(3a +1) =0. 
3. Conclude: x 2 1 or x 2-1/3. 


Exercise: Verify that x 31 /3 solves 2, while x 2 1 does not.28! 


pO, £ 2 | is an extraneous solution. Where was it introduced? 
Again, to clearly detect the error, let us write out our chain of reasoning more explicitly 


with == and =: 


xe? — 34 
+2+ 
ge? —1 g-1 


x? - 30 +2(x?-1)+(x+1)=0 


I 
=; 
I 
—= 


Bn = 27 = | ate = 1 ee 41) =) 


Ul 2 
<—_ = 


= 1 or x 2-1/3. 


I 
Now, why is == an irreversible operation? Because we’re multiplying by some unknown 
quantity x? — 1 which might be zero. 


And multiplying by zero is an irreversible operation: 


In general, uy — O-y =0- Zz. 


But, O-y=0-2 => Y=2 
For example. (I= lv 0 J=0= 1. but 0-2=0-3 => 2=3. 
So, our above chain of reasoning yields this (true) implication: 


xv — 3a ee 
+2+ = 
7 = 


x 21orx2-1/3. 
We must then check, case-by-case, whether each of x 21 0rz2-1 /3 actually solves the 


original equation +. In this example, we find that x 5-1 /3 solves =. while x = 1 does not 
and is an extraneous solution that must be discarded. 








80This example was stolen from Manning (1970, p. 170). 
1/3)? -3(-1 1 
81Plug x 2-1/3 into 2: ie) att ae weave, S. ee J 
(-1/3)?-1 asi 1621 4. 4 4 


In contrast, x 2 1 does not solve = because x - 1 = 0, so that some terms in + are undefined. 
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By the way, a mathematician might say that the problem of multiplying by zero is dual 
to the problem of dividing by zero (Ch. 2.2): 


¢ Dividing by zero may cause us to lose (valid) solutions; while 


¢ Multiplying by zero may introduce extraneous (or invalid) solutions. 
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42.3. Removing Logs 


Example 620. To solve log x + log (a + 1) = log (2x + 2), we try these four steps: 


1. Use a Logarithm Law: log x + log (a + 1) = log (a. + z) 2 log (2x + 2). 


2. Remove logs: 27+” = 2x +2. 


3. Rearrange and factorise: x* - x - 2 = (x+1)(x-2) =0. 


A. Conclude: x 2-1 or 22. 


Exercise: Verify that x 32 solves = but x 2-1 does not.2° 


oye: , si 
So, x = 1 is an extraneous solution. Where was it introduced? 


Again, to clearly detect the error, let us write out our chain of reasoning more explicitly: 


log x + log (x + 1) = log (2x + 2) 


zee loga + log (x +1) = log (a? +z) = log (2r +2) 


Il 
=> e+e = 20-2 


ae a == (ea 1 2) — 0) 


IV 9 
—_ fi — 


-] or 7 22. 
This time, it’s Step II that’s irreversible. In general, we have 


In general, log a = log b —= c=.) 
But, = 0 loga 2 log b. 


Because if a = b is non-positive, then loga or logb is undefined, so that 2 ig necessarily 
false. 


So, our above chain of reasoning yields this (true) implication: 
log x + log (x + 1) = log (2x + 2) —> r2-lorr22. 
We must then check, case-by-case, whether each of x 2-1 or 222 actually solves the 


original equation +. In this example, we find that x 3-1 /3 solves é, while x = 1 does not 
and is an extraneous solution that must be discarded. 








22 Phas 2 2 into 2: log 2 + log (2+ 1) = log2 + log3 = log6 4 log (2-2 + 2) = log6. J 


In contrast, x 2 —| does not solve = because log -1 is undefined. 
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This brief chapter merely examined three examples of operations by which extraneous 
solutions may be introduced—namely squaring, multiplying by zero, and removing 
logarithms. These are not exhaustive and you will likely encounter more of such operations 
as your maths education progresses. 


The important thing is to remember how and why extraneous solutions arise. In particular, 
you should remember and understand the Extraneous Solutions Theorem. 


Exercise 180. To solve sinz+cosz = 1 (x <[0,27t) ), we try these steps:7°? (Answer on 


p. 1819.) 


» 


On. 23 1 
. Square both sides: (sin x + cosz)” = sin? x + cos? x + 2sinxcosxz = 1? = 1. 


. Apply the identity sin? x + cos? x = 1 to get 2sinxcosx = 0. 


1 
2 
oF DO. sing = Dior cosy =) 
4 


. Conclude: «20 or x2 mor x4 m/2 or x 2 3n/2. 


Is/are there any extraneous solution(s)? At which step(s) did it/they arise? 


Exercise 181. To solve r!/3 + 7-220 (x € R), we try these steps:7*4 


1. Factorise: tai’ = 1) (ae oF 2) = 0. 

2: So: 2’ =1org Ss) 

3. Conclude: x = (2'/6)" =1821ore= (a'/)" = (-2)° 264. 

Is/are there any extraneous solution(s)? At which step(s) did it/they arise? (Answer on 
p. 1819.) 

Exercise 182. To solve 22+ 72+120 (x € IR), we try these steps:*®° 


ae 


1. Rearrange: x” 
2. Since x = 0 doesn’t solve 4 anyway, we know that x # 0. 
So, we can divide z by x to get x Stet 
H 


1 il 
ahh Now plug ® into + to get 2?+(-1-=)+140 or 2?=— ora? =1, 
6 6 


A. Conclude: x 2 1. 


Is/are there any extraneous solution(s)? At which step(s) did it/they arise? (Answer on 
p. 1819.) 








*83Stolen from Sullivan (Precalculus, 10e, 2017, p. 519), hat tip to &. 
84Stolen from Sg. 
85Stolen from Sg. 
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43. O-Level Review: The Derivative 


This chapter is a short review of what you (supposedly) mastered in secondary school. In 
Part V (Calculus), we'll revisit these concepts but at greater depth. 


43.1. The Derivative Is the “Gradient Function” 


Example 621. Define f :R—- R by f (x) = 52. 


Figure to be 
inserted here. 


This graph’s gradient is 5 at every point. 


So, let’s define a new function g: R > R by g(x) = 5 and call g the “gradient function” 
of f. That is, g is very simply the function that tells us what the gradient of the graph 
of f is at each point. 


But we don’t use the term “gradient function”.7°° Instead, we call g the derivative of f. 


Of course, in this example, g isn’t very interesting and simply tells us that the gradient 
of the graph of f is 5 at every point. For example, at x = -2, the graph’s gradient is 
g(@) = 5; at x = 0, it’s g(@) = 5; and at x = 3, it’s g(a) = 5, Indeed, at any point 7 R, 
g(x) =5. 








286The word gradient is actually used in another closely related context in multivariable calculus (beyond 
A-Level Maths). 
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Example 622. Define f:R—R by f (x) = 2? +1. 


Figure to be 
inserted here. 


We don’t yet know how, but it’s possible to show that this graph’s gradient is equal to 
2x at every point. 


So again, we can define a new function g: R > R by g (x) = 2” and call g the derivative 
OL 

Again, g simply tells us what the gradient of the graph of f is at every point. For example, 
at x = -2, the graph’s gradient is g (—2) = 2x (-2) = —4; at x =0, it’s g(0) = 2x0=0; and 
ae = 3, 1b so (3) =2%3 = 6: 


Example 623. Define f:R— R by f (x) = 2° + 2z. 


Figure to be 
inserted here. 


We don’t yet know how, but it’s possible to show that this graph’s gradient is equal to 
327 +2 at every point. 


So again, we can define a new function g : R > R by g(z) = 327+ 2 and call g the 
derivative of f. 


Again, g simply tells us what the gradient of the graph of f is at every point. For 
example, at « = -2, the graph’s gradient is g(-2) = 3 x (-2)’ +2 = 14; at x = 0, it’s 
G0 = 30 22 =o md ate = oc sais ~ 3s 22 20) 


Definition 114 (informal). Given a function f, its derivative is the function that tells 
us what the gradient of the graph of f is at each point. 





To stress, repeat, and emphasize, 
The derivative is itself simply a function. 


For the formal and precise definition of the derivative, see Definition 203 (Part V, Calculus). 


475, Contents www.EconsPhDTutor.com 


43.2. The Confusing and Varied Notation for the Derivative 


In the last three examples, given a function f, we simply denoted its derivative by g. 


But given a function f, writers usually denote 


¢ Its derivative (a function) by 


f f ae 
or or dx 
(Lagrange), (Newton), (Leibniz). 


¢ Its derivative at a point (a number that we interpret as the gradient of the graph of 
f at that point) by 





; d d 
fo) FQ) Fa) or F 
(Lagrange), (Newton), (Leibniz). : 


Example 624. Define f:R— R by f (x) = 2. 


The derivative of f is the function with domain R, codomain R, and mapping rule 
Lr 22. 


This last sentence is equivalent to each of these next three sentences: 
Lagrange: The derivative of f is the function f’: R > R defined by f’(x) = 2z. 


Newton: The derivative of f is the function f: IR > R defined by f(a) = 2z. 








d d 
© Leibniz: The derivative of f is the function = IR > R defined by (a) 2 Qe. 
c i 


The derivative of f at x =5 is the number 10 (and this is the gradient of the graph of f 
abe = 9) 

Leibniz 
a — 
df df 
—| =—(5)=2x5=10. 
dx = ) * 


v= 


Lagrange 
WN Oe 
Newton 


Lagrange places a prime symbol ' to the right of f. 


Newton’s notation is very similar—instead of a prime symbol ’ to the right of f, we have 
a dot - above. 


Leibniz’s notation is the oddball. 
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Remarks on Leibniz’s notation (references to above example): 


— 


. In 2, the “(a)” is superfluous and can be omitted, because the dummy variable «x is 


already specified in the symbol a And so, line © may be written more simply as 
1s 


Leibniz: The derivative of f is the function f :R => R defined by ae 2 
x 


—= 2%, 
x 


i) 


. We’ve already explained the concept of dummy variables several times in this text- 
book.?8"In line ©, our dummy variable is “2”—this can be replaced by any other symbol, 
such as y, t, or even «. So, line © is exactly equivalent to each of these three lines: 


Leibniz: The derivative of f is the function - : IR > R defined by al 2 2Qy. 
y y 


Leibniz: The derivative of f is the function of : IR > R defined by ay 2 


2b. 
dt 


d 
Leibniz: The derivative of f is the function a : IR > R defined by — 2 Ox. 
x a 


Co 


d d 
. Rather than oi you may be more used to seeing a This is perfectly fine so long as y 
x 


is a well-defined function. 
The thing is, your teachers and A-Level examiners will often be sloppy and write some- 
d 
thing like this: “Let y = 2?. Then ali 2a 
a 
This isn’t outright wrong, but is a bit sloppy, because we should really be careful to 
specify that y is a function of x and also specify the domain and codomain of y. 


We'll have more to say about Leibniz’s notation in Ch. 89. 


Exercise 183. For each function (recycled from Exercise 335), write down its derivative 
and its derivative at 2 in Newton’s and Leibniz’s notation. (Answer on p. 1896.) 


(a) f :R-R defined by f (x) = 7. 


(b) g:R-R defined by g (x) = 54 +7. 





(c) h:R-—R defined by h (x) = 227 + 5x4 +7. 





87Example 93, Ch. 5.8, p. 211. See also Ch. 46.1. 
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Fun Fact 


Isaac Newton was both one of the greatest physicists ever and one of the greatest mathe- 
maticians ever. Which is why in one ranking of the most influential persons in history,?°> 
Newton was ranked second (and the only among the top six who was a non-religious 
figure). 


Gottfried Wilhelm von 


Isaac Newton (1643-1727) 
Leibniz (1646-1716) 


Gottfried Wilhelm von Leibniz®? was likewise a first-rate genius and a polymath. Indeed, 
he has sometimes been called “the last man to know everything”, the rationale being 
that 


Since his time the growth of knowledge has resulted in, and indeed neces- 
sitated, specialization. The horizon for the individual is now restricted, 
for few can hope to attain proficiency in more than one subject. 


— A. L. Leigh Silver (1962). 


Newton and Leibniz are often dubbed the “inventors” of the calculus. Indeed, their 
dispute over who “invented” calculus is perhaps history’s most famous academic dispute. 
(Even history’s greatest geniuses are not above some petty bickering.)?°? But as has been 
well said by the historian of mathematics Carl B. Boyer (1949), 


Few new branches of mathematics are the work of single individuals. 








*88Michael Hart, in The 100: A Ranking of the Most Influential Persons in History (1978, 1992). In case 
you’re wondering, Muhammad was #1 and Jesus #3. Full rankings plus summary here. Book here. 
89Sometimes spelt Leibnitz. 


20Ror a popular account of this dispute, see The Calculus Wars: Newton, Leibniz, and the Greatest 
Mathematical Clash of All Time (2007). 
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43.3. The Derivative as Rate of Change 


dx 
So far, we’ve interpreted a only as the gradient. 


dx 
But as you may recall, we can also interpret a as 


e The change in x resulting from a small unit change in ft; and 


e« The rate of change of x with respect to time tf. 


Example 625. A particle P travels along a line. Its eastward displacement x: R— R 
(metres, m) from the point O at time t (seconds, s) is given by 


a (t) = #8 — 6t? + 9¢. 


e _1___@—____1_ ____9 —-» 1 (metres) 
O 1 2 3 J 


At t= 0s, P starts x = 0?-6-0?+9-0=0m east of O. In other words, it is at O. And 
during t € (0,1), P travels eastwards away from O. 


At t=1s, P stops and is « = 1°-6-17+9-1=4m east of O. 


During ¢ € (1,3), it travels westwards, i.e. back towards O. For example, at t = 2s, P 
is g = 2? -6-2?+9-2=2m east of O. 


At t= 3s, P has returned to O (x = 3° -6-32+9-3=0) and stops. 


During t > 3s, P keeps travelling eastwards away from O. For example, at t = 4s, P is 
a= 4°-6-47+9-4=4m east of O. And at t = 10s (not depicted in the figures), P is 
x = 10°-6-107+ 9-10 = 490m east of O. 


x (m) 








(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


We define velocity v (metres per second, ms!) to be the rate of change of displace- 


ment with respect to (w.r.t.) time. In other words, velocity is the (first) derivative of 
displacement w.r.t. time: 


ae 


Using rules of differentiation that we’ll review in Ch. 43.6, it’s possible to work out that 
P’s (eastward) velocity is given by 


C= 37 dee 


No me) 








At each instant of time, v tells us what P’s velocity is—in other words, the rate at which 
x is changing per “infinitesimally small” unit of time t. If v > 0, then P is travelling 
eastwards. While if v <0, then P is travelling westwards. 


From the above graph, we can tell that 
¢ During t€[0,1), P travels eastwards. 
e At t=1, P stops. 

¢ During t€ (1,3), P travels westwards. 
e At t=3, P stops. 

e During t > 3, P travels eastwards. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


We define acceleration a (metres per second per second or metres per second squared, 
ms”) to be the rate of change of velocity w.r.t. time. In other words, acceleration is the 
(first) derivative of velocity w.r.t. time and thus the second derivative of displacement 
w.r.t. time: 


pa dv Pe 
dt dt?’ 


it is possible to work out that P’s (eastward) acceleration is given by 


a(t) = 6t — 12. 


2S 


a (mig?) 








At each instant of time, a tells us what P’s acceleration is—in other words, the rate 
at which v is changing per “infinitesimally small” unit of time t. If a > 0, then P’s 
eastwards velocity is increasing (or equivalently, its westwards velocity is decreasing). 
And if a < 0, then P’s eastwards velocity is decreasing (or equivalently, its westwards 
velocity is increasing) 


From the above graph, we can tell that 


¢ During t € [0,2), P’s eastwards velocity is increasing (or equivalently, its westwards 
velocity is decreasing). 


¢ During t > 2, P’s eastwards velocity is decreasing (or equivalently, its westwards ve- 
locity is increasing) 
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43.4. Differentiability vs Continuity 


In Ch. 18, we learnt that informally, continuity means you can draw the graph without 
lifting your pencil. We now introduce the concept of differentiability, which turns out 
to be a stronger condition than continuity. Informally, a function is differentiable if its 


graph is “smooth” and, in particular, doesn’t have “kinks”.?*! 


First, three examples of functions that are both continuous and differentiable: 


Example 626. The function f : R > R defined by f(x) = x° — x? -1 is continuous 
everywhere, because you can draw its entire graph without lifting your pencil. 





f is differentiable everywhere 
because its graph is 
“smooth” and has no “kinks”. 





The function f is also differentiable everywhere, because it is “smooth” everywhere and 
has no “kinks”. 


Example 627. The sine function sin is both continuous everywhere and differentiable 
everywhere. 


sin is differentiable 

everywhere because 
its graph is “smooth” 
and has no “kinks”. 














29lFor the formal definition of differentiability, see Definition 204 in Part V (Calculus). 
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Example 628. The exponential function exp is both continuous everywhere and differ- 
entiable everywhere. 


exp is differentiable everywhere 
because its graph is 
“smooth” and has no “kinks”. 











Next, two examples of functions that are continuous but not differentiable: 


Example 629. The absolute value function |-| is continuous everywhere, because you can 
draw its entire graph without lifting your pencil. 


However, it is not differentiable everywhere, because it is not “smooth” everywhere. In 
particular, it has a “kink” at x= 0: 


We can say though that the absolute value function is differentiable everywhere except 
at x =0. Or equivalently, it is differentiable on R\ {0}. 


|-| is not |-| is 
differentiable everywhere differentiable 
because it has a kink at x = 0. on R \ {0}. 
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Example 630. Define the piecewise function g: R > R by 


for «<0 
g(x) = 
tora > 0: 


The function g is continuous everywhere, because you can draw its entire graph without 
lifting your pencil. 


However, it is not differentiable everywhere because, like |-|, it has a “kink” at x = 0. 
Nonetheless, again, we can say that g is differentiable everywhere except at x = 0. Or 
equivalently, g is differentiable on R\ {0}. 


\ 


g is not differentiable 
everywhere because it 
has a kink at x = 0. 
gis 
differentiable 
on Rv {0}. 











It turns out that every differentiable function must also be continuous.””” (This 
is exactly what we meant when we said that differentiability is a stronger condition than 
continuity. ) 


However, the converse is false—not every continuous function is differentiable; that. is, 
continuity does not imply differentiability. This was illustrated by the last two examples. 


Differentiable functions are thus a subset of continuous functions. Beautiful Venn diagram 
drawn by an artistic genius: 





282This assertion is formally stated and proven as Theorem 29 (Appendices). 
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Differentiable 
functions 


Continuous 
Functions 





Happily, most functions we’ll encounter in A-Level maths will be both continuous and 
differentiable. There are however exceptions, as we’ve seen here and in Ch. 18. 


Exercise 184. Graphed below are three functions f, g, and h. State if each is continuous 
and/or differentiable. If not, state the set of points on which each function is continuous 
or differentiable. (Answer on p. 1794.) 


uN 
Y 
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43.5. A Subtle Point about Differentiability 


This subchapter parallels Ch. 18.1 (“A Subtle Point about Continuity”): 

We'll formally define differentiability only in Part V (Calculus). 

It turns out that given a function f, we say that f is 

¢ Either differentiable or not (and not both) at each x ¢€ Domain f (Definition 198); 
¢ Differentiable if it is differentiable at every x € Domain f (Definition 199); 


e Neither differentiable nor differentiable at every a ¢ Domain f 


The last bullet point is somewhat subtle and surprising. We recycle the same three examples 
from Ch. 18.1 to illustrate it: 


Example 631. Define f :[1,2]U[3,4] > R by 


L for @e|1, 2), 
f (2) = me 
2, tor xe(3,4]. 


Figure to be 
inserted here. 


It turns out that f is differentiable at every x ¢ Domain f = [1,2] U [3,4]. 


And so, perhaps surprisingly, f is a differentiable function. 


Example 632. Define g: (0,1) U (1,2) U (2,3) > R by 


1, for x € (0,1), 
g(z)=42 for x (1,2), 
3, for xe (2,3). 


Figure to be 
inserted here. 


It turns out that g is differentiable at every x € Domain f = (0,1) uU (1,2) U (2,3). 


And so, perhaps surprisingly, g is a differentiable function. 
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Example 633. Again, it turns out that tan is differentiable at every x ¢ Domain tan = 
R \ {Odd integer multiples of 71/2}. 


Figure to be 
inserted here. 


And so, perhaps surprisingly, tan is a differentiable function. 
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43.6. Rules of Differentiation 
You should find these familiar from secondary school: 


Theorem 14. (Rules of Differentiation) Let c be a constant and x, y, and z be 
variables. Then 


Constant Rule Constant Factor Rule Power Rule 
d C d EF dy d e 4 c-l 
aa aa (oY oor qe =r 


Sum and Difference Rules Product Rule 


oe 


er dz 


Quotient Rule Cosine 
dy 


dz 
Oy 2 245 — Uae : P 
5 sila = cos a —— Cos 7 = —sin a 
daz x dx dx 


Natural Logarithm Exponential 
d 


— |ag == —e* =e” 


ax a dx 





(The Chain Rule will be covered in the next subchapter. ) 


Common mnemonic for the Quotient Rule: 


Lo-D-Hi minus Hi-D-Lo, 
Cross over and square the Lo. 


In Part V (Calculus), we’ll explain where these rules come from (and even learn to derive 


them). For now, you need merely “know” these rules and how to use them to “solve” 
differentiation problems: 
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Example 634. —5 £ 0. (Constant Rule) 
i? 

Example 635. —500 £ 0. (Constant Rule) 
a 


Example 636. = (—200) £0. (Constant Rule) 
4g 


Example 637. - (52°) 25 (—*) = O(n) adog. (Constant Factor Rule) 
a A 


d d 
Example 638. —— (500x°%) = 500 (—2"*) =p00(0g2-" \ 150m" 
ob x 


Example 639. = (-2002-1) = -200 (<0) = -200 (-27*) = 20027”. 
x Lv 





Example 640. a8 = 3a”. (Power Rule) 
a 
(Power Rule) 


(Power Rule) 


a (5002) J =32 71502, 


go + — 
dx 


d 
ne 3 e (5000 |= 3r —lola 


Example 644. Product Rule and Sine: 


6 x 6 
ae iz sin x) = aie sin zw + ~? COS Z. 
XL 


Example 645. Quotient Rule and Cosine: 


2 


Ga P32 Cosme = ae (= simiy Aaa 


— = = (3cosx+axsin2). 
dx cos x cos? © cos? © 
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Example 646. Combination of various rules: 
ne ee dace eee leaders 
da lng) da Zine 72 cde line 
. Ine (ete*) - (e*e*) A Ina 
2 (Inx)” 
p lng (ere ere") — (ere ) - 
2 (Inx) 
ein e2e ( 1 











x 


os (Inz)? "Ing 


lng 


Exercise 185. For each, find cy and dy (Answer on p. 1794.) 
dx dx} oe 

(a) y=2". 

(b) y= 3° =4r° + 7 = 2. 


(c) y= (a? + 3x44) (30° - 42? + 7x - 2). 


d 
Exercise 186. For each, find = without using the chain rule. 
q 


(a) y=e" lng. 
(b) y=2'e"lnz. 
sin £ 


(c) y= 


z 
sin © 

(d) y=tanz, given that tan x = and sin? x + cos’ x = 1. 
COS & 


di 
(e) y =—, where z is a variable that can be expressed in terms of x. (Leave your answer 
2 


dz 
in t f d —. 
in terms of z and ) 


Use (e) to solve (f), (g) and (h): 
1 

(f) y =cosecxz, where cosecx = ——. 
sin x 


(g) y=secz, where sec x = 
COS & 


(h) y =cotx, where cot x = (Answers on p. 1794.) 


tan x 
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43.7. The Chain Rule 


The Chain Rule informally stated:2% 


Theorem 15 (informal). (Chain Rule) Suppose y and z are functions. Then 


dz _dz dy 
dx dy dr 








One mnemonic is to think of the derivatives on the RHS as fractions—in which case, the 
dy’s get cancelled out and we’re left with dz/dz. 


In Part V (Calculus), we'll explain (a) why this is merely a mnemonic; (b) strictly speaking, 
it’s wrong to think of derivatives as fractions; (c) nonetheless, such thinking can be a 
helpful aid to intuition. 


We can give the Chain Rule this informal interpretation: 


The change in z caused by _ The change in z caused by . The change in y caused by 


a small unit change in x a small unit change in y a small unit change in x. 
This interpretation makes (common) sense: 


Example 647. Suppose that when I add to a cup of water 1g of Milo, its water volume 
increases by 2cm?. And when the water volume increases by 1cm?, the water level rises 
by 0.3cm. 


Then by common sense, when I add 1g of Milo, the water level should rise by 2 x 0.3 = 
0.6 cm. 


Let’s now rewrite the above common-sense observations more formally: 


Let x be the mass (g) of Milo in a cup of water, y be the total volume (cm? ) of water in 
the cup, and z be the water level (cm) in the cup. 


¢ When z increases by 1g, y increases by 2cm?. 


d 
More formally, — =2cem?g. 
5 


¢ When y increases by 1cm™®, z increases by 0.3cm. 


d 
More formally, = =O3cmem  =O0.3em—. 
Yy 


e And so, by the Chain Rule, when x increases by 1g, z increases by 2 x 0.3 = 0.6cm. 


dz _dzdy 
dx dydzx 


More formally, =O03cm -x2cm ¢  —=ViGcme — 








**3For a formal statement, see Theorem 32 (Part V). 
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Examples of how to “use” the chain rule: 


Example 648. If y =e”, then 


dy de" cide" dang ~ 


ze esin G 
dz dx dsinz dz 





COS Z. 


Example 649. If y = V4 -1, then 


dg dy 4a 4 ondy4e = ldiae- 1) 
de Ss rd7 ees dita) Fede eve 





A slightly more complicated example, where we use the Chain Rule more than once: 


Example 650. If y = [sin (2x - 3) + cos(5- 2z)]°, then 


dy — d[sin (2x - 3) +cos (5 - 2x)]° 
dc dx 

ch d[sin (2x - 3) + cos(5- 2)]° d[sin (2x - 3) + cos (5 - 2z)] 

~ d{sin (2x - 3) + cos (5 - 22)] da 

dsin(2x-3) dcos(5-2z) 

dx ‘ dx | 

dsin (2x -3)d(2x%-3) dcos(5-2x)d(5- 2r) 
d (2x - 3) de d(5- 22) da 





= 3[sin (2x - 3) + cos (5 -22)]° | 


ae [sin (22 - 3) + cos (5 - 2x)}° 


= 3[sin (2x - 3) + cos (5 — 2x) }’ [cos (2x - 3) x 2 + (—sin (5 - 2r)) x (-2)] 


= 6 [sin (2x - 3) + cos (5 — 2x) }° [cos (2x - 3) + sin (5 - 22)]. ® 
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Exercise 187. For each, find 2 and a (Answer on p. 1795.) 
qi g 
x=0 


x 


(a) y=14+[x-m(r+1))’. (b) Se eertcnane 


Exercise 188. Let F, m, v, t, and p denote force, mass, velocity, time, and momen- 
tum. Momentum is defined as the product of mass and velocity. 


(a) Newton’s Second Law of Motion states that the rate of change of momentum (of an 
object) is equal to the force applied (to that object). Write this down as an equation. 


(b) Acceleration a is defined as the rate of change of momentum. Explain why Newton’s 
Second Law simplifies to F = ma if mass is constant. (Answer on p. 1795.) 


Exercise 189. In Part V (Calculus), Fact 221, we will formally state and prove that 


1 
Sete 


dx c 
aa pauls d 
But assuming = is true, we can quite easily prove that a exp’ = exp a: 
a 


d 
(a) Use the Chain Rule to write down an expression for ae In (exp z). 
z 


(b) What do you observe about the expression In (expx)? Use this observation to write 


down another expression for a In (exp 2). 
. 


(c) Then conclude that = exp Z = exp Z. (Answer on p. 1796.) 
ay 


Remark 87. By the way, no need to mug the five derivatives below because they’re in List 
MF26 (p. 3). (We'll review the inverse trigonometric functions sin™', cos"', and tan! in 
Ch. 36. Also, we’ll restate the following as Fact 207 in Part V.) 


2 
l+x 
cosecx —cosecx cotx 


Sec X sec x tan x 
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43.8. Stationary and Turning Points 


Earlier Definition 56 gave an informal definition of stationary points. 


We can now formally define a stationary point (of a function) as any point at which that 
function’s derivative equals zero: 


Definition 115. A point x is a stationary point of a function f if f’ (x) =0. 


Example 651. Define f:R— R by f (x) = 2. 
The derivative of f is the function f’:IR > R defined by f’ (x) = 22. 


We now check if f has any stationary points. By the above definition, a is a stationary 
point of f if and only if 


i (a= 0 — 2a=0) 


Hence, f has only one stationary point, namely 0. (We’ve labelled this point as AK in the 
graph below.) 











K 


When searching for stationary points, we will often write 
f' (a) =0. 


Because = appears so often, it has a name: the First-Order Condition (FOC). 





We reproduce from Ch. 12.4 our definition of a turning point: 
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Example 652. Continue to define f:R > R by f (x) = 2’. 


Above we showed that f has only one stationary point, namely 0 (labelled as K in the 
graph). 











K 


Observe that K is also a strict local minimum of f. 


Since K is both a stationary point and a strict local minimum of f, by the above definition, 
kK is a turning point of f. 








2AT he term turning point appears on your H2 Maths syllabus (pp. 5, 15) and these A-Level exam questions: 
N2016/1/3, N2014/1/4, N2008/1/9, N2015/I/11, N2012/1/8, N2010/I/6, N2010/II/3, N2009/1/11. 


495, Contents www.EconsPhDTutor.com 


Example 653. Consider g: R > R defined by g (x) = 2”. 
Then D = (0,0) is a stationary point the derivative (or gradient) of g at K is zero: 


gq Oye 0. 











Moreover, D is also a turning point because it is both a stationary point and a strict 
extremum (in particular, it is a strict local maximum.) 


Example 654. The sine function has infinitely many stationary and turning points. For 
every integer k, the point J, = ((0.5+k) 7,1) is both a stationary and a turning point. 


ay 1 
Y Ig = (7.1) 





13 = (-= I= 


Note that at every even integer k, the point J, = ((0.5 +k) 7,1) is a strict local maximum. 
While at every odd integer /, the point J; = ((0.5 + 21) 7,1) is a strict local minimum. 





By definition, every turning point is a stationary point. 
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Stationary 
points 


Turning 
points 





But, subtle point—the converse is false. That is, a stationary point need not be a turning 
point: 


Example 655. Define f :R—R by f (x) = 2°. 
The origin (0,0) is a stationary point of f, because the derivative at that point is zero. 


However, (0,0) is not a turning point because it is not a strict extremum. 


aN 


= 








(0,0) is a stationary point 
but not a turning point 





As we’ll learn in Part V (Calculus), (0,0) is actually an example of an inflexion point. 
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Example 656. Define 7: R > R by 


-81, for x < -3, 
j (@) = 1 0.75a° - 3.7524 - 523 +3022, for -3<2<5, 
125, fOr G28, 


G = (5,125) 





B = (-3,-81) 


The derivative (or gradient) of 7 at each of the points A, C, D, E, F, and H is zero. 
Hence, each of these points is a stationary point. 


Observe that moreover, C' and F are strict maximum points, while D and F are strict 
local minimum points. Hence, the stationary points C, D, E, and F are also turning 
points. In contrast, A and H, which are not strict extrema, are not turning points. 


The derivative of 7 at each of the points B and G is not equal to zero. Indeed and as 
we'll learn later on in Part V (Calculus), the derivative of 7 at each of those points does 
not even exist! Hence, neither B nor G is a stationary point. Since B and G are not 
stationary points, they are not turning points either. 


(By the way, is 7 continuous everywhere? Differentiable everywhere?)*”° 





Every turning point is a strict local extremum. 


But, second subtle point—the converse is false. That is, a strict local extremum need not 
be a turning point: 





295The function 7 is continuous everywhere, but not differentiable everywhere. It is differentiable every- 
where except at B and G. 
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Example 657. The function 7 : [-1,00) > R defined by i(a) = x has a strict local 
minimum at J = (-1,-1). (Indeed, J is also a strict global minimum. ) 


MN 








However, it can be shown that the derivative of i at I is not equal to zero.2”° So, I is not 
a stationary point and cannot be a turning point either. 








296Some writers define differentiability so that i is not differentiable at J. In contrast, others (including 
this textbook) define differentiability so that i is differentiable at J and specifically, 7’(-1) = 1. Either 
way, i’ (-1) #0. 
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Example 658. Define f : {-5,1,2} > R by f (-5) =4, f (1) =1, and f (2) =-3. 


Let A = (-5,4), B = (1,1), and C = (2,-3) be points. The graph of f is simply the set 
{A,B,C} of three (isolated) points. 


As explained earlier in Example 243, each of A, B, and C is both a strict local maximum 
and a strict local minimum of f. 


A=(-5,4) 
® 





® 
C20 





However, it turns out that differentiability is formally defined so that a function is never 
differentiable at an isolated point. So here, f is not differentiable at A, B, or C (all 
isolated points). 


Hence, f is nowhere-differentiable. Thus, f has neither stationary points nor turning 
points. 
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i) 


Example 659. Define f: \-5 | >R by f (x) = 2° +22*+2°. 


ee 





A 





Y 





Global maximum 





Strict global maximum 





Local maximum 





Strict local maximum 





Global minimum 





Strict global minimum 





Local minimum 





Strict local minimum 





Turning point 





Stationary point 














J 





D is a stationary but not a turning point. (As we’ll learn in Ch. 96, D is an example of 
an inflexion point.) 


A and FE are extrema but not turning points. 
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Exercise 190. For each of the following functions, identify all turning points. 


(a) f :IR—R defined by f(z) =27+1. 
(b) g:[-1,1]-R - Gia) =a 
(c) h:R>R : Dy = cos: 
(d) i:[-1,1]-R ia) S=1c0s ec, (Answer on p. 1796.) 


Exercise 191. Explain whether each of the points A—H in the graph below is a turning 
and/or stationary point. (Answer on p. 1796.) 


I 
Y 


Assume the graph is 
“smooth” everywhere 
except at two points. 








Exercise 192. Is each of the following statements true or false? If true, explain why. If 
false, give a counterexample from Example 659.) (Answer on p. 1797.) 
(a) Every maximum point or minimum point is a stationary point. 

(b) Every maximum point or minimum point is a turning point. 

(c) Every stationary point is a maximum point or minimum point. 

(d) Every turning point is a maximum point or minimum point. 

(e) Every turning point is a stationary point. 


(f) Every stationary point is a turning point. 
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44. Conic Sections 


In this chapter, we'll study the graphs of the following eight equations. All eight are 
examples of conic sections.” 


We first study the unit circle and ellipse centred on the origin: 







































































eae 
Tae y" = | he 
a2? 
We then study six types of hyperbolae: 
2 42 
eee gy? =1 lens Se 

x a? 6 

if ee _ mae FOE rc 
ba — dxt+e — drte 























Fun Fact 


Hyperbola has the same etymology as hyperbole (an exaggeration or overstatement). 





By the way, we’ve actually already studied one example of a conic section—this was the 
eraph of the quadratic equation y = ax? + bx +c, which is a type of conic section called the 
parabola. 





2°7For why these are called conic sections, see Ch. 142.20 (Appendices). 
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44.1. The Ellipse 17+ y? =1 (The Unit Circle) 


We already discussed the circle in Ch. 10. It turns out that the circle is an example of a 
conic section. And so, for completeness, here we'll just do a quick recap. 


The graph of x? + y” = 1 is the unit circle centred on the origin: 





(0,1) is a strict 
global maximum. 





(0,-1) is a strict 
global minimum. 


= 





Some characteristics of the above graph: 







Az+Az=1 


1. Intercepts. The y-intercepts are (0,-1) and (0,1). The x-intercepts are (-1,0) and 


(1,0). 


2. Turning points. The two turning points?” are (0,1) (strict global maximum) and 
(0,-1) (strict global minimum). (This is merely by observation. We haven’t actually 
proven that these two points are the graph’s turning points. ) 


3. Asymptotes. None. (Again, this is merely by observation. We haven’t actually proven 


that the above graph has no asymptotes.) 


4. Symmetry. Every line that passes through the origin is a line of symmetry. No other 
line is a line of symmetry. (Again, these two assertions are merely by observation. We 


haven’t actually proven them.) 





298 Actually, we haven’t defined turning points for graphs yet. See Ch. 142.7 (Appendices) for a possible 


definition. 
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2 2 


44.2. The Ellipse — re 
a iP 


To get from x? + y? = 1 to (x/a)” + (y/b)” = 1, perform these two transformations:2”” 


1. To get (x/a)’ +y” = 1, stretch x? + y? = 1 horizontally, outwards from the y-axis, by a 
factor of a. 

2. Next, to get (x/a)? + (y/b)” = 1, stretch (z/a)’ + y? = 1 vertically, outwards from the 
x-axis, by a factor of b. 


Thus, (z/a)” + (y/b)” = 1 is simply the unit circle stretched horizontally and vertically by 
factors of a and b. We call this “elongated” or “imperfect” circle an ellipse. % 


This ellipse’s centre is (0,0).°%° 





(0, b) is a strict 
global maximum. 






Line of symmetry 
f=) 






Line of symmetry 
y=0 






(0,—-b) is a strict 
global minimum. 


1. Intercepts. The y-intercepts are (0,-b) and 0,b. The z-intercepts are (-a,0) and 
(a, 0). 

2. Turning points. By observation, the two turning points are (0,b) (a strict global 
maximum) and (0,-b) (a strict global minimum). 


3. Asymptotes. By observation, none. 





29°Read Ch. 26 if you haven’t already. 

3°0Here’s one way we can define the centre of an ellipse: First, the ellipse has two lines of symmetry—one 
intersects the ellipse at the points A and B while the other intersects at C and D. (By the way, A, 
B, C, and D are called the ellipse’s vertices.) If |AB| < |CD|, then we call the lines AB and CD the 
ellipse’s minor axis and major axis, respectively. And vice versa. (If |AB| = |CD]|, then we have a 
circle.) The centre of the ellipse may then be defined as the point at which the two axes intersect. 
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4. Symmetry. By observation, if a # b, then there are only two lines of symmetry, namely 
y = 0 (the x-axis) and x = 0 (the y-axis). (Note that if a = 6, then this ellipse is in fact a 
circle and there are again infinitely many lines of symmetry.) 


Exercise 193. Given a,b,c,d € R and a,b + 0, graph the equation below. Label any 
turning points, asymptotes, lines of symmetry, and intercepts. (Hint in footnote, 


(x +c) (y+d) _ 
a2 . b2 





1. (Answer on p. 1820.) 





The rest of this chapter will look at six examples of hyperbolae. Our first and also the 
simplest example of a hyperbola is y = 1/2: 





3°lTo find the y-intercepts, plug in x = 0. To find the z-intercepts, plug in y = 0. 
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1 
44.3. The Hyperbola y= — 
i 


All hyperbolae we’ll study will share some common features: 


1. There’ll be two branches—y = 1/x has a bottom-left branch and a top-right branch. 
. There may or may not be x- and y-intercepts—y = 1/z has neither. 


. There may or may not be turning points—y = 1/x has none. 


Ee Ww Nb 


. There’ll?? be two asymptotes—y = 1/x has horizontal asymptote y = 0, because as 
r>-oo, y>0 andasz>o, y>0*. Also, y = 1/x has the vertical asymptote x = 0, 
because as 7 > 0°, y > -o0 and as x > 0", y> ow. 


A rectangular hyperbola is any hyperbola whose two asymptotes are perpendicular— 
thus, y = 1/x is an example of a rectangular hyperbola. 


5. The hyperbola’s centre is the point at which the two asymptotes intersect?”’—y = 1/zx 
has centre (0,0). 

6. There'll be two lines of symmetry—ceach (a) passes through the centre; and (b) 
bisects an angle formed by the two asymptotes. 


For y = 1/z, they’re y = x and y = -x. Observe that indeed, each (a) passes through the 
centre; and (b) bisects an angle formed by the two asymptotes. 






Line of symmetry 
ti. Eee 


y=- Line of symmetry _ 
5 yen eet 


- 


~ 
~ 






~ 
~ 
~ 


Horizontal asymptote Oe 
y=0 







Centre : 
(0,0) ee, 


Vertical asymptote 
c= 





3°? Actually, we haven’t defined asymptotes for graphs yet. See Ch. 146.5 (Appendices) for a possible 
definition. 

3°3For simplicity, this shall be this textbook’s definition of a hyperbola’s centre. 
Note though that in the usual and proper study of conic sections, the centre is instead defined as the 
midpoint of the line segment connecting the two foci. That the two asymptotes intersect at the centre 
is then a result rather than a definition. However, in H2 Maths, there is no mention of foci and so I 
thought it better to simply define the centre as where the two asymptotes intersect. 
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44.4. The Hyperbola 2? - y? = 1 


Consider the graph of x? - y? = 1. Observe that no x € (-1,1) satisfies the equation. So, 


the graph contains no points for which x € (1,1). ae 
Se y we 
Ss. Oblique asymptote Oblique asymptote ,-77 
Ns. Y= re eS, 
gy = hu we 
ri a Line of symmetry 
i 9 we y = 0 
—]| ra RS il L 
Pa Centres. 
ae (0, 0) ii 
ae Line of symmetry S 
oa L= 0 “SN 
1. Two branches—one on the left and another on the right. 
2. Intercepts. The z-intercepts are (-1,0) and (1,0). There are no y-intercepts. 
3. Turning points: None. 
4. As x > -o0, y > +x. And as x > o, y > +x. So, 2?-y’ = 1 has two oblique 


asymptotes y = +”. 
Since the two asymptotes y = x and y = —x are perpendicular, this is again a rectangular 
hyperbola. (In fact, we call this an “east-west” rectangular hyperbola. ) 

5. Centre: (0,0). 


6. Two lines of symmetry: y = 0 (the z-axis) and x = 0 (the y-axis). Observe that each 
(a) passes through the centre; and (b) bisects an angle formed by the two asymptotes. 
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2 
44.5. The Hyperbola — —--—~=] 
a 


Consider the graph of (a/a)? - (y/b)° = 1. Again, no x€ (-a,a) satisfies this equation. So, 
the graph contains no points for which x € (-a,a). 


To get from x? - y? = 1 to (x/a)’ — (y/b)” = 1, perform these two transformations: % 
1. To get (x/a)? —y? = 1, stretch x? - y? = 1 horizontally, outwards from the y-axis, by a 
factor of a. 


2. Next, to get (x/a) - (y/b)” = 1, stretch (x/a) — y? = 1 vertically, outwards from the 
x-axis, by a factor of b. 


eh Y ‘ Ps 
my Oblique asymptote Oblique asymptote Ps 
wie b b re 
+ Yo=—z CF agg -" 
i‘ o a a Pe 
Line of symmetry he Ps 
gS 0 ee a 
-—a a Rs, a 4 
Pa Centre .. 
ae (0,0) te 
ae Line of symmetry aS 
a v= 0 ie 


. Two branches—one on the left and another on the right. 
. Intercepts. The xz-intercepts are (-a,0) and (a,0). There are no y-intercepts. 


. Turning points: None. 


~ W DN 


Asx > -0, y > +bx/a. And as x > 0, y > +bx/a. So, (z/a)’ - (y/b)* = 1 has two 
oblique asymptotes y = +ba/a. 


Since the two asymptotes y = br/a and y = —bx/a are perpendicular, this is again a rect- 
angular hyperbola. (This is again an “east-west” rectangular hyperbola. ) 


5. Centre: (0,0). 


6. Two lines of symmetry : y = 0 (the x-axis) and x = 0 (the y-axis). Observe that each 
(a) passes through the centre; and (b) bisects an angle formed by the two asymptotes. 
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yr x 
44.6. The Hyperbola — -—=1 
b? a? 


Take (x/a)°—(y/b)° = 1 (from the last subchapter), but switch a and b to get 22/b?-y?/a2 = 1. 
Note that this is also an east-west rectangular hyperbola, but with x-intercepts (+b, 0) 
instead of (+a,0). 


To get from 2?/b? — y?/a? = 1 to y?/b* - x7/a? = 1, apply any one of these transformations: 


7 TT 
Rotate 5 clockwise. Rotate B anticlockwise. Reflect in the line y = 2. 


Observe that no y € (—b,b) satisfy y?/b? — x?/a? = 1. So, this graph contains no points for 
which y € (-b, b). 


ae Y ig 
Me ya 
Oblique asymptote ~> a 
b ~S b rs 
y=-—0 oe F 
a ee Po 
— rs Line of symmetry 
Ts ra y =0 
re 7 Oman be 
ra ht (0, 0) 
Ps —b "ee 
Oblique asymptote  |-” a 
b e o> 
ye we = 
ag 7 on 
Ry Line of symmetry om 
wo z=0 ae 


1. Two branches—one above and another below. 
2. Intercepts. The y-intercepts are (—b,0) and (b,0). There are no x-intercepts. 


3. Two turning points: (0,0) and (0,-b)—the former is a strict local minimum, while 
the latter is a strict local maximum. 


4. As x > -o0, y > +br/a. And as 2 > 00, y > +bx/a. So, y?/b? — x?/a? = 1 has two 
oblique asymptotes y = +ba/a. 


Since the two asymptotes y = br/a and y = —bx/a are perpendicular, this is again a rect- 
angular hyperbola. (In fact, we call this a “north-south” rectangular hyperbola. ) 


5. Centre: (0,0). 
6. Two lines of symmetry: y =0 (the z-axis) and x =0 (the y-axis). Each (a) passes 
through the centre; and (b) bisects an angle formed by the two asymptotes. % 
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44.7. The Hyperbola y = ceaahs 
rte 





2+br+e 
In the next subchapter, we’ll study y = a i Sell 
dx+e 





bx +c 
But to warm up, let’s first study the simpler case where a = 0—i.e. y = ae 
L+e 
We’ll assume that d #0 and cd- be #0. This is because 
¢ If d=0, then this is simply a linear equation; and 


¢ If cd-be =0, then as we’ll show below, this is simply the horizontal line y = b/d. 
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22 +1 
Example 660. Consider y = —. Do the long division: 
x 








. Two branches—one on the top-left and another on the bottom-right. 

. Intercepts. Plug in x = 0 to get y = (2-0+1)/(0+1) = 1—the y-intercept is (0,1). 
Plug in y = 0 to get 2x + 1 =0 or x = -1/2—the z-intercept is (-1/2,0). 

. Turning points: None. 


4.Asz—>-1°,y7oo. Andas x > -1*, y > -o0. So, y = (22 +1)/(x+1) has vertical 
asymptote x = —1. (Not coincidentally, this is the x-value for which xz + 1 = 0.) 


As «> -o, y> 2*. Andasx>0,y>2-. So, y = (2x4 +1)/(#+1) has horizontal 
asymptote y = 2. (Not coincidentally, this is the quotient in the above long division.) 


Since the two asymptotes y = 2 and x = —-1 are perpendicular, this is again a rectan- 
gular hyperbola. 


. Centre: (-1,2). (The centre’s coordinates are simply given by the vertical and hori- 
zontal asymptotes. ) 


. Two lines of symmetry: y=27+3 and y=-x+1. 
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In general, given the hyperbola y = (bx +c) /(dx +e), here’s how to find its intercepts, 
asymptotes, and centre: 


e Intercepts 


Plug in x = 0 to get y = (b-0+c)/(d-0+e) =c/e and thus the y-intercept (0,c/e). (Note 
that if e = 0, then c/e is undefined and there is no y-intercept.) 

Plug in y = 0 to get bu +c =0 or x = -c/b. And thus, the x-intercept is (—c/b,0). (Note that 
if b= 0, then —c/b is undefined and there is no x-intercept.) 


« Asymptotes 


The value of x for which dx+e =0 is x = -e/d. Thus, the vertical asymptote is x = —e/d. 


To find the horizontal asymptote, do the long division: 





b/d 
dx +e | bx +C 
bx  +be/d 





c—be/d 


br+c b c-% b cd-be 1 


Th 2 2 | 
ee de+e d dzv+e ad @ ated 





The quotient b/d gives us the horizontal asymptote y = b/d. 


Note that since we always have two perpendicular asymptotes (one horizontal and another 
vertical), y = (bx +c) / (dx +e) is a rectangular hyperbola. 


(By the way, note that if cd—be = 0, then this hyperbola is simply the horizontal line y = b/d. 
This is why above we imposed the condition that cd - be # 0.) 


e We’ve defined the centre to be the point at which the two asymptotes intersect. And 
so, the centre is simply (-e/d,b/d). (These coordinates are simply given by the vertical 
and horizontal asymptotes. ) 


¢ The two lines of symmetry are y = +x + (b+ e) /d. 


You need not know where the above two lines of symmetry come from (but see the proof of 
Fact 108 if you’re interested). Indeed, you need not even mug them. All you need remember 
are the following two points: The two lines of symmetry 


(a) Pass through the centre; and 
(b) Have gradient +1. 


The second point (b) implies that the two lines of symmetry may be written as y=x+a 
and y = -x + 3, where aand ( are constants you can easily find. Examples: 
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(nae: 
Example 661. Consider y = — Do the long division: 
& 


3.9 


ice a a 
De en a ea 


yan ys 








. Two branches—one on the top-left and another on the bottom-right. 

. Intercepts. Plug in x = 0 to get y = 3/4. So, the y-intercept is (0,3/4). 
Plug in y = 0 to get 7x +3 =0 or x = -3/7. So, the x-intercept is (-3/7,0). 

. Turning points: None. 


4. Asymptotes. The value of « that makes the denominator 0 is -2—hence, the vertical 
asymptote is x = —2. 
The quotient in the long division is 7/2—hence, the horizontal asymptote is y = 7/2. 
Since the two asymptotes x = —2 and y = 7/2 are perpendicular, this is again a rect- 
angular hyperbola. 
. Centre: (—2,7/2). (These coordinates are given by the vertical and horizontal asymp- 
totes.) 


. The two lines of symmetry may be written as y = «+a and y = -% +6 and pass 
through the centre (-2,7/2). Plugging in the numbers, we find that a = 11/2 and 
G = 3/2. Thus, the two lines of symmetry are y = x + 11/2 and y = -x + 3/2. 
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—52 +1 
Example 662. Consider y = 3 ai 
x 


. Do the long division: 
+2 


—5/3 
3r +2 | =o7 +1 
-5a -10/3 
13/3 





eee 13/3 
ee ee eee 











. Two branches—one on the top-left and another on the bottom-right. 

. Intercepts. Plug in x = 0 to get y = 1/2. So, the y-intercept is (0, 1/2). 
Plug in y = 0 to get -5x + 1=0 or x =1/5. So, the x-intercept is (1/5,0). 

. Turning points: None. 

4. Asymptotes. The value of x that makes the denominator 0 is -2/3—hence, the 

vertical asymptote is x = —2/3. 
The quotient in the long division is -5/3—hence, the horizontal asymptote is y = —5/3. 
Since the two asymptotes x = -2/3 and y = —5/3 are perpendicular, this is again a 
rectangular hyperbola. 

. Centre: (-2/3,-5/3). (These coordinates are given by the vertical and horizontal 
asymptotes. ) 


. The two lines of symmetry may be written as y = x +a and y = -% +3 and pass 
through the centre (-2/3,-5/3). Plugging in the numbers, we find that a = -1 and 
6 = -7/3. Thus, the two lines of symmetry are y = 2-1 and y = -x-7/3. 
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bx +c 





The following Fact summarises the features of the hyperbola y = ae 
xr+e 


Fact 108. Let b,c,d,e¢ R with d#0 and cd- be #0. Consider the graph of 


br+e 


dx+e 


(a) Intercepts. If e # 0, then there is one y-intercept (0,c/e). (If e = 0, then there are 
no y-intercepts.) And if b #0, then there is one x-intercept (-c/b,0). (If b = 0, then 
there are no x-intercepts.) 


(b) There are no turning points. 


c) There is the horizontal asymptote y = b/d and the vertical asymptote x = —e/d. 
NA 
(The asymptotes are perpendicular and so, this is a rectangular hyperbola.) 


(d) The hyperbola’s centre is (-e/d, b/d). 


(e) The two lines of symmetry are y = +2 + (b+e) /d. 

















Proof. We proved (a), (c), and (d) above. For (b) and (e), see p. 1609 (Appendices). 


Exam Tip for Towkays 


For the hyperbola y = (bx +c) /(dx+e), you should know how to find (a) the z- and 
y-intercepts; and (c) the horizontal and vertical asymptotes. 


You're not required to know what (d) the hyperbola’s centre is, but since this is simply 
the intersection of the two asymptotes (which you already know how to find), you may 
as well know about it, as it will help you sketch better graphs. 


You're also not required to know how to find (e) the equations of the two lines of sym- 
metry. But as we’ve shown in the above examples, it’s not very difficult to figure out 
their equations. It’s certainly not very difficult for you to at least sketch them. 


After you are done with Part V (Calculus), there is a small possibility that you are 
required to prove that (b) this hyperbola has no turning points. (And so you may or 
may not be interested in reading the proof of (b) (Appendices).) 


Exercise 194. Graph and describe the features of each equation. 


(Answer on p. 1822.) 


(Answer on p. 1823.) 


(Answer on p. 1824.) 
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2+b 
44.8. The Hyperbola y = ——— 
x+e 


We now study the equation y = (ax? + br +c) / (dx +e). We'll assume that a #0, d#0, and 
either c #0 or e #0. This is because 

e If a=0, then this is simply the equation we studied in the last subchapter. 

e If d=0, then the equation is quadratic and we’ve already studied that. 


e If c=0=e, then the equation is linear and we’ve already studied that. 


Example 663. Consider y = (x? +1) /z. 


Do the long 


division: 





. Two branches—one on the bottom-left and another on the top-right. 
. Intercepts. If we plug in x = 0, then y is undefined. Thus, there are no y-intercepts. 


And if we plug in y = 0, then 2? +1 = 0, an equation for which there are no (real) 
solutions. Thus, there are no x-intercepts. 


. Two turning points: (-1,-2) is a strict local maximum and (1,2) is a strict local 
minimum. 


. Asymptotes. The value of x that makes the denominator 0 is 0—hence, the ver- 
tical asymptote is x = 0. The quotient in the long division is x—hence, the oblique 
asymptote is y = x. (By the way, here for the first time, the asymptotes here are not 
perpendicular and so, this is a non-rectangular hyperbola. ) 


. Centre: (0,0). Recall that the centre is simply the point at which the two asymptotes 
intersect. In this example, the intersection of the asymptotes x = 0 and y = x is (0,0), 


6. Two lines of symmetry: y = (1 + v2) - 
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Here are the features of the hyperbola y = (ax? +bx+ c) / (dx +e). 


Intercepts. Plug in x = 0 to get y = c/e and thus the y-intercept (0,c/e). (Note that 
if e = 0, then c/e is undefined and there is no y-intercept. This was the case in the last 
example.) The x-intercepts are given by the values of x for which ax? + br + ¢ = 0: 


a q 


2a 


Note that if b? - 4ac < 0, then there are no 2z-intercepts (this was the case in the last 
example). And if b? —4ac = 0, then there is exactly one x-intercept, namely (—)/ (2a) ,0) 


Asymptotes. The value of x that makes the denominator dx + e zero is x = —e/d and 
gives us the vertical asymptote x = -—ce/d. To find the other oblique asymptote, do 
the long division: 


ar/d +(bd-ae) /d? 








dx +e | ax? +bx +C 
ax” +aex/d 
(bd -ae)x/d +C 


(bd —ae)a/d + (bde - ae”) /d? 
(cd? + ae? — bde) /d?. 





a bd-ae cd? +ae? —bde 1 








d d? Ge dxt+e 
ee eee 
Quotient Remainder 


The quotient gives us the oblique asymptote y = ax/d+(bd-—ae)/d’. (Since the 
asymptotes are not perpendicular, this hyperbola is not rectangular.) 


The centre is the point at which the two asymptotes intersect. Its x-coordinate is given 
by the vertical asymptote x = -e/d. For its y-coordinate, plug x = —e/d into the equation 
of the oblique asymptote: 





af{e bd-ae -ae+bd-ae bd-2ae 
y= 5(-5)+ ee @- 


bd — 2 
Thus, the centre is (-e/4 ~~). 


d2 


You need not know how to find the equations of the lines of symmetry. 


You should however know how to roughly sketch them. So, just remember that they 
(a) pass through the centre; and (b) bisect the angles formed by the two 
asymptotes. 


518, Contents www.EconsPhDTutor.com 


243241 
Example 664. Consider y = a Do the long division: 
x+ 


xe? + 3x41 il 
ee : 
c+ 1 crt+1 


. Two branches—one on the left and another on the right. 
. Intercepts. Plug in x = 0 to get y= 1/1 =1. Thus, the y-intercept is (0,1). Plug in 
y = 0 to get x7 +32 +1=0—thus, the two z-intercepts are (0.5 (-3 + v5) : 0). 
3. Turning points: None. 


4. Asymptotes. The value of « that makes the denominator 0 is -1—hence, the vertical 
asymptote is x=—1. The quotient in the long division is x + 2—hence, the oblique 
asymptote is y = x +2. 

. The centre’s x-coordinate is given by the vertical asymptote x = -1. For its y- 
coordinate, plug x = —1 into the oblique asymptote to get y = -1+2=1. Hence, the 
centre is (-1, 1). 


. Two lines of symmetry: y = (1 + v2) et+24V2. 





a y= (1+ V3) at 24 v3 


/ 
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Qn7+2%4+1 


. Do the long division: 
= ad 


Example 665. Consider y = 


2x7 +2r+1 

se ee ee 
—-£+1 r+1 —-£+1 

. Two branches—one on the top-left and another on the bottom-right. 

. Intercepts. Plug in x = 0 to get y=1/1=1. Thus, the y-intercept is (0,1). 


Plug in y = 0 to get 2x?+2x+1 = 0, an equation for which there are no (real) solutions. 
Thus, there are no x-intercepts. 


. Two turning points: (1 + 0.5V10,-6+2V 10). 


. Asymptotes. The value of x that makes the denominator 0 is 1—hence, the vertical 
asymptote is x = 1. The “quotient” in the long division is -2x —- 4—hence, the oblique 
asymptote is y = —2x — 4. 

. The centre’s x-coordinate is given by the vertical asymptote x = 1. For its y- 
coordinate, plug x = 1 into the oblique asymptote to get y = -2(1) -4 = -6. Hence, 
the centre is (1,-6). 


. Two lines of symmetry: y = (-2 + v5) t-42VJ5. 


er er i 
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You now know how to find its intercepts, asymptotes, and centre of the hyperbola?” 


ax? + bx +c 
dx+e 


And after we’ve done Calculus (Part V), you’ll also be able to find the turning points. 


To repeat, you do not need to know how to find the equations of the two lines of sym- 
metry. However, you should at least be able to roughly sketch them. 








34Fact 264 (Appendices) summarises the features of this hyperbola. 
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45. Simple Parametric Equations 


We can sometimes describe a graph (i.e. a set of points) using an equation. We can 
sometimes also describe a graph using parametric equations: 


Example 666. Let S = 1(@, yi 2? t+y? = 1} be the unit circle centred on the origin. 


By Fact 97, sin?t+cos?¢ = 1 for all t ¢ R. So, if we let x = cost and y = sint, then 
a? +y” = 1. Hence, we have a second way to write down the set S: 


S24 (o,u) 7a = cost, sim, £20) | 
In words, S is the set of points such that x = cost, y = sint, and t > 0. We call the variable 


t a parameter (hence the name parametric equations). 


As t increases from 0 to 271, we trace out, anti-clockwise, the unit circle: 


a) a) ae 1 0e b= 2 =— (9) =(-1,0), 
t=n/4 —> (2,y) = (2/2, V2/2), t = 5/4 => (2,y) = (-V2/2,-v2/2), 
bao 2—— (7) = (0.11), t= an) 2 (9-9) = (0. S1)2 
tod — = (0.7) = (-v2/2, v2/2), t= (0/4 == (2,4) = (v2/2, -/2/2). 


Arrows indicate Aite 1 

instantaneous 

direction of i“ (x,y) » (0.54, 0.84), 

travel. (Uz, Uy) * (-0.84, 0.54) , 

(Gz, @y) ¥ (-0.54, -0.84) . 

At t=0, 

ieee (x,y) = (1,9), 

SSaen oy =i) (vy, vy) = (1,0), 
24 (fy) 7 = costy =sing,t > 0} | (G5,0,) =(=1,0). 





ont 


Att=—, 
4 


=) 


) 


oa (- 


2 
(vase) -( ie 





2 
7) 


(a) -(¥ 


One natural interpretation of S is as the movement of a particle over time t. For example, 
at t =7ts, the particle is 1m west and 0m north of the origin. 
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Example 667. Let a,b >0. The graph of the equation x?/a? + y?/b? = 1 is the set 


U ={(a,y): 27/0? + y?/b? = 1}, 


which is an ellipse centred on the origin, with x-intercepts (+a,0) and y-intercepts 
(0, +b). 


Observe that by letting x =acost and y = bsint, we have 





a2cos?t 6? sin? t 
= f 


5 ; =cos?t+sin?t = 1. 
a b 


So, we can use parametric equations to rewrite the set U as 
U =4(2,4)i@=acost,y =bsint,t2 0}. 


/\ 








Again, we can interpret U as the movement of a particle over time ¢. And again, the 
particle is travelling anticlockwise. 
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Example 668. The graph of x? — y? = 1 is the set 


W = {(c,y):2?-y? =I}, 


which is an “east-west” hyperbola centred on the origin, with z-intercepts (+1, 0). 


By Fact 97(a), sec?t — tan?¢ = 1 for all te R\ {ka:k eZ}. So, if we let x = sect and 
y = tant, then x? - y* = 1. Hence, this gives us another way of writing the set W: 


W241 (a, 9). @ =sect,y=tant, 20}. 
Again, we can interpret W as the movement of a particle A over time t. 


W = {(a,y) ia? —y? = 1} Vy 


= 5 = — > 
{(z, y) x= sect, y =tant,t 2 O} During fe [0, 0.570), Pad 


hones A is moving northeast. 
Arrows indicate 8 


instantaneous 
direction of travel. 


At t=0, 
au) = (1,0), 
Ug, Vy) = 0, 1 ; 
During ¢ € (0.57, 1.572), he : | i 
A moves upwards along x) dy ,U). 
the left branch. 


a 


An instant after t = 0.57, 
A magically reappears 
“near” “bottom-left infinity”. 





At each instant t, A’s velocity in the x- and y-directions is given by 








dg =) _ (== dtant 


(U0, = (= zr = (sect tant, sec” ole 


(Example continues on the next page ...) 


dé “di 


Exercise 196. Continuing with the above example, write down A’s acceleration in the 
x- and y-directions at the instant t. (Answer on the next page.) 





524, Contents www.EconsPhDTutor.com 


. Example continued from the previous page.) 


d x 
air (sect tant) =secttant tant + sect sec? t 
= sect (tan? ¢ + sec? t) = sect (2sec*t = 1 


du d 
Le (sec? t)  Osect sect tant = 2sec? t tant. 


Oa el = a 


sega 
dt2’ dt2} \ dt’ dt 


Thus, (a,9¢,) = ( 


= (sect (2 sec” t - 1) ,2sec?ttant). 


Note that at t = 0.571, 1.571, 2.571,,..., both sect and tant are undefined. And so, we’ll 
say that at these instants in time, the particle A’s position, velocity, and acceleration are 
simply undefined. 


Observe that interestingly, v, =sec”t > 0 for all t (for which sect is well-defined). Hence, 
the particle A is always moving upwards (except during the aforementioned instants in 
time when its velocity is undefined). 


Ati =0, (x,y) = (secO, tan 0) = (1,0), 
(vz, Vy) = (sec 0 tan 0, sec” 0) = (0, 1), 


(a;, ay) = (sec 0 (2sec? 0 - 1) ,2sec? 0 tan 0) = (1,0). 


So, A starts at the midpoint of the right branch of the hyperbola, is moving upwards at 
1ms', and is accelerating rightwards at 1ms_‘.] 


During t € [0,71/2), the particle A is moving northeast. As t > 71/2, it “flies off” towards 
the “top-right infinity” (co, co) and 


L,Y, Vx, Vy, Ax, Ay 7 ©}. 


An instant after t = 7/2, A magically reappears “near” “bottom-left infinity” (—0o, -00). 


During t € (0.571, 1.57t), the particle travels upwards along the left branch of the hyperbola. 
And again, as t > 1.571, it “flies off” towards “top-left infinity” (—co, co) and we have 


L, Vz, Ay > —00 and UC Oe Oo: 


Exercise 197. Continue with the above example. (Answer on p. 1829.) 


(a) What happens to particle A an instant after t = 1.57? 
(b) Describe A’s movement during t € (1.571, 2.57). 
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45.1. Eliminating the Parameter t¢ 


Above, we rewrote a single (cartesian) equation into a pair of (parametric) equations and 
in the process introduced the parameter t. Now we'll go in reverse—we’ll rewrite a pair of 
(parametric) equations into a single (cartesian) equation and in the process eliminate the 
parameter f. 


Example 669. Consider the set S' = te Ge sp - At, y 2t- Lee 0}. 


As usual, we can interpret S as the motion of a particle P in the plane, where t¢ is time 
(seconds), while x and y are P’s rightward and upward displacements (metres) from the 


origin. With a little algebra, we can combine + and 2 intoa single equation and eliminate 
the parameter tf: 

e First rewrite y2t- las t2y+l. 

« Then plug 2 into = to get x= (y+1)?-4(y+1) =y° — 2y—3. 

e Observe also that £20 <> y20-1=-1. 

Altogether, S = 12, Wee=y = 29-3 92 -1}. 


Note that here we have a quadratic equation. Our quadratic equations have usually been 
the variable x, but in this case, it is in the variable y. 


The set S is the black graph below and does not include the grey portion. At t =0, P 
starts at the position (x,y) = (0,-1) and is travelling northwest. % 


IN 
Y 


After ¢ = 2, 
At t =2, P moves rightwards. 


zy) =(-4,1 
(ony) 0.1) | ue 
(az, ay) = (2,0) (2) = (1, 1+ v5) 


ce Uy) = (0, 1) 
P always moves (ae.8)) = (2,0) 
upwards at lms‘. 





Starting 
point 
During ¢ € [0, 2), At t=0, ei 

P moves leftwards. 

(x, y) = (0, =) 


(Uz, Vy) = (-4, 1) 
(az, Ay) = (2,0) 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Here are P’s velocity and acceleration, decomposed into the xz- and y-directions: 


dy, da, da 2 day d*y 
dp 








U Oe = Qs = = eH 
a dt dt? a dé ) dt 


In the y- or upwards-direction, P moves at a constant velocity of 1ms' and does not 
accelerate. 

In the x- or rightwards-direction, P moves at velocity 2t-4ms 7 and accelerates at a 
constant rate of 2ms”. In particular, 

e When ¢t < 2, we have 2t-4<0 and so P is moving leftwards. 


e At t = 2, the particle is at the leftmost point of the parabola and its velocity in the 
z-direction is Oms?. 


e And when t > 2, we have 2t-4>0 and so P is moving rightwards. 


Example 670. Consider the set U = { (x,y) :¢ + 2cost— Ay 2 3sint+ eae of. 


Again, we can interpret U as the motion of a particle Q in the plane. With a little 
oul ae a 
algebra, we can combine = and = into a single equation and eliminate the parameter f: 


A 

e First rewrite x e 2cost — 4 as cost = “— 
y-1 
- 

¢ Now recall that by Fact 97(a), sin?t+cos*t = 1 for all te R. 


e Next rewrite y 2 3sint+1 as sint = 


So, we can rewrite the set U as 


v= (eon (A) | 


This describes an ellipse centered on (-4, 1). x 


Since x and y can be expressed in terms of trigonometric functions, they must be periodic. 
That is, the particle Q will keep repeating its movement along a certain path. And so, 
we need not include any constraints for x and y. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Starting 
point 





At t=0, 
(x,y) = (-2,1) 
(Uz, Vy) = (0,3) 
(az, dy) = (-2,0) 





The velocity and acceleration of Q, decomposed into the x- and y-directions, are 


d d 
(ie = =-2sint, Uy= i 3cost, 








At t=0, Q’s starting position and velocity are 
(x,y) = (2cos0 — 4,3sin0+ 1) = (-2,1) and (vz, Vy) = (—2sin0,3cos0) = (0,3). 


So, it starts at the rightmost point of the ellipse and is moving upwards at 3ms !. Thus, 
its direction of travel is anticlockwise around the ellipse. Every 271s, Q@ completes one 
full revolution around the ellipse. 


Exercise 198. The sets A, B, and C below describe the positions (metres) of particles 
A, B, and C at time t (seconds), relative to the origin. For each set, 


(i) Rewrite the set so that the parameter t¢ is eliminated. 
(ii) Sketch the graph. 


(iii) Describe the particle’s position and velocity as time progresses. 

(a) Aa4 (ey) eat ly ane 1 ee 0} (Answer on p. 1831.) 
b) B= 2 = — 

(b) B= {(w,y) x= > 
(Ce = Ci = ein 1 cose 2 0 (Answer on p. 1833.) 


ae ane ee 0}. (Answer on p. 1832.) 
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Part II. 
Sequences and Series 
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[T/here is nothing as dreamy and poetic, nothing as radical, subversive, and 
psychedelic, as mathematics. 


— Paul Lockhart (2002, 2009). 


[M]athematics is capable of an artistic excellence as great as that of any music, 
perhaps greater; not because the pleasure it gives (although very pure) is compa- 
rable, either in intensity or in the number of people who feel it, to that of music, 
but because it gives in absolute perfection that combination, characteristic of great 
art, of godlike freedom, with the sense of inevitable destiny; because, in fact, it 
constructs an ideal world where everything is perfect and yet true. 


— Bertrand Russell (1902). 


Beauty is Truth, Truth Beauty—That is all 
Ye know on Earth, and all ye need to know. 


— John Keats (1819). 
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46. Sequences 


Informally, a sequence is simply an ordered list of real numbers. 
Example 671. The Fibonacci sequence is 
(Tell 2,3)0,8, 13,2 64 io, Bolas, | 


We call the numbers in a sequence its terms. 
So the Fibonacci sequence’s Ist term is 1, 2nd is 1, 3rd is 2, etc. 


The first two terms of the Fibonacci sequence are fixed as 1. Each subsequent term is 
then simply the sum of the previous two terms. So, 


the 3rd term is 1+1 = 2; the 7th term is 5+8 =13; 
the 4th term is 1+2 =3; the 8th term is 8+13 = 21; 
the 5th term is 2+3 =5; the 9th term is 13 + 21 = 34; 
the 6th term is 3+5 =8; etc. 


Letting f (nr) denote the nth term, the Fibonacci sequence may be defined by 


1 for 7 =1, 2, 


f(n-2)+f(m-1),  forn>3. 


Example 672. The sequence of square numbers is 
(1,4, 9, 16, 25, 36, 49, 64,...) 


This sequence’s lst term is 1, 2nd is 4, 3rd term is 9, etc. 


Letting s(n) denote the nth term, this sequence may be defined by s(n) = n?. 
Example 673. The sequence of triangular numbers is 
(1,3,6, 10, 15, 21, 28, 36,...) 


This sequence’s lst term is 1, 2nd is 3, 3rd term is 6, etc. 





Letting t(m) denote the nth term, this sequence may be defined by t(n) =1+2+---+n. 





3° Note though that some writers prefer using braces {} or angle brackets (). 
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The above sequences were infinite. But of course, sequences can also be finite: 
Example 674. The finite sequence of the first six Fibonacci numbers is 
(Aa ee ae 
We call this a finite sequence of length 6. 


Example 675. The finite sequence of the first seven square numbers is 


(1,4, 9, 16, 25, 36,49). 


We call this a finite sequence of length 7. 
Example 676. The finite sequence of the first four triangular numbers is 
(173,610). 


We call this a finite sequence of length 4. 
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46.1. Sequences Are Functions 
A little more formally, sequences are functions: 


Definition 116. A finite sequence of length k is a function with domain {1,2,... 


Definition 117. An (infinite) sequence is a function with domain Z* = {1,2,3,...}. 





We now formally rewrite our earlier examples of sequences as functions: 


Example 677. Formally, the Fibonacci sequence is the function f : Z* > R defined by 


ae i for 7-2 12. 
f(n-2)+f(m-1),  forn2>3. 


Example 678. Formally, the sequence of square numbers is the function s : Z* > R 
defined by s(n) = n?. 


Example 679. Formally, the sequence of triangular numbers is the function t: Z* > R 
defined by t(n) =1+2+---4n. 


Example 680. Formally, the finite sequence of the first six Fibonacci numbers is the 
function fg: {1,2,3,4,5,6} > R defined by 


At, for 7= 1,2. 
fe (n) = 
fe(n-2)+fe(n-1), forn=3,4,5,6. 


Example 681. Formally, the finite sequence of the first seven square numbers is the 
function s7: {1,2,3,4,5,6,7} > R defined by s7(n) =n’. 


Example 682. Formally, the finite sequence of the first four triangular numbers is the 
function t4: {1,2,3,4} > R defined by t4(n) =1+2+---+n. 








More examples: 
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Example 683. The function e: Z* > R defined by e (n) = 2n is the sequence of (positive) 
even numbers (2, 4,6,8,10,12,...). 


Example 684. The function g: Z* > R defined by g(n) = 2n? - 3n +3 is the following 
sequence (2, 5, 12, 23, 38, 57, 80, 107, 138, 173,...). 





Recall that a function need not “follow any formula” or “make any sense”. The same is 
true of sequences (since sequences are simply functions): 


Example 685. The function h: {1,2,3,4} + {Cow, Chicken} is defined by 
h(1)=Cow, h(2)=Cow, h(3)=Chicken, and h(4) = Cow. 


Although h does not seem to “make any sense”, it is a (perfectly) well-defined function. 


Indeed, the function h is also the following finite sequence of length 4: 
(Cow, Cow, Chicken, Cow). 


Although this sequence does not seem to “make any sense”, it is a (perfectly) well-defined 
finite sequence of length 4, simply because it is a function with domain {1,2,3,4}. 


Example 686. The function 7: {1,2,3} > {t, |, >,<, Punch, Kick} is defined by 
GOA) =). 0 72) =, and 7(6) = Punch 


Although j does not seem to “make any sense”, it is a (perfectly) well-defined function. 
Indeed, the function 7 is also the following finite sequence of length 3: 


({, >, Punch). 


Although this sequence does not seem to “make any sense”, it is a (perfectly) well-defined 
finite sequence of length 3, simply because it is a function with domain {1, 2,3}. 


Exercise 199. Formally rewrite each sequence as a function. (Answer on p. 1834.) 


(a) (1,4,9, 16, 25, 36, 49, 64, 81, 100). 

(ey eh 1 eee). 

(c) (1,8,27, 64, 125, 216, 343). 

(d) (2,6,6, 12, 10, 18, 14, 24, 18, 30, 22, 36, 26, 42,...). 
(e) (5,0, 99). 

(f) (1,2,6,24, 120, 720, 5 040, 40 320,...). 

(2) (120, 100) 10000, 100, OKO 1, 0000 MeO) Innes 
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46.2. Notation 


Let (a1, @2,...,a%) be a finite sequence of length k. This sequence can also be denoted as 
any of the following: 


Example 687. Let s; = 1, so = 4, s3 = 9, sg = 16, and s5 = 25. Then we can denote the 


finite sequence of the first five square numbers by 


5 
(1, 82, 83, $4, 85) = (Sn nai = (Sn) na1,2,3,4,5 = (Sn) ne(1,2,3,4,5} = (Sn) tenes - 





Again, s and n are merely dummy or placeholder variables. And here, n in particular may 
also be called an index variable, because it indexes or indicates which term in the 
sequence we’re referring to. 


We could replace s or n with any other symbol, like © or *. And so, we could rewrite the 
above example as 


Example 688. Let ©, = 1, ©2 = 4, ©3 = 9, ©4 = 16, and ©5 = 25. Then we can also 
denote the finite sequence of the first five square numbers by 





5 
(©), ©2, ©3, a, ©5) a (©, )y-4 a (94) .212345 ~ (©. ).6(1,2,3,4,5} = (Os reese 


Of course, it’s a bit strange to use symbols like © or *. The point here is simply to illustrate 
that once again, these are mere symbols that can be replaced by any other. We’ll usually 
stick to using boring symbols like letters from the Latin alphabet. 


Next, let (a ,@2,...) be an (infinite) sequence. This sequence can also be denoted as any 
of the following: 


(Gn)na-1 OF (Qn) n=1,2,... or (An)nez+ OF (An) nef1,2,...} or (an). 


Example 689. Let ¢, = 1, to = 3, t3 = 6, tg = 10, t5 = 15, etc. Then we can denote the 
(infinite) sequence of triangular numbers by 


(11,%3,...)/= (tn )net = (tn) na1,,... = (tn) nez* = (tn) ne{1,2,...} = (tn). 





Again, we can replace t and n with any other symbols: 


Example 690. Let ©; = 1, ©2 = 3, ©3 = 6, ©4 = 10, ©5 = 15, etc. Then we can also 
denote the (infinite) sequence of triangular numbers by 





(©1,©o,...) = (©.)e1 - (O,),-12,.. ~ (9s). ez ~ (Os) re(1,2,...} =(©,). 
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46.3. Arithmetic Combinations of Sequences 


In Ch. 20, we learnt to create arithmetic combinations of functions. Since sequences are 
functions, we can likewise create arithmetic combinations of sequences: 


Example 691. Suppose (a,) = (1,1,2,3,5,8, 13,21, 34,...) is the Fibonacci sequence; 
(b,) = (2,4, 6,8, 10, 12,14, 16,18,...) is the sequence of even numbers; and k = 10. Then 
(an + bn) = (3,5,8, 11, 15, 20, 27, 37,52,...), 
(Geb) = (5 = 1), 


(nbn) = (2,4, 12, 24,50, 96, 182, 336,612,...), 
() -($4555 5909S 
ie 
(kan) = (10, 10, 20, 30, 50, 80, 130, 210, 340,...), 
(kb,) = (20, 40, 60, 80, 100, 120, 140, 160, 180,... ). 
Exercise 200. Let (c,,) be the sequence of negative odd numbers, (d,,) the sequence of 


cube numbers, and k = 2. Write out the first five terms of (a) (c,); and (b) (d,). Then 
write out the first five terms of each of (c)—(h). (Answer on p. 1834.) 


(c) (en+dn) (d) (cen - bn) (e) (cndn) 


() (2) (g) (Ken) (h) (kd) 
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A7. Series 


Definition 118. Given a finite sequence (@n),<(1,9, 4+ its series is the expression 


ay + Ag+ Q34+°:'+ Ak. 
And the sum of this series is the number that equals the above expression. 


Example 692. Consider the finite sequence of the first five square numbers: 
(1,4,9,16,25). Its series is the expression 1+ 4+9+16+25, while the sum of this 
series is the number 55. 


Example 693. Consider the finite sequence of the first six even numbers: 
(2,4,6,8,10,12). Its series is the expression 2+4+6+8+10+12, while the sum of 
this series is the number 42. 





It may seem strange and unnecessary to distinguish between a series and its sum. Aren’t 
they exactly the same thing? 


It turns out that expressions like a; + ag +a3+---+a, play an important role in maths. And 
so, we want to reserve a special name for the expression itself, in order to distinguish it 
from the number that is the sum of the series. 


Example 694. Given the sequence (1,3,5,7), we might be specifically interested in the 
expression 1+3+65+7, rather than just the number 16. 


It is thus convenient to have separate names for them—vwe call the expression 1+3+5+7 
the series and the number 16 the sum (of the series). 
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47.1. Convergent and Divergent Series 


Definition 119. Given an (infinite) sequence (a,), its series is the expression 


aj; +Qo+agt... 





As we saw on the previous page, every finite series has a well-defined sum—simply add up 
all the numbers! 


With an infinite series, things get a little trickier. It may sometimes be that an infinite 
series diverges and its limit does not exist. 


Example 695. Let (1,1,1,1,1,...) be the (infinite) sequence that consists solely of 1s. 
Its series is the expression 1+1+1+1+1+... 


“Clearly”, this expression is not equal to any number. And so formally, we say that this 
series diverges and that its limit does not exist. 


Also, observe that this expression “grows ever larger”. As shorthand, we write?” 


ee co. 


Example 696. Let (2,4,6,8,10,...) be the sequence of (positive) even numbers. 


Its series is the expression 2+4+6+8+10+... 


“Clearly”, this expression is not equal to any number. And so formally, we say that this 
series diverges and that its limit does not exist. 


Also, observe that this expression “grows ever larger”. As shorthand, we write 


2+4+64+8+10+---=o0. 








36Pedantic point: this “equation” is not really an equation. Instead, it is merely shorthand for this 
informal statement: 


“The expression 1+1+1+1+1+... grows ever larger.” 


“Grows ever larger” is, in turn, a vague and informal phrase that we clarify only in Ch. 143 (Appendices). 
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We just looked at two examples of series that diverge. We now look at examples of series 
that converge to some limit: 


Example 697. Consider the zero sequence (0,0,0,0,0,...). 
Its series is the expression 0+0+0+0+0+... 


This series converges to 0. We call 0 its limit. And as shorthand, we write 
ows i 


Note that what was called the sum (in the previous context of finite series) is now called 
the limit (in the current context of infinite series). 


i 
Example 698. Consider the sequence ( .). 


1 PP wl 
+—+—-+——-+—+4... 
2 74°55 lo 32 
As we'll soon learn, it turns out that 
il 
2 





Its series is the expression 


ll 
+o—-+to—+—4+—+:-=1]. 
4 8 16 32 


That is, this series converges to 1. (And we call 1 the limit of this series.) 


Here we should remark that whenever we are dealing with infinite series, we must be very 
careful. Here the = sign in the above equation is not the usual one. Instead, the above 
equation is merely shorthand for 


nelle le 5 Pell 
+—+—+-——+—+... 
2 4°38 16 32 





“the expression converges to the number 1”. 


In A-Level Maths, an intuitive and informal understanding of the phrase converges 
to will suffice. But you should be aware that it does actually have a clear and precise 
meaning—see Ch. 143 (Appendices). 


Example 699. Consider the sequence of reciprocals of squares ( 
Pie ef 1 ee 
It series is the expression eo Lo eae 


It’s not at all obvious, but it turns out that 


That is, this series converges to 7/6. (And we call 7/6 the limit of this series.) 


The problem of finding the above limit is called the Basel Problem and was first solved 
by Euler (albeit not quite rigorously) in 1734. We’ll revisit it in Ch. XXX. 





Now, when does a series converge or diverge? Or equivalently, when does its limit exist? 
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The precise definitions of these terms are beyond the scope of A-Level Maths.°°’ You need 
only know—roughly and intuitively—what convergence, divergence, and limits are. 


It turns out that what exactly these terms mean is no simple matter. On the next page are 
two fun examples to give you a glimpse of the difficulties involved: 


Example 700. Consider Grandi’s series:°°° 
1-1+1-1+1-1+... 


Does this series converge or diverge? Or equivalently, does its limit exist? 


Remarkably, we can “prove” that this series is equal to 0, 1, and 1/2. 


e« To “prove” that it equals 0, pair off the terms: 
1-14+1-14+1-1+4---=(1-1)+(1-1)+(1-1)+---=04+0+0+4--- 
0 0 0 


e To “prove” that it equals 1, pair off the terms after the first: 


1-14+1-14+1-1+...=1+(-14+1)+(-14+1)+(-1+1)+... 
0 0 0 


=1+0+0+0+-:-=1. 
¢ To “prove” that it equals 1/2, lett S=1-1+1-1+1-1+..., then “show” that 1-S =S: 
1-S=1-(1-14+1-14+1-1+...)=1-1+1-1+1-1+---=8S. 
Since 1- S =S, simple algebra yields 1 = 2S or S = 1/2. 


We’ve just “proven” that the expression 1-1+1-—1+1-1+... equals 0, 1, and 1/2. 


Well, which is it then? It turns out that the series 1-1+1-—1+1-1+... is not equal to 
0, 1, or 1/2. Instead, it is divergent and its limit does not exist.°” 





Here’s a series whose convergence is unknown: 


Example 701. Consider this series: 


I 2 PAS OO Oe YS COS TY) 
-=+2-=4+—-—4+—-—+—-— +... 
Zoo ee oh Sa ly to. f 73 29 





The terms are fractions, with the numerators being the (positive) integers and the denom- 
inators being the prime numbers. ‘This series looks “simple” enough. But remarkably, 


mathematicians still do not know whether it converges or diverges!?! 








8°7Tf you're interested, see Ch. 143 (Appendices). 

3° The Italian priest-mathematician Luigi Guido Grandi (1671-1742) thought he had succeeded in proving 
that the sum of this series equalled both 0 and 1/2, and thus that “God could create the word out of 
nothing” (source). 

39° We prove this in Example 1577 (Appendices). 

3!0This problem is listed as equation (8) on this Wolfram Mathworld page and as Problem E7 in Richard 
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A8. Summation Notation > 


The symbol © is the upper-case Greek letter sigma (oc is lower case). 


An enlarged version of that symbol is » which we’ll read aloud as “sum”. We use the 


symbol >» to write down series more compactly, in what is called summation or sigma 
notation. 


B) 
Example 702. )\n? = 17 +27+37+4?+57=1+4+9+16425. 


n=1 


The variable n is the dummy, placeholder, or index variable. (As usual, we can replace 
all instances of n with any other symbol without changing the meaning of the above 
sentence. ) 


The integer below » is the starting point. So here, 1 tells us to start counting (the 
index variable n) from n = 1. 


The integer above is the stopping point. So here, 5 tells us to stop at n = 5. 
The expression to the right of De describes the nth term to be added up. So here, the 
nth term to be added up is n?. 


5 
Altogether then, yy n? tells us to add up the terms 1”, 27, 37, 47, and 5”. 


n=l 


3 
Example 703. >) (2n+ 3) = (2-1+3)+4(2-24+3)+(2-34+3) =5+74+9=21. 


n=l 


4 
Example 704. = 14+24+3+4=10. 
i 


6 
Example 705. S70 2n = 2-142-24+2-342-442-542-6=24+44+648410412 = 42. 


n=L 


7 
Example 706. )° 2” = 2'+27+2°+24+2?+2°+2'=2+4+8+16+32 +64 +128 = 254. 


n=1 


5 
Example 707. > 1=1+1+1+1+1=5. Here each term to be added up is simply the 


n=l 


5 
constant 1. And so, }° 1 is simply the sum of five 1s. 


n=l 


3 
Example 708. >> (10-2n) = (10- 2-1) +(10-2-2)+(10-2-3) =8+6+4. 


n=1 








Guy’s Unsolved Problems in Number Theory (3e, p. 316), where it is attributed to Paul Erdés. 
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gone Cee 
Example 709. )/—=7+5+3+ 


n=1 Tt 


cs 1 il 1 it 1 en ape ee 
Example 710. se 








22 s+ 5+ 5+ eee 
merce ly (ey ae Gay (ee) 4 2 6 25 


Example 711. Suppose x ¢ R. Then 


0 1 2 3 


=7 +a +a? +4 2493 + 4 


+e altactaert+ezi te. 








It’s nice to have 1 as the starting point and that’s what we’ll usually do. But there’s no 
reason why the starting point must always be 1. Examples: 


Example 712. Starting point 3: 


B) 
Yin? = 39 + 4° + 5% = 27+ 64 + 125 = 216. 
n=3 


Example 713. Starting point 0: 





Se en Ok tee er Samy cee) 
n=0 





The starting point can even be negative: 


Example 714. Starting point —2: 








z 2 —] 


Se eae 
+—+—4+—4+-—4-—=- 
AMES slog ed esl) fe) 2 





We'll follow this fairly standard (though not universal) convention—if the stopping point 
is smaller than the starting point, then the sum is simply 0: 


2 
Example 715. > = 
n=3 


5 
Example 716. > n? =0. 


n=-2 
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Exercise 201. Rewrite each series in summation notation. (Answer on p. 1835.) 


(a) 14+24+6+24+ 120+ 720+ 5040. 
(b) 24+54+84+11+144+17+ 20+ 23. 
(c) Yo+14+3/24+24+5/24+34 7/2. 

(d) 8+7+6+5+4+3. 


Exercise 202. Redo the last exercise, but with starting point 0. (Answer on p. 1835.) 


Exercise 203. Find the sum of each series. (Observe that here the dummy or index 
variables are not the usual n. Instead, they are 7, *, and 2.) (Answer on p. 1835.) 


33 


(a) d, (2-i). (b) >> (4% +5). (ce) G3). 


x=16 Lol 
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48.1. Summation Notation for Infinite Series 


Example 717. In the following series, the “n = 1” below the Sy symbol indicates that it 
has starting point 1. 





1 

oe 
The oo above the symbol indicates that there is no stopping point. This is thus an 
infinite series. 


As already mentioned and as we’ll soon learn, this series converges to 1. We may write 
oe 
foes PAL 


If it’s very clear from the context what the starting and stopping points are, then we'll 
sometimes be lazy/sloppy and omit them. And so here, we may also write 


Senda 


Example 718. “Clearly”, the series » t= So n=1+2+3+... diverges. 


(=i 


So, we may write sn = =o 


n=1 


ee Pe 


coma | il i ee | 
E le 719. Consid SS = eee 
xample onsi Te ee: i cas 


rt oat 
As mentioned in re 699, this series converges to 71/6. 
a: 


So, we may write Ys “YS =a 


n=l 1 
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Example 720. Consider 3 nx’ = 2 a= ar 4 ee, 


n=1 


This infinite series has starting point 1 and each term to be added up is nz”. 


Thus, yy nz? is the sum of infinitely many terms, namely 2”, 2x”, 327, 4a? ... 
n=1 


By the way, this series diverges for all x + 0. That is, for all x + 0, we have 


co 
Dy ve = es = OO. 
n=l 


And “clearly”, if x = 0, then this series converges to 0: 


yn =¥n-0?=0. 
n=1 
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Example 721. Consider the harmonic series: 


We have 





Sing SH 
at 


n=l 


1 
n 


300 4 
Do 0 282, 


n=l 


10° 4 
S —=12.090.... 
n 


n=1 


Does the harmonic series converge or diverge? From the above, it’s not obvious. 


It turns out that it diverges. Here’s a heuristic (i.e. not totally rigorous) “proof”: 


Loe 


1 
First, consider the series 1 + 5 + 5 + 5 +...—"“clearly”, this series diverges. 





We’ll show below that this series is “smaller than” the harmonic series.*!! and hence, 


“clearly”, the harmonic series must also diverge: 


+ —+—+—+—+—+ neha 
io 16 16 16 916 16 16. 16 
EE 
1/2 


ee UR ie a ae les ae 
~ + —+4+— 4+ —+—+—+—+— + 
O- 10 cb viz 18 14s 16 











311 Again, here we must be careful to define what we mean by one infinite series being “smaller than” 


another. 
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As before, it’s nice to have 1 as the starting point, but this need not always be so: 


=o) gee | 
Example 722. Consider, —=—+—-4+— 
2, 2 oY 


By the way, since > 1/2” = 1, it “clearly” follows that 


n=l 


1 1 


Example 723. 3 ie ces 
n — 


=z 


+ 


By the way, since the harmonic series diverges—i.e. » 1/n = co—we have 


That is, since the harmonic series diverges, this series also diverges. 


Exercise 204. Rewrite each series in summation notation. (Answer on p. 1835.) 


(a) 14+24+6+24+ 120+ 720+5040+.... 
(b) 24+54+84+11+144+17+20+23+... 
(c) Yo+1+3/24+24+5/24+34+7ot+... 

(d) 8+7+6+5+4+3+... 


Exercise 205. Redo the last exercise, but with starting point 0. (Answer on p. 1835.) 
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49. Arithmetic Sequences and Series 


Definition 120. An arithmetic sequence (or progression) is a sequence where the differ- 
ence between any two consecutive terms is constant (and called the common difference). 


Example 724. Below are six arithmetic sequences—three finite (left) and three infi- 
nite (right). 


(One dyin), (an) = (1,3,5,7,9,-.-), 
(Bn) ja = (4,7, 10, 13, 16, 19, 22) (b,,) = (4,7, 10, 13,16, 19, 22,...) 
(cn)>_, = (0,7, 272) (cn) = (0,70, 270,...) 


In each sequence, the difference between any two consecutive terms is a constant. 
In Cae and (a,,), the common difference is d = 2. 
In Ge and (b,), the common difference is d = 3. 


In (ee and (c,), the common difference is d = 71. 
Definition 121. Given an arithmetic sequence, its series is called an arithmetic series. 


Example 725. The six arithmetic sequences in the above example have these corre- 
sponding arithmetic series: 


14+3+5+7+9, 14+3+5+7+9+..., 
4+7+104+13+16+19 + 22, 4+74+104+13+16+194+22+..., 
O+7m+4 271, O+m+270+... 


Fact 109. /f Gas is a finite arithmetic sequence with d = az - a1, then 


(a) The nth term is An = a,+(n-1)d; 
a ak — ay 


(b) The number of terms is k 7 


+ 1; 


(c) 





Proof. (a) Since ay is (n-1) terms “after” a;, we must have a, =a, +(n-1)d. 
(b) By (a), a, =a, +(k-1)d. Rearranging, k = (ax, - a) /d+1. 


(c) Immediate from (a). 
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A9.1. Finite Arithmetic Series 


q3l2 


Example 726. You’ve probably heard the apocryphal story where a seven-year-ol 
Gauss added up the numbers from 1 to 100 in an instant. His trick was to pair the first 
number with the last, the second with the second last, etc., then multiply: % 


50 pairs 
ET , 
1+2+3+4+---+100 = (1+ 100) + (2+ 99) + (3+ 98) +---+ (50451) 
101 101 101 101 


= 101 x 50 = 5050. 





More generally, the sum of a finite arithmetic series is 


Number of Terms 


(First Term + Last Term) x ; 


A bit more formally, 


Fact 110. Suppose Coe is a finite arithmetic sequence. Then 


i k 
» An = (A, + Gp) =. 
n=l 2 





Our proof of this formula is simply a formalisation of Gauss’s apocryphal idea: 


Proof. First, suppose k is even. Then 
a, + dg +++: + ap = (A, + ap) + (a2 + ap_1) +20 + (axy2 + Ayj241)- 
Note that RHS has k/2 pairs of terms. 


Next, Q1 + Ap = Ag + Ap_1 = 03 + Ag_3 = +++ = Apso + psa. 


k 
So, S) Gn = (a, + ax) J 


n=l 


Next, suppose k is odd. ‘Then az_1 = az -—d 2 Ak - (| and 


Yan = Sidy ay 2 (0 +05 4) : tan? [ay +a - (EE) + 
n=1 n=1 2 | 9 
1 a, + Ak 
+ 


2 











1 a,-a 
+ 
2 

















_ ha 
= (ay +04) “> + az = (a, + ag) , = (ay +04) % 





312Or eight- or nine- or ten-year-old. Brian Hayes has collected 145 “Versions of the Gauss Schoolroom 
Anecdote”—“the earliest such instance I have found” is “a 1906 pamphlet authored by Franz Mathé”. 
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Example 727. Consider the arithmetic sequence Cole =i, linet, oj2 65800) 
The first and last terms are a; = 7 and a; = 837. The common difference is 10. 

So, by Fact 109(b), the total number of terms is k = (837 — 7) /10+ 1 = 83+1 = 84. 
And now by Fact 110, 


k 84 
An = (a1 + Gz) a (7 + 837) Die 35 448. 


n=1 


Example 728. Consider the arithmetic sequence aye S(O aoe). 
The first and last terms are b; = 1 and b; = 393. The common difference is 4. 

So, by Fact 109(b), the total number of terms is & = (393-1) /4+1=98+1=99. 
And now by Fact 110, 


k 99 
D bn = (bi + Be) 5 = (1+ 393) > = 19508. 
n=l 





Here’s another formula for the sum of a finite arithmetic series: 


Corollary 19. Suppose ais Von is a finite arithmetic sequence with d=a2-a,. Then 


k 

-—1 
> Qn = ka, + WGI 
n=1 2 





Proof. Use Fact 110, then Fact 109: 





k ie i) 


Example 729. XXX 
Example 730. XXX 


Exercise 206. Rewrite each series in summation notation, then compute its sum. 


(a) 2+74+124+17+22+27+32+---+997. 











D, On = (a1 + ax) 5 = [a +01 + (k- 1) d] 5 = kay + 
n=1 


(b) 3+204+37+54+ 71+---+1703. 
(c) 81+89+97+ 105+ 113+--- +8081. (Answer on p. 1836.) 
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A9.2. Infinite Arithmetic Series 


“Clearly”, the infinite arithmetic series 4+ 7+10+13+16+... does not converge. And 


more generally, 

















Proof. See p. 1614 (Appendices). 
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50. Geometric Sequences and Series 


Definition 122. A geometric sequence (or progression) is a sequence where the ratio 
between any two consecutive terms is constant (and called the common ratio). 


Example 731. Here are six geometric sequences, three finite (left) and three infinite 
(right): 


(an), = (,2,4,8, 16), (an) = (1,2,4,8,16,...), 
(es ae eae | 1 

bn), = 

(On )ret (1 pe ie wa 


1) 5) 48" 16? 32° 64” ) 
(cn)2. = (7,7, Tr), (a) Cam, 2.) 

In each sequence, the ratio between any two consecutive terms is a constant. 

In (ae and (a,,), the common ratio is r = 2. 

In Cae and (b,,), the common ratio is r = 1/2. 

In Coe and (c,), the common ratio is r = 7. 

Definition 123. Given a geometric sequence, its series is called a geometric series. 


Example 732. In the above example, the six geometric sequences have these correspond- 
ing geometric series: 


14+2+4+8+16, 14+24+44+8+16+..., 
1 1 ih ages [Et gs | 1 1 
SS a 1l+=+-+-+—+—+—+ 
16 32 64 2 4 8 16 32 64 


Te Tae me (2 ace OT ek 


Fact 112. /f (aes is a finite geometric sequence with common ratio r = ag/a,, then 


(a) The nth term is Gy aware 


(b) The number of terms is k = log, (ax/ai) + 1; 


(c) om = 3 (ar), 


n=1 





Proof. (a) Since a, is (n-1) terms “after” a,, we must have a, = a,r”"1. 
? 


(b) By (a), a, = ayr*, or az/a, =r", or log, (az/a,) =k -1, or k = log, (ag/a,) + 1. 











(c) Immediate from (a). 
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50.1. Finite Geometric Sequences and Series 


The sum of a finite geometric series is 


1—- Common RatioNt™>* % Te™s 





First ‘Term x 
1—Common Ratio 


A bit more formally, 


Fact 113. Leta,r¢R with r#1. If S=at+ar+ar?+ar>+---+ar*", then 





Proof. (1-r)S = (a tartar? +are tet ar®!)—-r (a tartar? +are+e.4 ar**) = a-ar 














Rearranging, S =a(1-r*)/(1-7r). 


Example 733. Consider the geometric series 1 + 2+ 27 +2? +---+270, 


The first term is 1, the common ratio is 2, and there are 21 terms. So, 


2 3 20 ie erel 
1+ 242428 4..49% = 1 = = 2007 152-1 = 2097 151. 





il il il 
Example 734. Consider the geometric series 1 + = +—=+—44+:-:-+—. 
7 ie sa pes) 


The first term is 1, the common ratio is 1/2, and there are 21 terms. So, 


1 ,1-0.57! 1-0.5% 221-1 2097151 


73-05 0.5 220 ~ 1048576" 
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Here’s a second formula for the sum of a finite geometric series: 


First Term — Common Ratio x Last Term 
1 —Common Ratio 


A bit more formally, 


Corollary 20. /f (an)i_4 is a finite geometric sequence with r = aj/a,, then 


5 a, — Tay 
Gy = : 
n=l 1 pool 








Proof. From Fact 113, S =a (1 — r”) /(1-r). Now, simply plug in ar*® = rar®! = rag. 











Example 735. Consider the geometric series 1+2+4+8+16+---+1024. 
The first and last terms are 1 and 1024, and the common ratio is r = 2. So, 


1-2-1024 1-2048 
= a ae 


14+2+4+8+4+16+---+1024= =2()47 


Example 736. Consider the geometric series 4+ 12 + 36 + 108 +--- +8748. 


The first and last terms are 4 and 8748, and the common ratio is 3. So, 


4-3-8748 4-26244 
3 qe 


4+12+ 364+ 108+---+8748 = = 13120 


Exercise 207. Rewrite each series in summation notation, then compute its sum. 


(a) 7+144+28+56+ 112 + 224 + 448 + 896. 
(b) 20+104+5 + 5/2 + 5/4 + 5/s, 
(c) 14 1/3 + Yo + loz + Ye + Maas. (Answers on p. 1836.) 
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50.2. Infinite Geometric Sequences and Series 


Perhaps surprisingly, an infinite geometric series converges if |r| < 1. Moreover, there’s a 
nice and simple formula for the limit: 


First Term 





1 —Common Ratio’ 


A bit more formally, 


Fact 114. LetaeR. If |r| <1, then 


atartar?t+are+-.-= 





127 





Proof. See p. 1614 (Appendices). 











If instead |r| >1, then the infinite geometric series diverges (for a # 0): 


Fact 115. Leta#0. If|r|>1, thena+ar+ar?+ar°+... diverges. 








Proof. See p. 1615 (Appendices). 











You’ve probably seen this “trick”: 


Dit Ree sel! 
E. l : =l+—+—-4+-—+—+.... Th 
xample 737. Suppose S VG a + i6 + en 


if oi eet ae 
NS 


2: So, $-25=(1+5+7 454 : 
2 43 


16 





+..)-(Q414 545454...) ot 8-2, 
2 4 8 
3. Hence, S = 2. 





Following this last example, we can write down this “proof” of Fact 114: 

















“Proof”. Suppose S =a+ar+ar?+are+.... Then 
1. rS=artar?+ari+ar*+... 
2. So, S-rS =(a+ar+ar?+ar?+...)-(artar?+ar+art+...) or (1-r)S =a. 
a 
3. Hence, S = ; 
l-r 


Unfortunately, the above “proof” is flawed. To see why, try it for r = 2: 


Example 738. Suppose S=1+2+4+8+16+.... Then 
1. 25=24+4+8+16+32+... 


2. So, S-2S5 =(14+2+4+8+16+...)-(2+4+8+4+16+32+...) or -S=-1. 
o- Hence, 6 = 1, 
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In the above example, “S = 1” is clearly wrong. But we used the exact same three-step 
procedure as in the previous example. So, what went wrong? Why did the procedure work 
in the previous example but not in this example? 


The quick answer is that here S' is divergent, so that the algebraic manipulations in the 
first two steps are illegitimate (in contrast, they were legitimate in the previous example 
where S' was convergent). For a more detailed answer, see Ch. 146.7 (Appendices). 


Exercise 208. Rewrite each series in summation notation, then compute its sum. 


(a) 64924 27/s+.... 


(b) 20+10+5+... 
(c) 1+1¥/3+4¥or+... (Answers on p. 1836.) 
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51. Rules of Summation Notation 


Fact 116. Suppose an,bn ¢ R for all n, ce R, and k,l¢ Z*. Then*® 


(a) Constant Rule: 


k 
(b) Constant Factor Rule: 2, ( (cae » Gn: 


(c) Sum Rule: > (an + by) = 2 Qn, + » ae 
=I 


k; 


k 
(d) Difference Rule: iG — bn) = » i 


k+l k+l 
(e) Breakup Rule: » Gne= 2 Qn, + > ae 


n=k+1 


k+l I 
(f) Change Start & Stop: Y = eee 


n=k+1 





k k times 














—a 
Proof. (a) Yic=etet-+c=ck. 
n=1 
k 
(b) YS) (can) = cay + Cag + +++ + Cap = C(Q1 + a2 +++ + AK) = CD An. 
n=1 
k 
(c) >, (Gn = (a, + b,) + (a + bo) +--+ + (ax + dg) 
n=1 
= (a, + ay +-*++.a_) + (by + bp +++ + Dg) = Ysa + Yb 
n=1 
k 
(d) S> (Gn — bn) = (a1 - b1) + (ag — bz) + +++ + (az — De) 
te k k 
= (@) + dg +--+ + ag) — (by + bg +--+ +04) = DY Gn - DY) bn. 
n=1 n=1 
k+l 
(e) Sdn = Ay + AQ $61 + Og + Aker + Apeg too + Anal 
— k+l 
= (@) +9 +--+ + GR) + (Ges + Oheg $27 + Ghat) = Doant Do an. 
n=l n=k+1 
k+l 
(f) S) Gn = ket + Ok42 + Okeg to0 + +O = D> Okan: 
n=k+1 





313\More of such identities here: Sg. 
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52. The Method of Differences 


il 1 il 1 
Example 739. Consider ——~ af 


+ —— + —— +--+ + ——___, 
Lx 2% 3 8 4 1000 x 1001 


Finding the sum of this series is easy using partial fractions. First, rewrite into sum- 
mation notation: 


1 1 1 1 oly 1 
e + oe: = } ——_.. 
22K 34 1000x1001 4 n(n+1) 








Next, decompose the nth term into partial fractions: 


il _A, Ato) om ee ee 
mii nn nel” “gamely | “wG@el) 


Comparing coefficients, A=1 and A+ B=0. So, B=-1. 


1 1 if 


H alk TAL es 
ti min+l) n n+l 


1 
* T0900 x 1001 


1 1 i i 
+ —eee — —m— + 
A 1000 1000 1001 





In the second line, every term with denominator 2 through 1000 is happily cancelled out. 
Your syllabus calls this process of mass cancellation the method of differences. (Some 
writers instead call this telescoping.)*"4 


More generally, 


x 1 1 1 I i 1 
> = + + eee ——__ = ] — —_ 
Fa i) 1D 2 a BA k(k+1) k+1 





We can also show that the corresponding infinite series converges: 





1 1 1 TT ee el ; 
is + tees lim YY = lim (1 - le 
lea 2 Sor ko M4 n(nt+1) ko k+1 








314ProofWiki says that this “arises from the obvious physical analogy with the folding up of a telescope”. 
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1 1 1 1 
E le 740. ee 
xample 740. Consider ee ee ee I 


1000 1 
First rewrite into summation notation: —_____—... 
d, n(n+1)(n+2) 


n=l 
Next, decompose the nth term into partial fractions: 


1 ne B A CG _ A(n+1)(n+2) + Bn(n+2)+Cn(n+1) 
n(nt+1)(n+2) n n+l n+2 n(nt+1)(n+2) 








AO ny ene ie 2a 
- n(n+1)(n+2) 


Comparing coefficients, A+ B+C 2 0, 3A+2B+C 2 0, and 2A=1 or A=0.5. 
Take 2 minus = to get 2A+ B=0or B=-1. And now = yields C' = 0.5. 


Altogether Lee ee et n 0.5 
oat CANG ene cae 


1 1 1 1 
tt tHE hrvdH FO 
teoxs 2xeaxt gxax5 1000 x 1001 x 1002 


Oc 1 Ose 0 91 Os Oy 1 05 0.5 1 0.5 
= —=+ ~ —=+ a ——+ free t ~ + ; 
| ee ae: 2 a A o 4° »D 1000 1001 1002 


And so, 








Observe that the three terms with denominator 3 cancel out. And the same will happen 
to all terms with denominator 4 through 1000. 


This leaves only terms with denominators 1, 2, 1001, and 1002: 


a 1 05-105 905 1 0.5 
> = = ac 7 + 
44 n(n+1)(n+2) 1 2 2 1001 1001 1002 





~ 0.5 2 0.5 ne 1 0.249... 
4 1001 1002 4 2-1001-1002 








More generally, 


k 1 (Cem Mass press O5 1 il il 
a CRSNICES) Ba Se Se % 
an(n+1)(n+2) 1 2 2 k+l k+2 4 2(k+1) 2(k+2) 


We can also show that the corresponding infinite series converges: 


1 P 1 n 1 ii 1 1 f i 1 
es Ae ee ne | 
2 x3 234 n(n+1)(n+2) keol4 2(k+1) 2(k+2)) 4 
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Above we used partial fractions. We will next use surd rationalisation. 


i il 1 il 
SS 
lg ey ey eee any ey | /9999 + \/10000 


Again, first rewrite into summation notation: 


Example 741. Consider 








1 . i 1 ron 1 a 1 
Wt FI 8 8 A /9999+./10000 “4 \/nt+Vn+1 








Next, rationalise the surds in the denominator of the nth term: 


if 1 Yn-vnt+1_ JYn-vatl 
J/n+J/n+1 | i nel aa ~ Woe) 








Jit V2. J2+V3 V3+V4. 9999 + /10 000 
= /2-V14+V3-V2+V4-vV3+---+V10000— V9999. 


The red terms cancel out. Likewise with the blue. This leaves 


9 999 i] 


> n+V/n+1 





= /10000- V1 = 100-1 = 99. 


More generally, 


1 1 1 
+ ——— + ———_{ + --- + ——___ = 
iy Nr Vie nee Svirn VO ee ee ey i JSk + il 
“Clearly”, the corresponding infinite series diverges: 


fore) k ‘| 
= ling ——__ 
Le ia J/n+V/n4+1 ee Aen 
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The next example again uses partial fractions and now also the formula for the sum of 
an arithmetic series: 


a i i 
E le 742. Consid = . 
SNe ee as 


Again, first rewrite into summation notation: 








il 1 i] 1 
—+ + ————— _ $e + —______—_ = 
1 1+2 1+2+3 14+24+3+4+---+1000 


Next, use the formula for the sum of an arithmetic series to rewrite the nth term: 


_n(n+1) 
2 





1l+---+n 


Now, decompose into partial fractions: 


Altogether, 


1 000 (| 


\) ——— ==+ + ——— 4 --- + — 
lee ee lee feo + 1000 


t= 


ee e000 


1 1001 1001° 


More generally, 


& 1 1 sal 1 
\) ———- = =+ + ————— $+ + ——________ 
ae ee eee 14+24+3+---+k 


And hence, the sum of the corresponding infinite series converges to 2: 


—+ a 
aS es k-+00 $4 L+-+:+1  k00 


it 1 it k il ( 





562, Contents www.EconsPhDTutor.com 


Example 743. We can also use the method of differences to find the sum of squares. 


De Gece) Cises)) 
7 ean, 


ae 


k 
We'll prove 1742743? 4+---+h? = Sn? 
i=l 


First, observe that (n+1)° -n? =3n?+3n+41. 


DOs 


a 


2 (k+1)?-13 = k3 + 3k? + 3k. 


i 
Plug + into 2: 2 SEED | p84 388+ 3b. 


_ k8 + 3k24+ 3k -3k(k+1)/2-k 
. 3 

Le Ee eee Re ASSO Aine) 
. a es ee ee 


Rearranging, 


Exercise 211. Rewrite each series in summation notation and find its sum. Next, write 
down its sum in the case where the series has k terms instead. Finally, determine if the 
corresponding infinite series converges. If it does, find its limit. | (Answer on p. 1838.) 


(a) + + a + a 1 a feet : ; (Hint in footnote.)*” 
3 8 15 24 35 Deere ey) 
i Z 
lg 5 +le 5 + le" ++-+]o aes (lg is the base-10 log.) 


1 1 1 
a a, ‘ 
V1? 370 8 100\/99 + 99,/100 


es 100 (Hint in footnote.)?"” 





(Hint in footnote.)*'® 














315 Hint: Think about the square numbers. 
316Do the surd rationalisation. Then persevere with the algebra and things will work out nicely. 
317Consider (n + 1 —n‘ and mimic the last example (be warned that the algebra will be more painful). 
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Part III. 
Vectors 
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The cultural problem is a self-perpetuating monster: students learn about math 
from their teachers, and teachers learn about it from their teachers, so this lack 
of understanding and appreciation for mathematics in our culture replicates itself 
indefinitely. Worse, the perpetuation of this “pseudo-mathematics,” this emphasis 
on the accurate yet mindless manipulation of symbols, creates its own culture and 
its own set of values. Those who have become adept at it derive a great deal of 
self-esteem from their success. The last thing they want to hear is that math is 
really about raw creativity and aesthetic sensitivity. Many a graduate student has 
come to grief when they discover, after a decade of being told they were “good at 
math,” that in fact they have no real mathematical talent and are just very good 
at following directions. Math is not about following directions, it’s about making 
new directions. 


— Paul Lockhart (2002, 2009). 


a teacher of mathematics has a great opportunity. If he fills his allotted time with 
drilling his students in routine operations he kills their interest, hampers their 
intellectual development, and misuses his opportunity. But if he challenges the 
curtosity of his students by setting them problems proportionate to their knowledge, 
and helps them to solve their problems with stimulating questions, he may give them 
a taste for, and some means of, independent thinking. 


— George Polya (1945). 
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53. Introduction to Vectors 


The Latin word vector means carrier. You may recall from biology that mosquitoes 
are vectors, because they carry diseases to humans. Similarly, in mathematics, a vector 
“carries” us from one point to another. 


Example 744. Let A = (-1,-3) and B = (2,1) be points. Then AB = (3,4) is the 
vector that “carries” us 3 units east and 4 units north from the point A to the point B. 


The vector AB has tail A and head B—remember, a vector goes from tail to head. 
(Just remember: arrowhead—the arrowhead is where the vector’s head is.) 


aN 


D=(2,4) 


F = (0.5, 2) 





eres) 





Let C =(-1,0), D= (2,4), EF =(4,0), and F = (0.5, 2) be points. 


The vector CD = (3,4) “carries” us 3 units east and 4 units north from the tail C' to the 
head D. 


The vector CE = (5,0) “carries” us 5 units east and 0 units north from the tail C to the 
head £. 


The vector CF = (1.5,2) “carries” us 1.5 units east and 2 units north from the tail C' to 
the head F’. 
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Formally, a vector in 2D space is an ordered pair of real numbers: 


Definition 124. Given the points A = (a1,a2) and B = (b;,b2), the vector from A to B, 
denoted AB, is this ordered pair of real numbers: 


AB) 





(Later on when we look at three-dimensional (3D) space, vectors will instead be ordered 
triples of real numbers. ) 


A vector is often contrasted with a scalar, which is simply any real number: 


Definition 125. A scalar is any real number. 


We now contrast a vector, a scalar, and a point: 


A vector is a two-dimensional (2D) mathematical object with the properties of 
magnitude (or length) and direction. 


(A line is also a 2D object with magnitude or length, but not direction.) 


In contrast, a scalar is a one-dimensional object with only the property of 
magnitude. 


And a point is a zero-dimensional object (with neither magnitude nor direction). 


Example 745. You may recall from physics that velocity is a vector quantity, while 
speed is a scalar quantity. In particular, speed is the magnitude of velocity. 





We'll have more to say about this in Ch. 58. 
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53.1. The Magnitude or Length of a Vector 
The magnitude or length of a vector is simply given by Pythagoras’ Theorem: 


Definition 126. Given the vector (p,q), its magnitude (or length), denoted |(p,q)|, is 
this number: 


l(p, D| = Vp? + @. 


Example 746. The magnitude or length of the vector AB = (3,4) is 


[AB] = |(3,4)| = 42 + (-3)" = 5. 


uN 
y D = (2,4) 





A=(-1,-3) 





Similarly, the magnitudes of Gi = G-. CE = (5,0), and Gir = (1.5,2) are 


ICDI= |(3,4)| = 42 + (-3)"=5, 
CEI |(5,0)| = V52+0?  =5, 
CF |= |(1.5, 2)[- V1.5? +2? = 2.5, 





568, Contents www.EconsPhDTutor.com 


53.2. When Are Two Vectors Identical? 


Informally, two vectors (p,q) and (r,s) are identical or equal if and only if they have the 
same direction and magnitude. Formally, 


(p.q)=(7,5) <> peHr,q=s. 


Example 747. Consider the 
vectors AB = (3,4) and CD = 
(3,4). Both have length 5. 
Also, both point in the 
same direction.*"® 

Informally, AB = CD because 


AB and CD have the same 
length and direction. 


A little more formally, AB = 
C'D because 3 = 3 and 4=4. 


Note that when determining 
whether two vectors are iden- 
tical, their tail and head do 
not matter. In the above ex- 
> > 

ample, AB = C'D even though 
they don’t have the same tail 
or head. 








Example 748. The vectors CD = (3,4) and CF = (1.5,2) point in the same direction. 
However, CD - CF because they have different lengths—|C'D| = 5, while CF =, 


More formally, CD +CF because (3,4) (lho, 2): 


Example 749. Each of AB = (202). Cp= (3,4), and C CE = (5,0) 0) has length 5. However, 
CE “obviously” points in a different direction from AB and CD. 


So, CE+AB and CE+#CD. 








318Below, Definition 137 will formally define what it means for two vectors to “point in the same direction”. 
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53.3. A Vector and a Point Are Different Things 


Here’s another important point. Although a vector and a point can each be described by 
an ordered pair of real numbers, they are entirely different mathematical objects. 


To repeat, a vector is a two-dimensional object possessing the properties of length 
and direction. In contrast, a point is a zero-dimensional object, with neither length 
nor direction. Example: 


Example 750. Let A = (-1,-3), B = (2,1), and G = (3,4) be points. 


Then AB = (3,4) is the vector that carries us 3 units east and 4 units north from A to 
B. It is a two-dimensional object endowed with the properties of length and direction. 
In contrast, the point G = (3,4), although also described by an ordered pair of real 
numbers, is a zero-dimensional object, with neither length nor direction. 


aN 
: @ C= (3,4) 





ies) 





Again, AB = (3,4) is a vector, while G = (3,4) is a point. They are completely different 
mathematical objects. Don’t confuse them. 





So far this textbook has used the notation (p,q) to mean three entirely different things: 


(a) The set of real numbers between p and q (excluding p and q); 
(b) The point with x-coordinate p and y-coordinate q; or 


(c) The vector that “carries” us p units east and q units north. 


As discussed in Remark 26, this could be confusing in theory, but shouldn’t be in practice. 
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53.4. Two More Ways to Denote a Vector 


The vector (p,q) can also be written in two other ways. First, we can write it “vertically”: 


wn-(") 
q 


Example 751. The vectors AB = (3, 4), CD = (3,4), CE = (5,0), Cin = (1.5,2) may 
also be written 


(2) (2) (2) (2) 


As we'll see shortly, we’ll be doing a lot of addition and multiplication with vectors. And 
so, this “vertical” notation for vectors is very useful, because it literally helps us see better. 
But in print, I’ll often prefer using the (a,b) notation, simply because it takes up less space. 





Second, we can denote a vector by a single, lower-case, bold-font letter: 


(p,q) =u. 


Example 752. The vectors AB = (3,4), Gp = (3.4), Cre (5,0); CF = (15,2) may 
also be written 


AB=a, CD=b, Ch=c, CF =d. 


(The choice of letters is somewhat arbitrary. For an obvious reason, v is a favourite.) 





We'll often use the bold-font letter notation in print. However, it’s hard to hand-write in 
bold font, so you can write W@ and V in place of u and v. 


Exercise 212. Let A = (-1,-3), B = (2,1), and G = (3,4) be points. 


Consider the five vectors AG, BA, BG, GA, GB. Write down each in three different 
ways. What is each vector’s tail, head, and length? How many units does each vector 
carry us in the z- and y-directions? (Answer on p. 1841.) 


Exercise 213. Provide a counterexample to show that the following is not always true: 


ju + v| = fu] + |v]. (Answer on p. 1841.) 
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53.5. Position Vectors 


Given a point A, its position vector OA is simply the vector that carries us from the 
origin O to the point A: 


—_> 
Definition 127. Given a point A, its position vector is the vector OA. 





So, given a point A = (aj, a2), its position vector is 


Th=a= (oxen) ( } 


a2 


Example 753. The points A = (-1,-3), B = (2,1), and C = (-1,0) have position vectors 


0 


OA=a= 1-2)=( - ) OB =»=(2.)( ; ) and OC =e= (1,0) = ( = } 
Again, take care not to confuse a point with its position vector. Although A and OA 
may both be denoted by (-1,-3), they are different mathematical objects—the former is 
a point while the latter is a vector. 





53.6. The Zero Vector 


Informally, the zero vector is the vector that carries us nowhere. Formally, 


Definition 128. The zero vector, denoted 0, is the origin’s position vector. 





And so, the zero vector is 0=00 = (0,0)< ( ; } 


¢ Once again, do not confuse the point O = (0,0) with its position vector 0. 


e Given any point P, the vector that carries us from P to P is the vector carries us precisely 
—> 
nowhere. Hence, PP = 0. 


The following result says that every vector has non-negative length. And moreover, the 
only vector with length 0 is the zero vector: 


Fact 117. Suppose v is a vector. Then |v|>0. Moreover, |v|=0 <> v=0. 




















Proof. Let v = (v1,v2). Then |v| = \/v? + v3 > 0 and |v|=0 <> v= (0,0) =0.°° 





319For a proof of this result in the general n-dimensional case, see p. 1618 (Appendices). 
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53.7. Displacement Vectors 


Definition 129. Suppose a moving object starts at point A and ends at point B. Then 


a= . . 
we call AB its displacement vector. 





So, if a moving object starts at A = (a,,a2) and ends at B = (b;,b2), then regardless of the 
path taken by the object, we say that its displacement vector is 


AB = (by — 1, bo — ay). 


Example 754. A particle starts at the 

point A = (-1,0), travels rightwards along 

the black arc, and comes to a stop at the 

point B = (2,3). So, this particle’s dis- 
—> 

placement vector is AB = (3,3). 








Example 755. A particle starts 
at the point A = (1,0), trav- 
els anti-clockwise around the unit 
circle centred on the origin, com- 
pletes one full circle, and comes 
to a stop at the point A. So, this 
particle’s displacement vector is 
AA = (0,0) = O (not depicted in 
figure). 
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53.8. Sum and Difference of Points and Vectors 


In this subchapter, we’ll learn these four things: % Aaye Pp 


1. Point + Point = Undefined 
2. Point —- Point = Vector 

3. Point + Vector = Point 

4. Point - Vector = Point 


1. Point + Point = Undefined 





If A and B are points, then there is no such thing as A+ B. 


Example 756. Let A = (1,2) and B = (5,0) be points. Then the sum A+ B is undefined. 
It makes no sense to talk about the sum of two points. 





The analogy is to points or locations in the real world: 


Example 757. Consider Athens and Berlin, two points or locations. The sum Athens + 
Berlin is undefined. It makes no sense to talk about the sum of two points or locations. 





2. Point — Point = Vector. 


Definition 130. Given two points A and B, the difference B — A is the vector AB. 


And so, given the points A = (a,,a2) and B = (bj, b2), their difference is 
Be Ae AR = (b, — ay, bo - ag). 
Example 758. Let A= (1,2) and B = (5,0). Then B- A is the vector AB: 
B= A248 2 (50) 2(12)= 4 
Similarly, the difference A - B is defined to be the vector BA: 
ASB= BA (2) 26 0) 02). 





We can continue with the same Athens-Berlin analogy: 
Example 759. The vector “Berlin — Athens” is the journey from Athens to Berlin: 


Berlin - Athens = (—500, 900). 


That is, the journey from Athens to Berlin carries us 500km west and 900 km north.?” 


Similarly, the vector “Athens - Berlin” is the reverse journey from Berlin to Athens: 


Athens - Berlin = (500, -900) . 


That is, the journey from Athens to Berlin carries us 500km east and 900 km south. 








320T made up these numbers. The actual journey is longer (Google Maps). 
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3. Point + Vector = Point. 


Definition 131. Given a point A = (a,,a2) and a vector v = (v1, v2), their sum A+v is 
this point: 


A+v = (a, +0 1,02 +02). 





Equivalently, if the vector v’s tail is at the point A, then its head is at the point A+v. 
Example 760. Let A = (1,2) and v= (4,4). Then their sum is the point (5,6): 
A+v= (1,2) + (4,4) = (5,6). 
Example 761. If v = (—500,900), then 


Athens + v = Athens + (—500, 900) = Berlin. 





Starting from Athens, travelling 500 km west and 900km north brings us to Berlin. 


4. Point — Vector = Point. 


Definition 132. Given a point B = (b,,b2) and a vector v = (v1, v2), their difference 
B-v is this point: 


B-v = (bj — v,b2- v2). 





Equivalently, if the vector v’s head is at the point A, then its tail is at the point A-v. 
Example 762. Let A= (1,2) and v = (4,4). Then their difference is this point: 
A-v = (1,2) - (4,4) = (-3,-2). 
Example 763. If v = (—500,900), then 
Berlin - v = Berlin - (—500, 900) = Athens. 


If we end up in Berlin after travelling 500km west and 900km north, then we must have 
started in Athens. 


Exercise 214. Consider the vector (4,-3). (Answer on p. 1841.) 


(a) Ifits tail is (0,0), then what is its head? 
(b) If its head is (0,0), then what is its tail? 
(c) Ifits tail is (5,2), then what is its head? 
(d) If its head is (5,2), then what is its tail? 
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53.9. Sum, Additive Inverse, and Difference of Vectors 


In this subchapter, we’ll learn these three things: 


1. Vector + Vector = Vector. 
2. — Vector = Vector. 
3. Vector — Vector = Vector. 


1. Vector + Vector = Vector. 


Definition 133. The sum of two vectors u = (uj,u2) and v = (v1,v2) is denoted u+v 


and is the vector u+ v = (uj + U1, U2 + V2). 





Place v’s tail at u’s head. Then u+v is the vector from u’s tail to v’s head: 


utv 





Informally, the vector —u is the vector u flipped in the opposite direction. Formally, 
Definition 134. The additive inverse of the vector u = (uj, U2) is this vector: 
—u = (-u1, -U2). 


Example 765. The additive inverses of u = (-1,3) and v = (4,4) are 





Jon 
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3. Vector — Vector = Vector. 


Definition 135. Given two vectors u and v, their difference u-v is the sum of u and 
the additive inverse of v. That is, 


u-v=u+(-v). 


Fact 118. Jf u= (u,uU2) and v= (v1, v2) are vectors, then 


u-v = (Uj) — U1, U2 — V2). 





Proof. By Definition 134, -v = (-v1,-v2). And so by Definition 133, 











u-v=u+ (-v) = (uw - U1, U2 — U2). 





To get u-v, first flip v to get —-v, then add -v tou: 


Flip v 


———_> uUu-VvV —V 


Or equivalently, place the heads of u and v at the same point. Then u - v is the vector 
from the tail of u to the tail of v: 
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In the previous subchapter, we defined B - A = AB. We'll now prove that OB - OA = AB: 


SS SSS 
Fact 119. Suppose A and B are points. Then OB-OA= AB. 


Proof. Let A = (a1,a2) and B = (b;,b2). By Definition 124, 


ABS (Gis by = a5), OA = (a1, 49), and OB = (by, bs). 














By Fact 118, OB -OA = (bj — a1, bg — a2). Thus, OB — OA = AB. 


More generally, 


Fact 120. Suppose A, B, and C are points. Then AB - AC = CB and AB + BC = AC. 





Proof. Let A = (a1, a2), B = (61,62), and C = (c,c2) be points. By Definition 130, 
AB = (b; — a1, b - a2), AC =(c-4@1,¢2-a2), and CB =(bj-c,b)-©). 


And now by Fact 118, 


> 


AB AC Gi 0p Gag = ts) | Ci, SCP. 














Observing that _CB = BC and rearranging, we also have AB+ BC = AC. 


Example 767. Suppose A = (-1,2) and B = (3,-1). Then 


8-08 -0a-( 


Exercise 215. Express each of these six vectors more simply: (Answer on p. 1841) 


AC +CB, DCCA, BD +DA, 





AD = CD, ae = BD BD + DB. 
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53.10. Scalar Multiplication of a Vector 
Scalar multiplication of a vector works in the “obvious” fashion: 


Definition 136. Given the vector v = (v1, v2) and the scalar c€ R, the vector cv is 


ev = (cuiyeu).. 





The vector cv is simply the vector that points in the same direction as v, but has c times 
the length. 


Fact 121. [fv is a vector and ce R, then |cv| = |c| |v]. 














Proof. See Exercise 216. 





Exercise 216. Let ce R and v = (v4, v2). (Answer on p. 1841.) 


(a) Write out cv. 
(b) Now prove Fact 121. Calan ey set ollie ii 


Exercise 217. Let A = (1,-3), B = (2,0), and C = (5,-1). (Answer on p. 1842.) 


(a) Write down AB, AC, BC, 2AB, 3AC, and 4BC. 
(b) Verify that 2AB| =o AB}, [3c ae AC], and l4Bc| a7 BCI. 
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53.11. When Do Two Vectors Point in the Same Direction? 


In words, we say that two non-zero vectors u and v point in 


(a) The same direction if they are positive scalar multiples of each other; 
(b) Exact opposite directions if they are negative scalar multiples of each other; and 
(c) Different directions if they are not — scalar multiples of each other. 


In a bit more formal notation, 


Definition 137. We say that two non-zero vectors u and v point in 


(a) The same direction if u = kv for some k > 0; 


(b) Exact opposite directions if u = kv for some k < 0; and 


(c) Different directions if u# kv for any k € R. 


Example 769. Let a = (2,0), b = (1,0), c = (-3,0), and d = (1,1). 
The vectors a and b point in the same direction because a = 2b. 
The vectors a and c point in the exact opposite directions because c = —1.5a. 


The vectors a and d point in different directions because a # kd for any k. 


Exercise 218. Continuing with the above example, explain if b points in the same, exact 
opposite, or different direction from each of c and d. (Answer on p. 1842.) 
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53.12. When Are Two Vectors Parallel? 


Two non-zero vectors u and v are parallel if they point in the same or the exact opposite 
directions and non-parallel otherwise: 





So, non-parallel and point in different directions are synonyms. 


As shorthand, we write a || b if a and b are parallel and a + b if they are not. 


Example 770. The vectors a = (2,0), b = (1,0), c = (-3,0) are parallel. And so as 
shorthand, we may write a || b, a || c, and b || c. 


The vector d = (1,1) is not parallel to a, b, or c. Equivalently, d = (1,1) points in a 
different direction from a, b, and c. And so as shorthand, we may write d | a, d + b, 
and d tc. 
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53.13. Unit Vectors 


Definition 139. A unit vector is any vector of length 1. 


Example 771. The vectors (1,0), (0,1), and (2/2, V2/2) are unit vectors: 


\(1,0)[=VP? +0? =1, 
(0,1)| = V0? 41? =1, 


(PA) -@)- 


2 


may 2 


Example 772. The vectors (1,1) and (-1,-1) are not unit vectors: 
(1, 1)| = V12 + 2 = V2 #1, 


(-1,-Dl = C1)? + C1)? = V2 41. 


Given a vector v, its unit vector, denoted V, is the vector that points in the same direction, 
but has length 1. Formally, 





Definition 140. Given a non-zero vector v, its unit vector (or the unit vector in its 
direction) is 





It is easy to verify that thus defined, any vector’s unit vector has length 1: 


Fact 122. Given any non-zero vector, its unit vector has length 1. 


1 
via lab 
Iv| 


1 
Iv| 














il 
Proof. By Fact 121, |¥| = ay 


v| 











Example 773. XXX 


Fact 123. Let © be a unit vector. Ifcé R, then the vector c¥ has length |cl. 


Proof. See Exercise 219. 














Example 774. XXX 
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Proof. See p. 1618 (Appendices). 
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53.14. The Standard Basis Vectors 


j= (0, 1) 











The standard basis vectors i = (1,()) and j = (0,1) are simply the unit vectors that point 
in the directions of the positive x- and y-axes. Formally, 


Definition 141. The standard basis vectors (in 2D space) are i= (1,0) and j = (0,1). 





It turns out that any vector can be written as the linear combination (i.e. weighted 
sum) of i’s and j’s: 


Example 775. Let A = (-2,-1), B = (1,2), and C = (5,-2) be points. Their position 
vectors can be written as linear combinations (i.e. weighted sums) of i’s and j’s: 
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53.15. Any Vector Is A Linear Combination of Two Other Vectors 


We just learnt that any vector can be written as the linear combination (i.e. weighted 
sum) of the standard basis vectors i and j. It turns out that more generally, we can do the 
same using any two vectors, so long as they are non-parallel: 


Fact 125. Let a, b, and c be vectors. Ifa tb, then there exist a,G €R such that 


c=aa+ Pb. 





Proof. For a heuristic proof, see the next page. For a formal proof, see p. 1619 (Appendices). 














Example 776. Consider the vectors a = (1,2) and b = (3,4). Since a + b, by Fact 125, 
any vector can be expressed as the linear combination of a and b. 


Consider for example the vector u = (2,2). We will find a, @ € R such that u = aa + bb. 


To do so, first write 
2 1 3 
u= =aat+Gbb=a + . 


Write out the above vector equation as the following two cartesian equations: 
2210+36 and 2220+48. 


Now solve this system of (two) equations: 2 minus 2x 4 yields -26 = -2 or 6 = 1, so that 
a=-l. So, 


Exercise 221. Let a= (1,2) and b= (3,4). Express each of the vectors v = (3,2) and 
w = (-1,0) as the linear combination of a and b. (Answer on p. 1842.) 


Exercise 222. Explain why any vector can be written as a linear combination of the 
vectors a = (1,3) and b = (7,5). Then express each of the vectors i = (1,0), j = (0,1), 
and d = (1,1) as the linear combination of a and b. (Answer on p. 1843.) 
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The condition that a + b is important. Ifa || b, then Fact 125 does not apply: 


Example 777. Consider the vectors a = (1,1) and b = (2,2). Since a || b, Fact 125 does 
not apply. 


For example, we cannot express v = (1,2) written as the linear combination of a and b. 


Example 778. The vectors c = (3,1) and d = (-3,-1) point in exact opposite directions. 
Since c || d, Fact 125 does not apply. 


For example, we cannot express v = (1,2) written as the linear combination of c and d. 





Here is a heuristic proof-by-picture of Fact 125. 
Let a, b, and v be non-zero vectors, with a | b. 


Place a’s tail at v’s tail. Then also place b’s head at v’s head. 





As the above figure suggests, “obviously”, we can always find real numbers a and £ so that 
the head of aa and the tail of Gb coincide. In other words, there are real numbers a and 
GB such that v = aa+ bb. 
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53.16. The Ratio Theorem 


Theorem 16. Let A and B be points with position vectors a and b. Suppose the point 
P is on the line segment AB and divides it in the ratio X: u. Then P’s position vector is 


at+A 
A+ 


The point P has position vector: 


at+A 
A+ 


The point A has 
position vector a. 





Proof. By Fact 119, AP 4 p-a and AB =b-a. 


Now observe that AP points in the same direction as AB, but has A/(A +) times the 
length. So, 





Putting + and 2 together, we have 





(A+pjat+A(b-a)  pa+ Ab 
A+ ae 














ped A se a 
A+ 
No need to mug the Ratio Theorem because List MF 26 (p. 4) has this: 


The point dividing AB in the ratio 1: w has position vector a 


+ Ml 
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Example 779. Let A = (3,4) and B = (-1,2) 
be points. Let P be the point that divides the 
line segment AB in the ratio 3: 2. 


By the Ratio Theorem, B=a(-1,2) 


o+Z 





9a+3b 2 ] 
— = + 





P = (0.6, 2.8). 


Example 780. Let C = (8,3) and D = (2,-6) be points. Let Q be the point that divides 
the line segment C’D in the ratio 3:7. By the Ratio Theorem, 


d 1 
ee ee Cee? Sema, oom: 
3+7 10\3)} 10\-6} 10\ 3 


Ay 





C = (8,3) 








Exercise 223. Let A= (1,2), B= (3,4), C = (1,4); D=(@)3), £= 1,2), fF =6,-4) 
be points. Find the points P, Q, and R which divide the line segments AB, C'D, and 
EF in the ratios 5:6, 5:1, and 2:3, respectively. (Answer on p. 1843.) 
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54. Lines 


Recall?! that any line 1 may be written as 
l= {(x,y):axr+by+c=0}, 


where at least one of a or 0 is non-zero. 


More simply, we say that the line | is described by this cartesian equation:*”” 


ax + by+c=0. 


In this chapter, we’ll learn a second method for describing lines, namely vector equations. 


We'll start by introducing the concept of a line’s direction vector: 


54.1. Direction Vector 


Informally, a direction vector of a line is any vector that’s parallel to the line. Formally, 


Definition 142. Given any two distinct points A and B on a line, we call the vector AB 
a direction vector of the line. 


Example 781. Consider the line / described by the cartesian equation y = 2x + 3. 


It contains the points 
A=(0,3) and B=(-1.5,0). 
Hence, a direction vector of / is 
AB=B-A=(0,3)—C18,0) =(1.5,3). 
The line / also contains the points 


Ce=i{-l1) end 2=(1,5). 





Hence, another direction vector of I is 





Si 15s mh ea eee ey 








321Ch.. @. 


322Some writers also call this a scalar equation. We won't. 
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The above example shows that direction vectors are not unique. If v is a direction vector 
of a line, then so too is any vector that’s parallel to v. 


But, no other vector is a direction vector of the line. That is, if u + v, then u is not a 
direction vector of the line. (And so, although the direction vector v isn’t unique, we can 
say that it is unique up to non-zero scalar multiplication. ) 


Altogether, if a line has direction vector v, then its direction vectors are exactly those that 
are parallel to v. Formally, 


Fact 126. Suppose v is a line’s direction vector. Then 


u is also that line’s direction vector 








Proof. See p. 1619 (Appendices). 











Example 782. The line®”’ y = 2x + 3 has direction vectors 


(3) = (1) 


We can easily verify that CD | AB: 


oo -(’ 
4 


The following are parallel to AB and CD and are thus also direction vectors of I: 


=(2)-() 


In contrast, these are not: 


(3}(2} = (e(2) 








323 Strictly and pedantically speaking, “y = 2x7+3” is an equation and not a line—so, we should say instead, 
“The line described by y = 22 +3,” or, “The graph of y = 2x +3,” or, “The line {(z,y) : y = 27+3}. But 
this gets cumbersome. So we shan’t be so strict and pedantic. We’ll often simply say things like, “The 
line y = 2x + 3,” as is done here. 


590, Contents www.EconsPhDTutor.com 


Fact 127. The line ax + by+c=0 has direction vector (b,-a). 


Proof. Let D = (p,q) be any point on the line. Since D is on the line, it satisfies the line’s 
cartesian equation—that is, 


ap + bq¢+c=0. 


Now consider the point E = (p+b,q-a). We now show that EF also satisfies the line’s 
cartesian equation and is thus is also on the line: 


a(pt+b)+b(q-a)+c=apt+ab+ bqg-ab+c=ap+bq+c=0. 


Since D and EF are both points on the line, by Definition 142, the line has this direction 
vector: 





DE = E- D =(p+b,q-a) -(p,q) = (b,-a). 











Example 783. By Fact 127, the line 5% - 2y+3 =0 or y = 2.5% +1.5 has direction vector 
(-2,-5). 


Since (1,2.5) || (-2,-5), by Fact 126, it also has direction vector (1,2.5). (As we’ll see 
on the next page, not coincidentally, this line’s gradient is also 2.5.) 


5a - 2y+3=0 
Ory = 2.07 1.5 








Next, the line 3y - 1 =0 or y = 1/3 has direction vector (3,0). 
And the line -2z + 1 =0 or x = 0.5 has direction vector (0,2). 
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The following result follows readily from Fact 127: 


Corollary 21. (a) A horizontal line has direction vector (1,0). 


(b) A vertical line has direction vector (0,1). 


(c) A line with gradient m has direction vector (1,m). 





Proof. By Fact 127, the line ax + by +c = 0 has direction vector (6, -a). 


(a) If the line is horizontal, then by Fact 25, a = 0. And so, the line has direction vector 
(b,0). Since (1,0) || (6,0), by Fact 126, the line also has direction vector (1,0). 


(b) Similarly, if the line is vertical, then by Fact 25, b= 0. And so, the line has direction 
vector (0,-a). Since (0,1) || (0,-a), by Fact 126, the line also has direction vector (0,1). 


(c) The line’s gradient is -b/a = m. But (b,-a) || (-b/a,1). And so by Fact 126, the line 
also has direction vector (1,m). 














Example 784. The horizontal line y = —1 has direction vector (1,0). 
The vertical line x = 2 has direction vector (0,1). 


The oblique line y = 7 +1 has gradient 1 and thus direction vector (1,1). 








Exercise 224. For each line, write down a direction vector. (Answer on p. 1844.) 


(a) 34-y+2=0. (b) Ty = -22-3. (eC) ga 7 (d) x=-5. 
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54.2. Cartesian to Vector Equations 


Example 785. Let / = {(2,y) : 3x” -y+2=0} be the set containing exactly those points 
(x,y) that satisfy the cartesian equation 37 -y +2 =0 or y = 3x+2. We say that I is the 
line that may be described by the cartesian equation 3x -y+2=0 or y=3r4+2. 


It turns out that we can also describe / using a vector equation. 


To do so, first observe that / contains the point P = (0,2). Also, it has gradient 3 and 
thus direction vector v = (1,3). Since / is a straight line, it must also contain these points: 


P+1v=(0,2)+1(1,3)=(1,5) and P-1v=(0,2)-1(1,3) =(-1,-1). 


Indeed, | contains exactly those points R that can be expressed as P+ Av = (0,2) +A(1,3) 
for some real number \. That is, 


1={R:R4+(0,2)+X(1,3) (AeR)}. 


More simply, we may say that the vector equation = describes I. 


Equivalently, / contains exactly those points R whose position vector r may be expressed 
as p+ Av = (0,2) + A(1,3) for some real number \. That is, 


l= {R:r2(0,2)+X(1,3) (AeR)}. 


Again, we may more simply say that the vector equation 2 describes I. 


(By the way, + and = are subtly different—more on this in Ch. 54.3.) 


As in Ch. 45 (Simple Parametric Equations), \ is a parameter. Here “ € R” says that 
\ takes on every value in R. And as 4 varies, we get different points of the line. 


For example, 4 = —-1, A =0, and A= 1 produce these points: 


(-1,-1) = (0,2)-1(1,3), (0,2) = (0,2) +0(1,3), and (1,5) = (0,2) +1(1,3). 


/ may be described by: 
32 —-yt+2=0, or 
r 2 (0,2) +A(1,3) (A€R). 











(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Of course, / also contains infinitely many other points—each distinct value of A « R 
produces a distinct point on J. 


We noted in Fact 126 that a line’s direction vector is unique up to non-zero scalar 
multiplication. A direction vector of | is v = (1,3), but so too is any non-zero scalar 
multiple of v. And so, here are three more ways to write 1: 


=| (ay)2= (! + 9( 30) (eR) | 


{om-(7) of: 
{ews (! 


More generally, for any k #0 and u=k (1,3), we may write 





l={(2,y):r=(0,2)+Au (A€R)}. 
The foregoing discussion suggests the following general Definition of a line. 


Definition 143. A line is any set of points that can be written as 


{R:OR=p+dv (\eR)}, 





where p and v # 0 are vectors. 


The above Definition says that a line contains exactly those points R whose position vector 
— 
OR =r may be expressed as 


OR=r2p+dv= (pi, po) + (v1, 09) (for some \ € R). 
Equivalently, a line contains exactly those points R that may be expressed as 


R= (p1,p2) + A(v1, v2) (for some \ € R) 


To repeat, here are what the vectors p and v and the number \ mean: 


° p=(p1,p2) is the position vector of some point on the line; 

¢ v=(v1, v2) is a direction vector of the line; and 

e The parameter takes on every value in R; each distinct value produces a distinct 
point on the line. 


Definition 143 subsumes and our earlier definition of a line, Definition 41. The difference is 
that Definition 41 “works” only in 2D space. In contrast, Definition 143 is more general—it 
“works” in 2D space and, as we’ll see, also in 3D space.?*4 





324Tndeed, it also “works” in any n-dimensional space. 
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And so, to write down a line’s vector equation, we need simply find any point on the line 
and any direction vector of the line. More examples to illustrate how this works: 
Example 786. Let / be the line y=1- 7. 


It contains the point (0,1). It has gradient -1 and so, by Corollary 21, the direction 
vector (1,-1). Hence, we can also describe the line / by 


R+=(0,1)+A\(,-1) or =r 2(0,1)+A(1,-1) (Ae R). 


Both 4 and 2 say the same thing: 
os says that | contains exactly those points R that may be written as (0,1) + A(1,-1), 
for some real number 4. 


» 2 says that / contains exactly those points R whose position vector r = OR may be 
written as (0,1) +A(1,-1), for some real number 4. 


IN 
The line / 





/ may be described by: 
y=1-2, or 
r=(0,1)+A(1,-1) (A€R). 





As the parameter \ takes on different values in R, we get different points of /. So for 
example, \ = -1, 1 =0, and A = 1 produce these points: 


(-1,2) = (0,1) -1(1,-1), 
(0,1) = (0,1) + 0(1, -1), 
(1,0) = (0,1) +1(1,-1). 
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Example 787. Let / be the line y = 3. 


It contains the point (0,3). Since it’s horizontal, by Corollary 21, it has direction vector 
(1,0). So, we can also describe the line / by 


R+=(0,3)+A(0,0) or  r2(0,3)+A(1,0) (Ae). 


IN 
Y 


The line 1 








As . varies, we get different points of 1. So for example, \ = -1, A =0, and A = 1 produce 


Cio, 
(0,3) = (0,3) +0(1, 0), 
(oa) =O, 3) 11,0), 


Example 788. The line 7 = 1. contains the 
point (-1,0). Since it’s vertical, by Corollary 
21, it has direction vector (0,1). So, we can 
also describe the line by 


R= (0,1) +A(0,1) or 
r2(0,1)+A(0,1) (AeR). 





As 4 varies, we get different points of the line. 
For example, 4 =-1, A=0, and A =1 produce 


(=1,.2) = (0,1) =1(0,.1), 
(0,1) = (0,1) + 0(0, 1), 
(1,0) = (0,1) +1(0,4) 
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Exercise 225. Each of the following (cartesian) equations describes a line. Rewrite each 
into a vector equation. Also, what points are produced when your parameter takes on 
the values -1, 0, and 1? (Answer on p. 1844.) 


(a) -5r+y+1=0. (b) x-2y-1=0. (c) y-4=0. (dr a=4 =i) 


Exercise 226. In Definition 143 (of a line), we impose the restriction that a line’s direc- 
tion vector v must be non-zero. By considering what the line becomes if v is the zero 
vector, explain why we impose this restriction. (Answer on p. 1844.) 
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54.3. Three Pedantic Points to Test/Reinforce Your Understanding 


As we’ve seen above, a line / may be described using either of these vector equations: 


Point Point Vector 
— 


nop = we 
LP LW (eR). 


. Vector 
Vector Vector 


Or, yr 2p +d (eR). 


Pedantic Point 1. = is consistent with what we learnt earlier (in Ch. 53.9): 
Point = Point + Vector. 
Likewise, 2 is consistent with what we learnt earlier: 
Vector = Vector + Vector. 


So, both vector equations + and 2 are perfectly correct ways to describe the exact same 
line. 


The difference is that + does so “more directly” than Zo Because, to repeat, 
03 says that / contains exactly those points R that may be written as P+ Av, for some 
real number 4. 
—> 
» 2 says that / contains exactly those points R whose position vector r = OR may be 
written as p+ Av, for some real number X. 


Pedantic Point 2. What would be wrong and unacceptable is this: 


Point Vector Vector 


Ri Dp +\’v_ (eR), x 


As we learnt earlier (Ch. 53.9), Vector + Vector = Vector. But the LHS of 3 is a Point 
while its RHS is a Vector. So, 3 is false. 


Similarly, the following is also wrong and unacceptable: 


Vector Point Vector 
as 4 = 
r = P+Av_ (ER). x 


As we also learnt earlier, Point + Vector = Point. But the LHS of * is a Vector while its 
RHS is a Point. So, * is false. 


Pedantic Point 3. A line is a set of points and not a set of vectors. So, take care to 


note that the line / contains the points R = (x,y) and P = (p,,p2)—it does not contain 
the vectors r = (x,y) and p = (pi, p2). 
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54.4. Vector to Cartesian Equations 
Suppose a line is described by this vector equation: 


r=pt+Av (eR). 


Or equivalently, a Ds a | Oe (A € R). 
y P2 V2 


Then given any point (z,y) on this line, there must be some real number \ such that 
x=p, trv} and Y = po + Avo. 


We say that the line may be described by the above pair of cartesian equations. 
Hm ... but aren’t we supposed to be able to describe a line with just one cartesian equation? 


Well, if we’d like, we can do some easy algebra to eliminate the parameter A: 
Example 789. The line / is described by 
r=(1,2)+A(1,1) (A€eR). 


We can also describe | by this pair of cartesian r = (1,2) +A(1,1) 
equations: 


c=1+X-1 and eee eae 


We can eliminate the parameter \ through sim- 


ple algebra. Take 2 minus 2: 


Vee — 1 Ome ne a, 


Example 790. The line / is described by 


r=(0,0)+\(4,5) (eR). 


We can also describe | by this pair of cartesian y= “ig = 


equations: r = (0,0) + A(4,5) 


c=0+-4 and y204+A-5. 





Os 
Take = minus 
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Example 791. The line / is described by 


f= 5 OF 
r= (3,1) +A(0,2) (AeR). r = (3,1) + A(0, 2) 


We can also describe | by this pair of cartesian 
equations: 


2+3+2-0=3 and y 214-2. 





Observe that in this example, we cannot use 
algebra to eliminate A. It turns out that this is 
actually a vertical line. 





As A varies, the value of x is fixed at x 2 3, while y varies along with \. And so, instead 


of doing any algebra, we’ll simply discard 2 The above pair of cartesian equations then 
reduces to the single equation: 


Example 792. The line / is described by 
r=(-1,2)+A(-1,0) (A€R). 


We can also describe | by this pair of cartesian y = 2 or 
equations: r = (-1,2)+ A(-1,0) 


v=-1+-(-1) and y22+\-0=2. 





Again, in this example, we cannot use algebra 
to eliminate \. It turns out that this is actually 
a horizontal line. 





As 4 varies, the value of y is fixed at y 2 2, while x varies along with \. And so, instead 


of doing any algebra, we’ll simply discard =. The above pair of cartesian equations then 
reduces to the single equation: 


Exercise 227. Each of the following vector equations describes a line. Rewrite each into 
cartesian equation form. (Answer on p. 1844.) 


(a) r=(-1,3)+A(1,-2) (A€R). (b) r=(5,6)+A(7,8) (A€eR). 
(c) r=(0,-3)+A(3,0) (AeR). (d) r=(1,1)+A(0,2) (A€eR). 
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In general, 


Fact 128. Let 1 be the line described by r = (pi,p2) + A(v1, 02) (AER). 


(a) If vi,v2 #0, then 1 can be described by 


L-Pi YPpe2 V2 ep) 
eo 
U1 v2 U1 U1 
(b) [fv =0, then l is vertical and can be described by x = py. 


(c) If v2 =0, then | is horizontal and can be described by y = po. 





: . 1 2 

Proof. First, write L=p,+Av1 and Y = po + Avo. 
a 1 

Take v,x = minus vox =: 


ULY — Ug = Vyp2 + AV {V2 — Vap1 — AV{V2 = ViP2 — VeP1- 





Or, U2 (x-p) 2 (y - po). 
(a) If v,,v9 #0, then 3 divided by vv» yields ~ _ me _ 
al 2 


(b) If v, =0, then 2 becomes x = p). 











3 
(c) If v2 =0, then = becomes y = py. 





Armed with Fact 128, we now revisit the last four examples. 


Example 793. The line r = (1,2) + A(1,1) (A€ R) may also be described by 





oe 
Y= 7 : 


Example 795. The line r = (3,1) + A(0,2) (A € R) may also be described by x = 3. 





Example 796. The line r = (-1,2) + A(-1,0) (A € R) may also be described by y = 2. 


Exercise 228. Use Fact 128 to redo Exercise 227. (Answer on p. 1844.) 
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55. The Scalar Product 


Definition 144. Given vectors u = (u1,u2) and v = (v1, v2), their scalar product, denoted 
u-v, is this number: 


U'V = UV + UQV2. 


a 
1 


5-8+(-3)-7 = 40-21 


a (Aya 30 





The scalar product is itself simply a scalar (i.e. a real number). Hence the name. 





Right now, the scalar product may seem like a totally random and useless thing, but as 
we'll soon learn, it is plenty useful. Let us first learn about a few of its properties. 


Recall from primary school that multiplication is commutative: 


Example 798. 3x5=15 and 5x3= 15. 


Moreover, multiplication is distributive (over addition): 
Example 799. 3x (5+ 11) =3x5+3x11 and 18x (7-31) = 18x 7+18 x (-31). 


It turns out that the scalar product is ikewise commutative and distributive: 
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Fact 129. Suppose a, b, and c are vectors. Then 


(a) a-b=bz-a. (Commutative) 
(b) a-(b+c)=a-b+a-c. (Distributive over Addition) 





The fact that the scalar product is both commutative and distributive is a simple conse- 
quence of the fact that multiplication is itself commutative and distributive. (The latter is, 
in turn, a fact we’ll simply take for granted in this textbook.) 


Proof. Let??? a = (a1,a2), b = (b;, 2), and ¢ = (c1,c2). Then 


(a) a-b = a,b; + Abo = bia, + boa =b-a. 











(b) a:(b+c) = a; (b) +c) + dy (bo + C2) = apd) + Agby + ajc, + aQcg =a-bt+a-c. 





Example 800. Continue to let u = (5,-3), v = (2,1), w = (-4,0), and x = (8,7). 


The scalar product is commutative: 
v-u=u-ve=7, w-u=u-w=-—20, = We x = 10, 


It is also distributive over addition: 


wivem =| (cd) ao-a- 8+ 7-20 =u-vt+u-w, % 
— + 


coeyw | ae } 7 | = -28+0 = -28 = -20-8 =u-wtv-w. t 
—3+ 





Here’s another “obvious” property of the scalar product: 


Fact 130. Suppose a and b be vectors and ce R be a scalar. Then 





(ca): b=c(a-b). 


Proof. Let®*® a = (a1, a2) and b = (b,b2). So, ca = (ca, caz). Hence, 














(ca) - b = (cay) by + (Caz) bp = C (a,b, + agb2) = c(a-b). 


Exercise 229. Let v = (2,1), w = (-4,0), and x = (8,7). Above we already computed 
v-w=-8. Now also compute these: (Answer on p. 1845.) 





V-X, W:X, W-V, X:v, X-w, w-(x+v), (2v)-x, and w- (2x). 





32°The proof covers only the two-dimensional case. For a more general proof, see p. 1620 (Appendices). 
26This proof covers only the two-dimensional case. For a more general proof, see p. 1620 (Appendices). 
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55.1. <A Vector’s Scalar Product with Itself 


A vector’s length is the square root of the scalar product with itself: 


Fact 131. Suppose v be a vector. Then |v| = Vv-v and |vf =v-v. 





Proof. By Definition 126, |v| = \/v? +13. By Definition 144, v-v = vv) + vave = UP + U5. 
Hence, |v| = /v-v and |v/ =v-v. 














Exercise 230. The lengths of u = (5,-3), v = (2,1), w = (-4,0), and x = (8,7) are 
jul = \/52 + (-3) = V34 


Ivi= V224+12 = V5 


lw] = /(-4)° +02 = 4 


x)= V82472 = V113 


And the square roots of the scalar product of each vector with itself are 





Ju-u = /(5,-3): (5, -3) V34 
Jw-w = \/(-4,0)-(-4,0) = 





Exercise 231. Let a = (-2,3), b = (7,1), and c= (5,-4). Verify that for each vector, its 
length equals the square root of its scalar product with itself. (Answer on p. 1845.) 
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56. The Angle Between Two Vectors 


Place the tails of u and v at the same point. 


Informally, we might define the angle between u and v as 
the “amount” we must rotate u so that it points in the om - a 
same direction as v. 


The thing is, we can either rotate u anticlockwise by a 
or clockwise by 277- a. So, we have an ambiguity here— 
which is the angle between u and v? Is it a or 21- a? 


Uu 


To resolve this ambiguity, we will simply define the angle between two vectors as the 
smaller of these two angles. In other words, we’ll define the angle between two vectors 
so that it’s always between 0 and 7 (inclusive). 


And so, in the above figure, the angle between u and v is a (and not 27- a). 


In contrast, in the figure below, the angle between w and x is (@ (and not 27 - {). 


xX 


27 — B 


Previously in Ch. 30, we gave an informal definition of angle (Definitions 88). As promised 
then, we now give our formal Definition of the angle between two vectors. 


Warning: This definition comes seemingly outta nowhere. But don’t worry, Exercise 232 
(next page) will help you understand where this Definition comes from. ae as 
Definition 145. The angle between two non-zero vectors u and v is this number: 


=i u:vVv 
COS 





fal |v] 


Recall (Definition 99) that Range cos! = [0,7]. So, by the above Definition, the angle 
between two vectors is indeed always between 0 and 7. 
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Exercise 232. Let u and v be vectors and @ be the angle between them. 


u 


u-v 
This Exercise will help you understand why we define @ = cos! Fae 
ul |v 


(a) Write down the vector that corresponds to the third side of the above triangle. 
(b) Write down the lengths of the triangle’s three sides in terms of u and v. 


(c) The Law of Cosines (Proposition 7) states that if a triangle has sides of lengths a, ), 
and c and has angle C' opposite the side of length c, then 


c? =a? +b — 2abcosC. 
Use the Law of Cosines to write down an equation involving 0, u, and v. 


(d) Use distributivity (Fact 129) to prove that (u-v)-(u-v) =u-u+v-v-2u-v. 


(e) Now take the equation you wrote down in (c), do the algebra (hint: use Fact 131), 
and hence show that 


=¢oq = (Answer on p. 1846.) 


jul ly 





A simple rearrangement of Definition 145 produces this result: 
Fact 132. Jf u and v are two non-zero vectors and 6 is the angle between them, then 
u-v = |u||v|cos 0. 


Example 801. Let 6 be the angle between the vectors i = (1,0) and u= (1,1). Since i 
points east, while u points north-east, primary-school trigonometry says 6 = 71/4. 


Let’s verify that this is consistent with Definition 145: % 


=a -1 (1,0)2 (11) 





= COs 


(1, 0), 1] 
ESE SO 
V 12 +02V 12 + 12 
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Example 802. Let @ be the angle between the vectors i = (1,0) and j = (0,1). Since i 
points east, while j points north, primary-school trigonometry says 0 = 77/2. 


Let’s verify that this is consistent with Definition 145: % 


tj _ (ot G0) +40.) 
i O10.) 


10-60 1 
V 12 + 020? + 12 


v 


COs 


Example 803. Let v = (3,2) and w = (-1,-4) be vectors. By Definition 145, 


LW sggrt (3:2): (C1,-4) v= (3,2) 
Iv] fw] I(3, 2) |(-1, -4)| 
3 x (-1) +2 x (-4) 


V3? + 22\/(-1)° + (-4)° 


-3-8 =) 
= cos! (==) = cos! (=) wx 2.404. % 
V7 13/17 af 2271 





By the way, here’s a possible concern. We’ve defined the angle between two vectors as 


2] u:vVv 
COs — ——_. 
jul [v| 


But recall (Definition 99) that Domain cos"! = [-1,1]. So, how can we be sure that the 
above expression is always well-defined? That is, how can we be sure that 
u-v u-v 


—— e[-1,1] or = eee ak 
ju [v| 


Fortunately, we can, thanks to Cauchy’s Inequality:°2" 


Fact 133. (Cauchy’s Inequality.) Suppose u and v are non-zero vectors. Then 


je 
jul |v 





Equivalently, —jul[v|<u-v<|ullvy}] or (u-v)’ <|ul’ vp. 





Proof. See p. 1620 (Appendices). 














327 Also known as the Cauchy-Schwarz Inequality in its more general form. 
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Fact 134. Suppose 0 is the angle between two non-zero vectors u and v. Then 


(a) 


u-v 

hull 

al And thus, 

ul |v 

mall (i) u-v>0 <> 6 is acute or Zero; 
Vv 

ar (ii) Wev =O <= 0 is right; 


jul [v| 


aM _ (iii) u-v <0 <=> @ is obtuse or straight. 
Vv 























Fact 134(ii) motivates the following Definition: 


Definition 146. Two non-zero vectors u and v are perpendicular (or normal or orthog- 
onal) if u-v =0 and non-perpendicular if u-v # 0. 





As shorthand, we write u 1 v if two non-zero vectors u and v are perpendicular; and u { v 
if they aren't. 
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Proof. “Obviously”, (c) and (d) simply follow from (a) and (b). For the proof of (a) and 





(b), see p. 1621 (Appendices). 











More examples: 


Example 804. Let 0 be the angle between the vectors u = (1,-3) and v = (-2,4). Then 


ane = cos =(C0ss nls x 2.999. 


Jul|v| > nae 3) a= = + 42 10/2 


So, u and v are neither perpendicular nor parallel; instead, they point in different direc- 
tions. Moreover, the angle between them is obtuse. 





v = (-2,4) 


6 x 2.999 
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Example 805. Let @ be the angle between the vectors u = (1,-2) and v = (—2,4). Then 


1 


= cos! =eos =<l=7. 


u-v —2- ——— = = 
—— = CO 
|u| fv| a ae 2 /( (3; 2 5/20 


So, u and v are parallel; more specifi- 
cally, they point in exact opposite direc- 
tions. Moreover, the angle between them 
is straight. % 


Example 806. Let @ be the angle between the vectors u = (3,-1) and v = (1,3). Then 


1 


ial ic — 1) Sita 


So, u and v are perpendicular and the an- v = (1,3) 
gle between them is right. % 





@ = cos 


Sil 7 
0 = pac 
= COS = 5 


Example 807. Let @ be the angle between the vectors u = (1,-2) and v = (2,-4). Then 


10 
= COs =cos!1=0. 


24+8 
Pls eee Iya. (4)? “Javan 


So, u and v are parallel; more specifically, they 
point in the same direction. Moreover, the an- 
gle between them is zero. % 
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We can use Fact 134(i)—(iii) to quickly check if the angle between two vectors is acute, zero, 
or obtuse: 


Example 808. Even without any graphs or exact calculations, we can quickly see that 


e The angle between the vectors (817,-2) and (39,-55) is acute, because 


817-39 + (-2)-(-55) > 0. 


e The vectors (v 79300, -470) and (47, V 793) are perpendicular, because 


79300 - 47 + (-470) - 793 = 0 
¢ If k <0, then the angle between (67,k) and (—485, 32) is obtuse, because 
67 - (-485) + 32k < 0. 


Exercise 233. For each, find the angle between u and v. Is it zero, acute, right, obtuse, 
or straight? Are u and v perpendicular or parallel? Do they point in the same, exact 
opposite, or different directions? (Answers on p. 1846.) 


(Ale a= 230) and vy = (0,17): (b) = (5,0) “amd vy =(—3,0), 
(ce) m= (10) andy = Hl) (d) u=(2,-3) and v= (1,2). 
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56.1. Pythagoras’ Theorem and Triangle Inequality 


Recall Pythagoras’ Theorem (Theorem 2)? Here it is again, but now in the language of 
vectors: 


Theorem 17. (Pythagoras? Theorem.) Jf u 1 v, then |u+v|" =|ul’ +|v/. 














Proof. See Exercise 234. 





utv 


Exercise 234. Use Facts 129 and 131 to show that Ju+v|° = |u|’ +2u-v+|v|. Then use 





u 1 v to complete the proof of the above Theorem. (Answer on p. 1847.) 


Recall the Triangle Inequality (Fact 106)? Here it is again, but now in the language of 
vectors: 


Fact 136. (Triangle Inequality.) Jf u and v are vectors, then |\u + v| < |u| + |v]. 














Proof. See Exercise 235. 





Exercise 235. Exercise 234 already showed that |u+v| = ful’ + 2u-v+|v/. 


Prove Fact 136 using these steps: First apply Cauchy’s Inequality (Fact 133) to the above 
equation. Then complete the square and take square roots. (Answer on p. 1847.) 
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56.2. Direction Cosines 


Definition 147. The x- and y-direction cosines of the vector v = (v1,v2) are these 
numbers: 





Observe that the unit vector of v = (vj, v2) is 


‘A (7 v2 
v=(—,—]. 
Iv| Iv] 
And so, equivalently, v’s x- and y-direction cosines are the x- and y-coordinates of its unit 
vector. 


We now explain why the x- and y-direction cosines are so named. Place the tail of v = 
(v1, v2) at the origin. Let a be the angle between v and the positive x-axis. Similarly, let 
6 be the angle between v and the positive y-axis.°”° 














Let V = (a,b) be v’s unit vector. It has length 1 and forms the hypotenuse of two right 
triangles. From the lower-right triangle, we have a = cosa. 


Similarly, from the upper-left triangle, we have b = cos (3. 


This explains why v’s unit vector’s x- and y-coordinates are also its x- and y-direction 
cosines. 





328More formally, a is the angle between v and i= (1,0), while 3 is the angle between v and j = (0,1). 
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We can state and prove what was just said a bit more formally: 


Fact 137. Let v = (v1,v2) be a non-zero vector. Suppose a and 2 are the angles between 
v and each of i andj. Then 





Proof. Since a is the angle between v and i, by Definition 145, we have 


v-i v1,-1+v9-0 Ul 
COS @ = —T = FT 
Iv fil [v|-4 Iv| 


Similarly, since @ is the angle between v and j, we have 





v-j v1,-O+v9-1 V2 











Mil ved 


Example 809. Consider the vector v = (3,2). Its z- and y-direction cosines are 


V1 3 3 sn V2 _ 2 _ 2 
lv] V324+22 V13 








i) 4a 7 
3 == 
4/18 s/13) 


Which means, of course, that its unit vector is V = ( 


v= (3,2) 


2 
#0.588 and £=cos! ~ 0.983. 


ie 
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Example 810. Consider the vector v = (—2,-1). Its x- and y-direction cosines are 


and 


ene eee (Dee 
Ml aysay v? Ml (aye 


-2 -l 
Which means, of course, that its unit vector is V = ( } 


V5 5 





Let a and ( be the angles it makes with the positive x- and y-axes. Then 


-2 —1 
a = cos | —=#2.678 and B=cost —»# 2.034. 
go V5 


Exercise 236. For each vector, find its x- and y-direction cosines, and its unit vector. 


(Gy sk (b) (4,2). (c) €1,2). (Answer on p. 1847.) 
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57. The Angle Between Two Lines 


(Below, Corollary 22 will give the “formula” for the angle between two lines. If you’re a 
well-trained, mindless Singaporean monkey who cares only about “knowing” the “formula” 
without understanding where it comes from, you can skip straight to Corollary 22.) 


“Obviously”, any two non-parallel lines /, and J, in 
2D space must intersect at exactly one point. And 
at this intersection point, two angles are formed. 
Let’s call the smaller angle a, so that the larger 
angle is  =™-a. 


Now, we have a potential ambiguity: When we 
talk about the angle between /; and ly, are we 
talking about the smaller angle a or the larger 
angle {? 


To resolve this ambiguity, this textbook will adopt 
the convention that the angle between two lines is 
the smaller one. So, by this convention, the angle 
between J; and Ij shall be a (and not (3). 


Note that by our adopted convention, the angle 
between two lines will always be acute (i.e. be- 
tween 0 and 7/2, inclusive) and never obtuse. 





We now work towards a formal definition of the angle between two lines. 


Example 811. Given the lines J; and lz, we pick for 
each the direction vectors u and v. 


Observe that 


ae Angle between _ Angle between 


1, and l» u and v. 


That is, a is the angle between the two lines; moreover, 
it is also the angle between the two vectors. 





The above example suggests the following “Definition” for the angle between two lines. 
Given two lines J, and lg, pick for each any direction vectors u and v. Now define 


Angle between — Angle between 


1, and I» u and v. 


This “Definition” works well in the above example, but only because the angle between u 
and v happens to be acute. 


Unfortunately and as the next example illustrates, this “Definition” doesn’t work so well if 
the angle between the two chosen direction vectors is instead obtuse: 
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Example 812. We continue with the same two 
lines as before. We continue to pick the direction 
vector u for the line /;. But this time, we pick 
the direction vector w for the line lo. 


The angle between the two lines remains the 
acute angle a. 


However, the angle between the two chosen di- 
rection vectors is now the obtuse angle (. 


And so, the above “Definition” fails because 


ae Angle between r Angle between ere 


l, and lb» u and w 


In this case, the angle between the two lines, a, is actually the supplement of the angle 
between the chosen two direction vectors, 3. That is, 


a=nm-£. 





Let’s introduce a new term, the non-obtuse angle between two vectors: 


Example 813. The angle between the vectors c and d is 6, which is obtuse. We say the 
non-obtuse angle between them is a =7- {. 


d 


The angle between the vectors e and f is y, which is acute. We say the non-obtuse angle 
between them is also y. 





Formally, 


Definition 148. Let 6 be the angle between the vectors u and v. Then the non-obtuse 
angle between u and v is denoted a and is defined by 


a for 6 not obtuse, 
mB, for 6 obtuse. 


a= 





We are now ready to write down our formal Definition of the angle between two lines: 
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Example 814. Given the lines /; and ly, we pick the direction vectors u and v. 


The angle between u and v is a, which is acute. And so, the non-obtuse angle between 
u and v is also a. So, by Definition 149, the angle between the two lines is a. 


Given the lines /3 and 14, we pick the direction vectors w and x. 


The angle between w and x is 3, which is obtuse. And so, the non-obtuse angle between 
w and x is y=7-{£. So, by Definition 149, the angle between the two lines is ¥. 





We just wrote down the Definition of the angle between two lines. We now work towards 
Corollary 22, which will give us our “formula” for the angle between two lines. 


u-v 
By Definition 145, the angle between u and v is cos | ——. 


It turns out that we can get the non-obtuse angle between u and v simply by slapping |-| 
(the absolute value function) onto the numerator: 


Fact 138. The non-obtuse angle between two non-zero vectors u and v is 


=) Ju-v| 
On >, 
[ul {v| 





Proof. Suppose 0 < 0 < 7/2. Then u-v >0. And so, by Definition 148, the non-obtuse angle 
between u and v is 


_~u-v _, |u-v| 
cos ——— = cos ——. 
|u| |v| |u| |v| 


Suppose instead @ > 7/2. Then u-v < 0. And so, by Definition 148 and the identity 
m—cos! a = cos! (-a) (Fact 101), the non-obtuse angle between u and v is 


, uev _j;-u-v _,|u-v| 











7 — COs 





——— = COS = ’ 
ful |y| jul |y| jul fy| 
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From Fact 138, this “formula” for the angle between two lines is immediate: 


Corollary 22. The angle between two lines with direction vectors u and v is 


cos”! eo 
ju] [v| 


Example 815. Two lines /; and /» are 
described by 


PSs leon) ee) (A€R), 


v2 
—— 


r = (-1,-1) +A (1,3) (A€R). 


By Corollary 22, the angle between the 
two lines is % 


-1 [v1 - v9 = zl |(2, 1) -(1,3)| 


COs = 
[vi] fvo| (2, FIC, 3)| 





ie) = cos = 
V5V'10 v2 


== COS= 





Example 816. ‘Two lines /; and /» are described by 


V1 
— 


r = (0,0) + A(-2,3) and r = (1,0) + (3,1) 


By Corollary 22, the angle between the two lines is 


1 |Vi- V9 -1 [(-2,3)-(8,1)| in) 
=(COs = cos ~—==— = # 1.305. 
[vi] fvo| \(-2,3)||(3, 1)| /1310 





cos 


IX 
Y 
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The following formal definition of when two lines are parallel or perpendicular will supersede 
the definitions given in Ch. 8.5. 


Definition 150. Two lines are (a) parallel if they have parallel direction vectors; and 





(b) perpendicular if they have perpendicular direction vectors. 


Corollary 23. Suppose 0 is the angle between two lines l, and ly. Then (a) 0=0 <=> 
Ly || la; (b) O = 7) 2 — en 





Proof. See p. 1622 (Appendices). 











620, Contents www.EconsPhDTutor.com 


Example 817. Two lines /; and /) are described by 


V1 


r= (2,-2)+(3,3)  and—sr=(1,1) +A(-1,-1) 


By Corollary 22, the angle between J, and /» is 


alvievel 1 18,3)-(L-DI 
Millval “13,311, 





COs 


By Corollary 23, the lines /, and / are parallel. 














Two lines /3 and /, are described by 


V3 
——— SS 


r = (0,3) + A(-1, 2) and 
By Corollary 22, the angle between J, and [» is 


-1 lv3 “va = cog7! le) ; (6, 3)| Sod 
[vs] [va (1, 2)||(6, 3)| 


By Corollary 23, the lines /3 and /, are perpendicular. 


COs 
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We can also define when a line and a vector are parallel or perpendicular: 


Definition 151. A line and a vector are (a) parallel if the line has a direction vector 


that’s parallel to the given vector; and (b) perpendicular if the line has a direction vector 
that’s perpendicular to the given vector. 





Here are two “obvious” Facts you may recall from primary school: 


Fact 139. Suppose two lines are ... 


(a) Identical. Then they are also parallel. 
(b) Distinct and parallel. Then they do not intersect. 


(c) Distinct. Then they share at most one intersection point. 














Proof. See p. 1623 (Appendices). 

















Proof. See p. 1623 (Appendices). 





Example 818. The lines /; and /» are identical. And indeed, they 
are also parallel. 


The lines J, and /3 are distinct and parallel. And indeed they do 
not intersect. 


The lines /; and /, are distinct and non-parallel. And indeed they 
share exactly one intersection point. 


Exercise 237. Find the angle between each given pair of lines. State if they are parallel 
or perpendicular. (Answer on p. 1848.) 


(a) r=(-1,2)+A (-1,1) andr =(0,0)+A (2,-3) (A€eR). 
(b) r=(-1,2)+A (1,5) andr =(0,0) +A (8,1) 
(c) r=(-1,2)+.A (2,6) andr=(0,0) +A (3,2) i 
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58. Vectors vs Scalars 


We now contrast vectors with scalars. We’ll reuse Example 666 from Ch. 45 (Simple 
Parametric Equations): 


Example 819. A moving particle’s position vector s is a function of time f: 
S(4)= (2 (4) xs, (2)) = (cost. sing), tc) 


At time t (seconds, s), the particle is cost metres (m) east (or to the right) of the origin 
(0,0) and sintm metres (m) north of (or above) the origin (0,0). 


For brevity, we'll often omit “(t)”. That is, instead of the above, we’ll often write 
S=(s2s,) = (cost. cing). t>0. 


As time t progresses from 0 to 27t seconds, the particle traces out, anti-clockwise, the 
unit circle: 


(= ees) 1 0) (27 (8 .6,) = (—1,0), 
t=m/4 —> (52, 8y) = (v2/2,V2/2), t= 5/4 => (52, 8y) = (-V2/2,-V2/2), 
eg Oe) OE (30/2 (ee =e 
t= 3n/4 => (82, sy) = (-V2/2,V2/2), t= T/A => (se, 5y) = (V2/2,-V2/2). 


N = 
Arrows indicate y Att=1, 
instantaneous Ss = (s;,5,) * (0.54m,0.84m), 
direction of ae V = (Ue; ¥y) * (-0.84ms7,0.54ms"), 
travel. ‘ 


ee lis -: 
At t =0, 


Direction of S = (sz, Sy) * (1m,0m), 


* aad F Vv = (Uz, vy) ¥ (1ms',0ms"‘), 
| v=ims!. 








(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


The particle’s velocity vector v(t) is the first derivative of s with respect to time tf: 








v(t) = (ve (8) ,¥y (8) = F = (2, St) = (SSS 


ds (= =) = (<= dsint 
dt - 


) = (-sint, cost). 


So, at time t, the particle is travelling in the eastern direction at v, = —sintms (or 


equivalently, in the western direction at sintms~') and in the northern direction at Vy = 
costms !. 


The magnitude of the particle’s velocity vector is denoted v and is called the particle’s 


speed: 
en ee 
Since sin? t + cos*t = 1, for all t, we have 
v =|v|=/u2 +02 = Vsin?t + cos?t = V1 =1. 


Aha! So, interestingly, the particle travels at the constant speed of 1ms !. That is, at 
every instant in time ¢, it is moving lms in its direction of travel. 





We now prove that the particle always moves in a direction tangent to the circle. 
In other words, its direction of travel is always perpendicular to its position vector. 


To do so, we need simply prove that v-s = 0 for all t: 
v-s=(-sint,cost) - (cost,sint) = —sintcost + costsint = 0. J 


Velocity is a vector—it has both magnitude and direction. In contrast, speed is a 
scalar—it has only magnitude. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Similarly, the particle’s acceleration vector is the first derivative of the velocity vector 
(or, equivalently, the second derivative of the position vector): 











dv d’s /dv, du d(-sint) dcost 
a (t) = (ae (4) ,ay(t)) = Se = $3 = (Se, Se) = (A Se 


) = (-cost,-sint). 


So, at time t, the particle is accelerating eastwards at a, = —costms” and northwards 
at ay = ~sintms ’. Or equivalently, it is accelerating westwards at costms? and south- 
wards at sintms ’. (Note that ms stands for metre per second per second.) 


The magnitude of the particle’s acceleration vector, denoted a, is 


a=|al =,/a2 +a? = (—cost)* + (-sint)? = V1=1. 


Aha! So, interestingly, the particle accelerates at the constant rate of lms’. That is, at 
every instant in time t, it is accelerating 1ms~ in its direction of acceleration. 


(Note that for velocity, we gave its magnitude the special name of speed. But in contrast, 
the magnitude of acceleration has no special name. We simply call it the magnitude of 
acceleration. ) 


Above we proved that the particle’s direction of movement is always tangent to the 
circle. Here we can similarly prove that its direction of acceleration is always towards 
the centre of the circle. (Or equivalently, the acceleration vector points in the exactly 
opposite direction as the position vector.) To do so, we need merely observe that the 
acceleration vector a = (-cost,-sint) and the position vector s = (cost,sint) point in 
exact opposite directions.°”? 

Next, suppose the particle’s mass is m = 1kg. Recall (physics) Newton’s Second Law:3° 


Vector Scalar Vector 
——— 


Force = Mass x Acceleration or Sopa. 


Note that mass is a scalar quantity. So, force, being a product of a scalar and a vector, 
is itself a vector quantity. 


The force vector points in the same direction as the acceleration vector (i.e. towards the 
centre of the circle). Moreover, it has constant magnitude: 


F| = [mal = |m| lal = (1kg) x (1ms~) = 1kgms~ = 1N, 


where N stands for newton (the SI unit for force and which is equal to kgms~’). 


Physicists call such a force (which results in circular movement) a centripetal force. 








32°Note though that it would be wrong to write a = —s. This is because acceleration is measured in ms”, 
while position is measured in m. 
330See e.g. Exercise 188. 
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59. The Projection and Rejection Vectors 


Let a and b be vectors. 


Motivation: We’d like the projection of a on 
b, denoted proj,,a, to be the vector that is®*! 





e Parallel to b; and 

e Perpendicular to a — projj,a (this vector is 
depicted in blue and will be called the re- 
jection vector). 


a—projpa 





Let’s work towards a formal definition of proj,a. projpa b 
First, since proj,a || b, by Definition 138, we must have projj,a = kb for some k #0. 
Next, the length of proj,a is |k|. But what is k? 


In the right triangle above, the hypotenuse corresponds to the vector a, while the base 
corresponds to the vector proj,a. If @ is the angle between these two vectors, then by our 
right-triangle definition of cosine, 


Adjacent _ |proj,al 1 [Ay 


¢ = — 2 
as Hypotenuse lal lal 
-b b 
But by Definition 145, we also have ie 
lal[b] al 
kl a-b P 
Plug = into = to get la a atlas or |k|=a-b. 
jal al 


The above discussion motivates this definition of the projection vector:** 


Definition 152. Suppose a and b # O are vectors. Then the projection of a on b, denoted 
projpa, is this vector: 


; a- 
projpa = (a. or equivalently, projpa = be” 





We'll also call the blue vector the rejection vector: 


Definition 153. Suppose a and b # 0 are vectors. Then the rejection of a on b, denoted 
rejpa, is this vector: 


rej,pa = a — projpa. 








33! These two properties are to hold so long as proj,,a and a — projj,a are non-zero. 

332TThis definition imposes only the condition b + 0 to ensure proj,a is well-defined. It doesn’t impose the 
condition a # 0 (even though the foregoing motivating discussion implicitly assumed this). So, we are 
allowed to speak of the projection of 0 on any non-zero vector b—but of course, such a projection is 
always simply 0. 
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From the above two Definitions, Fact 141 is “obvious”: 


Fact 141. Leta and b #0 be vectors and projp,a = (a b) b. 


(a) Ifa- b> 0, then proj,a is a positive scalar multiple of b. 
(b) Ifa- b <0, then projp,a is a negative scalar multiple of b. 
(c) Ifa-b=0, then proj,a = 0 and rejpa = a. 





Geometric interpretation of Fact 141: Suppose 6 is the angle between a and b. Then 
(a) If @ is acute, then proj,,a points in the same direction as b. 


(b) If @ is obtuse, then proj,,a points in the exact opposite direction as b. 


(c) If 6 is right (ie. if a 1b), then projy,a = 0 and rej,a =a. 






“— rejpa=a-—projpa 





projp,a b proj,a b 





proj,pa =0 b 


Above we already argued that projja has length la b|. We now formally prove this: 


Fact 142. Suppose a and b + 0 are vectors. Then 


|proj,al = la b| , 





Proof. By Definition 152 and Fact 121, 


A 


=|a-b]-1=|a-b : 





b 











|projpal = (a b) b| = la -b 
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Example 820. Let u = (3,2) and v = (1,1) be vec- 
tors and proj,u be the projection of u on v. Then 


(3,2)-(1,1)|_3+2_ 5 
(eR ay oe 


[proj ul = |u- v| = 


Now consider w = (-2,-2)—it points in the exact 
opposite direction from v and has twice the length. 
It turns out that the projection of u on w, proj,,u, 
is identical to proj,u. 


We can easily verify that |proj,,u| = |proj,ul: 


(3, 2) : (-2, -2) 
|(-2, -2)| 


pee e | 





-|=— -|=2 5 
t v8 | [val v2 





As the above example suggests, if v and w are parallel, then the projections of any vector 
u on v and w are identical. Formally, 


Fact 143. Let u, v, and w be vectors. If v || w, then 





proj,,u = proj,,u. 


Proof. If v || w, then by Fact 124, ¥=+w. And so, 














proj,u = (u- Vv) ¥ = [u- (+w)] (4W) = (u- W) Ww = proj,,u. 
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Example 821. Let a = (-6,1) and b = (2,0) be vectors and proj;,a be the projection of 
aon b. Then 


eae0|— eae ee 
Com a 


pproinal = fab] = 


=6/ 


a = (-6,1) 
So bi=(2,0) 


projp,a 


Now consider c = (3,0)—it points in the same direction as b, but is half again as long. 
By Fact 143, the projection of a on c is the same as that of a on b. That is, 


proj,a = projpa. 


We can easily verify that |proj,a| = |proj,a| = 6 


|proj.a| = |a-é| = 








(=6,1)-@,0) =f a 7 
(3,0), | : ; 


a = (-6,1) 
lo c= (3, 


Fact 143 can help us simplify some calculations: 





Example 822. Let u = (5,-7) and v = (51V347,68V347). What is [projyu| (ie. the 
length of the projection of u onto v)? 


Here it seems that the calculations will be pretty tedious. But observe that v is a multiple 
of w = (3,4). So, v || w. Hence, by Fact 143, 


proj,U = proj,,u. 


Thus, instead of computing |proj,u|, we can simply compute |proj,,ul: 











|proj,u| = |proj,,ul = |u- w| = 


I(3, 4)| 


Exercise 238. Find the lengths of the projections of (Answer on p. 1848.) 


V3? + 42 


_|G,-7)-@,4) |= ~28 8 


(a) (1,0) on (33,33); and (b) (33,33) on (1,0). 


(c) Explain if this statement is true or false: 





“Given any vectors a and b, |proj,al = |proj,b].” 
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60. Collinearity 


Definition 154. Two or more points are collinear if some line contains all of them. 


Example 823. The points A and B are collinear. 


AB 


r= OA+)AB (\€R) 











“Obviously”, any two points must be collinear. Indeed, given any two points, there is a 
unique line that contains both of them: 


Fact 144. Suppose A and B are distinct points. Then the unique line that contains both 
A and B is described by 


r= OA+)AB, (\€R). 





Proof. First, plug in \ =0 and A = | to verify that the given line contains A and B. 


Next, this line is unique because any line that contains both A and B must have direction 











vector AB and so must be described by the given vector equation. 





In contrast, three distinct points could be collinear but won’t always be: 


Example 824. The points A, B, and C are collinear: 


A, B, and C are collinear. AB r=a+)\AB (\ € R) 





A 
eo 





In contrast, the points D, FE, and F' are not: 


D, E, and F are not collinear. ee ADE (\€R) 
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Here’s one possible procedure for checking whether three points are collinear: 


1. First use Fact 144 to write down the unique line that contains two of the three points. 


2. Then check whether this line also contains the third point. 


Example 825. To check if the points A = (1,2), B = (4,5), and C = (7,8) are collinear, 


1. First write down the unique line that contains both A and B: 


r= OA+ AB = (1,2) + A(3,3), (\€ R). 


2. This line also contains C' if and only if there exists \éR such that 


i A 
ae 2 2 eae, 
8 =2+4+3X. 


As you can verify, \ = 2 solves the above vector equation (or system of two equations). 
So, our line also contains C’. Hence, A, B, and C’ are collinear. 
Example 826. To check if the points D = (1,0), E = (0,1), and F’ = (0,0) are collinear, 
1. First write down a line that contains both D and EF: 

£200 #1De = (1,0) ait), (\€R). 
2. This line also contains F' if and only if there exists \ eR such that 


041-1), 
Wea 


From S, we have \ = 1, which contradicts 2. So, there is no solution to the above vector 
equation (or system of two equations). 


Hence, our line does not contain F’. Thus, D, &, and F' are not collinear. 
Exercise 239. In each, determine if the points A, B, and C are collinear. 


(a) A=(@,1), B=(1,6), and C = (0,=1). 
(b) Aa, 2), 2 = (0,0) nand'C =.(3, 6). (Answer on p. 1849.) 
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61. The Vector Product 


Definition 155. Let u = (uj,u2) and v = (v1,v2) be vectors. Their vector product, 
denoted u x v, is this number: 


UX V = U1V2 — UV. 


Example 827. Let u = (5,-3), v = (2,1), w= (-4,0), and x = (8,7). Then 


1 


|. ( ot) 8-0-9) -0- 12-12 


|. (* | -5-7-(-3)-8= 35424 =59, 


SON Sey = of cd) Ae 
| «(Gi J-20 1-(-4)=04+4=4. 


We now discuss three properties of the vector product. 


=5-1-(-3)-2=5+6=11, 





Recall that multiplication and the scalar product are both distributive (over addition) 
and commutative. It turns out that the vector product is also distributive: 


Example 828. Continuing with the above example: 


5 ie a ee ay ae ar 
ux (v+w) i (Fh Jeo (-3) - (-2) 


-1=11+(-12)=uxvtuxw. 


| ie |. (| Se ae) 


-34+1 0 


=-8=-12+4=uxwdivxw. 
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However, the vector product is not commutative. Instead, it is anti-commutative: 


axb=-bxa. 


Example 829. Let u = (5,-3), v = (2,1), and w = (-4,0). We already showed that 


uxve=ll and w xu=-l2. 


We now show that v x u=—-11 and wx u= 12: 


vxue= (2,1) x (5,-3) =2-(-3)-1-5=-6-5=-11. 





w xu = (-4,0) x (5,-3) (-4)- (-3) -0-5=12-0=12. 
The third property is that a vector’s vector product with itself is 0: 
Example 830. Continuing with the above example, we have 
uxu = (5,-3) x (5,-3) =5-(-3) - (-3)-5 =-15+15=0. 
vxv= (21) * @.) 2-1-1-2-2-2=0, 


w x w = (-4,0) x (-4,0) (-4)-0-0-(-4) =0-0=0. 





In summary, 


Fact 145. Suppose a, b, and c are vectors. Then 

(a) ax(b+c)=axbtaxc (Distributive over Addition) 
(b) axb=-bxa (Anti-Commutative) 

(c) axa=0 (Self Vector Product Is Zero) 














Proof. See Exercise 240. 





Exercise 240. Let a = (a1,a2), b = (b1,b2), and c€ = (c1,c2) be vectors. Prove the 
following (Fact 145): (Answer on p. 1850.) 


(a) ax(bt+c)=axb+axc (b) axb=-bxa (c) axa=0 
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Fact 145(c) says that a x a = 0 and yields this result: 


Corollary 24. /fa || b, then axb=0. 


Proof. If a || b, then there exists c # 0 such that ca = cb. So, 











ax b=ax (ca) =c(axa)=c:0=0. 





Example 831. Let a = (1,2) and b = (-2,-4). Since a || b, by Corollary 24, ax b=0. 





It turns out that the converse of Corollary 24 is also true: 


Fact 146. Let a and b be non-zero vectors. Ifaxb=0, then a|| b. 


Proof. See p. 1628 (Appendices). 














Example 832. Let a = (3,-1) and b = (2,k), where k is some unknown constant. We 
are now told that ax b=0. What is k? 


By Fact 146, ax b = 0 implies a || b. So, b is a scalar multiple of a. 
Hence, 2/3 = k/(-1). And so, k = -2/3. 





For future reference, let’s combine Corollary 24 and Fact 146: 
Corollary 25. Suppose a and b are non-zero vectors. Then 


axb=0 <= ab. 





Here’s another “obvious” property of the vector product: 
Fact 147. Suppose a and b are vectors andcéeR. Then 


(ca) x b=c(axb). 





Proof. Let a = (a1, a2) and b = (01, b2). Then ca = (ca,, caz) and 





(ca) x b = (cay) - by — (cag) by = c(ayb2 - agb,) = c(axb). 











Exercise 241. Let a = (1,-2), b = (3,0), and c = (4,1). Compute ax b, axc, bxc, 





bxa,cxa,cxb,andax(b+c). (Answer on p. 1850.) 


634, Contents www.EconsPhDTutor.com 


61.1. The Angle between Two Vectors Using the Vector Product 


Recall (Fact 132) that a-b = |al| |b] cos 6. 


With the vector product, we have a very similar result: 


Fact 148. Suppose @ is the angle between the vectors a and b. Then 


|a x b] = |a| |b] sin @. 








Proof. See Exercise 242. 











Exercise 242. Let @ be the angle between the vectors a = (a), a2) and b = (01, be). 


(a) Express |al, |b], |a x b], and cos@ in terms of a1, a2, bj, and by. (You do not need to 
expand the squared terms.) 


(b) Since 6 € [0,7], what can you say about the sign of sin@? (That is, is sin@ positive, 
negative, non-positive, or non-negative?) 
(c) Now use a trigonometric identity to express sin # in terms of cos@. (Hint: You should 


find that there are two possibilities. Use what you found in (b) to explain why you 
can discard one of these possibilities. ) 


(d) Plug the expression you wrote down for cos@ in (a) into what you found in (c). 


(e) Prove??? that (aj + az) (b; - b3)-(ayby + ayb9)” = (ayby — ab)”. (Hint: Simply expand 
the terms and do the algebra.) 


(f) Use (a) and (d) to express |a||b|sin@ in terms of a1, a2, 61, and b2. Then use (e) to 
prove that |a x b] = |a| |b] sin é. (Answer on p. 1850.) 





Take |a x b| = |a||b|/sin@ and rearrange: 





Now apply sin”! to get this result:3°+ 





333By the way, this is simply an instance of Lagrange’s Identity. 

34Corollary 26 adds the condition that 6 € [0,7/2] (whereas the angle @ between two vectors can more 
generally be between 0 and 71, inclusive). The reason for this addition is that Range sin“! = [-7/2, 7/2], 
so that if 0 € (7/2, 7], then clearly the conclusion of Corollary 26 can’t be true. 

_,laxbj 
lal [by ° 


In fact, if 6 € (7/2, 70], then we instead have 6 = 7- sin as we now show: 


Proof. Suppose 6 € (7/2,7]. Let 6 =1-6€[0,7/2). Then sin@ = sin(m- 3) =sin 8. 
la x _, lax bl 


b 
But sin? = ——— =sin{. Since (€ [0,7/2), 6 =sin 
lal [b| lal [b| 


b 
. Hence, 0= 1-6 =n7-sin! ieee | 
la] |b| 




















635, Contents www.EconsPhDTutor.com 


Corollary 26. Suppose 0 € [0,7/2] is the angle between the vectors a and b. Then 


a _;|axbl 


gh ihe ; 
|a| |b| 





Corollary 26 is thus the sine or vector product analogue of Definition 145. 


Note though that we won’t be using Corollary 26 to compute the angle between two vectors. 
This is because, as we'll see shortly, it’s usually easier to compute the scalar product than 
the vector product.®*’ And so, it’s easier to just use Definition 145. 





33°Plus, as noted in the previous footnote, Corollary 26 only covers the case where 6 € [0, 7/2]. 
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61.2. The Length of the Rejection Vector 


The vector product will be more useful only when we look at 3D space. Nonetheless, even 
in 2D space, it has this use: 


Fact 149. Suppose a and b are vectors. Then 





lrej,al = la x b| 






rejpa = a-—projpa 


projpa b 


So, Fact 149 is the vector product analogue of Fact 142. 


Proof. By the right-triangle definition of sine,*°° 


Opposite _ |rej,al 





sin 9 = ——————_ = 
Hypotenuse la| 


Rearrange, then use b| = 1 and Fact 148: 














|rej,a| = [a] sin 0 = |al b| sin 6 = la * b| 


As we'll see next, Fact 149 will help us compute the distance between a point and the line. 





336For a proof that makes no mention of the sine function, see p. 1627 (Appendices). 
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62. The Foot of the Perpendicular (from a Point to a Line) 


We reproduce from Ch. 15.1 this result: 


Corollary 4. Suppose A is a point not on the line 1. Then there exists a point B that 


is both (a) the unique point on | that’s closest to A; and (b) the unique point on | such 
that 1 1 AB. 





Recall that we also call the point B the foot of the perpendicular from A to /. Definition 
reproduced (also from Ch. 15.1): 


Definition 156. Let A be a point that isn’t on the line 1. The foot of the perpendicular 


from A to lis the (unique) point B on / such that AB 1 I. 





Earlier in Part I (p. 181), we didn’t fully prove Corollary 4.°°° Now, armed with the 
language of vectors, we can do so quite easily. The proof below 
e relies on several previously hard-fought results involving the scalar product; 


e may seem a bit abstract and difficult, but only because it involves a bit of vector algebra 
(if you’ve had as much practice with vector algebra as you’ve had with “usual” algebra 
since primary school, this proof wouldn’t seem difficult at all); 


e is general in that it proves the claim not only in the case of 2D space, but also for 3D 
and all higher dimensional spaces. 


Proof. Describe | by R= P+.,v (A€R). 
So, the vector from A to a point R on I is AR = AO+OR=OP+%,.v-OA= AP +), v. 


. (AP + A,v)] (AP + Av) =4/ AP) + 2\,AP -v+ Iv’. 


Since \/- is increasing, the above surd expression is minimised when the quadratic inside is 
minimised. And by Fact 34(d), the quadratic is minimised at 


Hence, JAR = [AP +A;V 





“ae QAP -v _ Ais 


a awh ve 





‘Vv 

Wwe on / that minimises the distance 
Vv 

between A and 1. In other words, S is the point on / that’s closest to A. 


Thus, there exists a unique point S = P+A,v = P- 


Next, AR-v = (AP +,v)-v = AP-v+),|vp- 





So, AR 11 «> ARiv — AR-v=0 <= AP-v +|v/ =0 —> M=-AP-v/|v/’. 


Hence, there exists a unique point 7 = P+;v onl such that AT’ 1 I. 


Now, observe A, = x. So, S = T is the point B with the claimed properties (a) and (b). 





337We did in the Appendices but not in the main text. 


638, Contents www.EconsPhDTutor.com 


We reproduce from Ch. 15.1 this result: 


Corollary 5. Let! be the line ax+by+c=0. Suppose A is a point not onl. Then the 
point B that is both (a) the unique point on | that’s closest to A; and (b) the unique 
point on | such that! 1 AB is 


( ap+bq+c oe) 
B=|p-a———_., ¢ - b ————_ 
a? + b? a? + b? 





(To repeat our warnings from Ch. 15.1, do not try mugging the above result. Instead, 
understand and remember the methods by which we can derive the above result—to be 
reviewed in the examples below.) 


We can now restate the above result in the language of vectors: 


Fact 150. Let 1 be the line R= P+Av (AER). Suppose A is a point not onl. Then the 
point B that is (a) the unique point onl that’s closest to A; and (b) the unique point on 
l such that 1 1 AB is 


— 
B= P +projPA. 





PA-v _ AP-v 
VS =e 
Iv| Iv| 


: x ; * AP-v 1 -, ah 
Taking = from the previous proof, we have B = P - Wwe = P+proj,PA. 
Vv 


a 9. aaa al 
Proof. By Definition 152, proj, PA = 














Fact 150 gives us an intuitive and easy-to-remember method for finding the point B: 
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Example 833. Let A = (1,2) be a point and / be the line r = OP +\v = (0,1) + A(9, 1) 
(A ¢R). Let B be the point on / that’s closest to A (B is also the foot of the perpendicular 
from A to l). 


B 
v = (9,1) 


We'll find B using four methods: P = (0,1) 
Method 1 (Projection Vector). Compute we (12) =(0,1) = (141) and 


= Ceajao GV geet 5 
ley 1,1) = —~--— = j= == (601) 
proj, PYOJ(9,1) ( ’ ) Q2 + 12 l 89 (9; ) 4] (9, ) 


Now simply apply Fact 150: 


= 5 if 
B= P j,PA = (0,1) + — (9,1) = — (45,46). 


Method 2 (Quadratic-Cartesian). In Part I, we learnt to find B this way: 
The line / has cartesian equations x = 9A and y= 1+A—or, y=2/9+1. 


Pick any arbitrary point R = (r,r/9+1) on lL. 


The distance between A and R is \/(1-r)’ + (2-1r/9-1)° = \/82r2/81 — 20r/9 + 2. 


This last surd expression is minimised when the quadratic expression inside is minimised. 
And by Fact 34(d), this occurs when 


op -20/9 90 45 


oe Pe 


ee 45 46 
H 5? ee (pac eae, | (ee (catia (, 
ee (== us (=5) 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Method 3 (Quadratic-Vector). Pick any arbitrary point R = (0,1) + A(9,1) on U. 
Then the distance between A and R is 


|AR| = |OR - OA] =|(0, 1) + (9,1) - (1,2) = |@A-1,A-D 


=4/ (9\=1)°+ (A=1)* = V8202 = 200 + 2. 


As usual, this last surd expression is minimised when 





Eee 


Oa” 9x 82 AL 


) 1 
H B=(0,1) + —(9, 1) = — (45, 46). 


Method 4 (Scalar Product). Let B = (0,1) + \4(9,1). Then AB = OB =O4 = 
Cai =) 


We have AB 11 or AB 1 (9,1) or AB- (9,1) =0 or 


0 = (9A, — 1, A» - 1) - (9, 1) = 81A0 - 9 + Ay — 1 = 82Az - 10 or Ay = 10/82 = 5/41), 


i) 1 
H B=(0,1) + —(9, 1) = — (45, 46). 


Exercise 243. Find the feet of the perpendiculars from the points A = (-1,0) and 
== 
B = (3,2) to the line r= OP + Av = (2,-3) + 4(5,1) (AER). (Answer on p. 1851.) 
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62.1. The Distance Between a Point and a Line 


We reproduce from Ch. 15 this definition: 


Definition 157. The distance between a point A and a graph G is the distance between 


A and B, where B is the point on G that’s closest to A. 





AQ 






The line 1 B, the foot of the perpendicular, 


is also the closest point. 


Let B be the point on the line J that’s closest to the point A (and is also the foot of the 
perpendicular from A to /). 
Then by the above definition, the distance between A and / is simply the distance between 


A and B—or equivalently, the length of the vector AB. Let’s jot this down as a formal 
result: 


Corollary 27. Suppose B is the point on the line | that’s closest to the point A. Then 





—> 
the distance between A and I is [AB]. 
We revisit Example 833: 


Example 834. Let A = (1,2) be a point and 1 be the line 
r= OP+Av = (0,1) +A(9,1) (A€R). Let B be the point on / 
that’s closest to A. 


We'll find the distance between A and / using five methods. 


Methods 1-4 will be the same as the four methods we used ; 
earlier for finding B. Pei 1) 


Method 5 (new) uses the vector product. 
Method 1 (Projection Vector). Compute PA= (1.2) = (0,1) —(1,1)-and 


aah : Liye (9, L749 9+1 
projyPA = projig 1) (1,1) = | 1 |-%3 = (9, 1) = > (9, IDR 


men 1 
Now by Fact 150, B = P+ proj, PA = (0,1) + 2 (Ql) = 7 (45, 46). 


So, the distance between A and / is 


es 4 
AB] -|5 (4, -36)|= 2 =|a.-9I- <P + (-9)' = {VS » 0.888 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


With Method 1, we first explicitly find 6B, then compute |AB . 


In contrast, with Methods 2-5, we’ll find |AB| without explicitly finding B (but of course, 
if we wanted to, we could also easily find B): 


Method 2 (Quadratic-Cartesian). The line / has cartesian equations x = 9A and 
y=1+A—or, y=2/9 +1. 


Now, pick any arbitrary point R = (r,r/9+1) on I. 





The distance between A and R is \/(1-r)” + (2—r/9-1) = \/82r2/81 — 20r/9 +2. 


As usual, this last surd expression is minimised when 


tb -20/9 _ 90 45 


So era 


So, the minimised distance or more simply distance between A and I is 


25 100 50 50 32 
ye ee eas, 
“4. 41° a 7 iia 


Method 3 (Quadratic-Vector). Pick any arbitrary point R = (0,1)+A(9,1) onl. The 
distance between A and R is 


|AR| = |OR - OA] =|(0, 1) + A(9,1) - (1,2) = |@A-1,A-D 


=f (9A-1)° + (A-1)* = V82A? - 20) + 2. 


As usual, this last surd expression is minimised when 


She —20 5 


~ 2%g? «2x 82 AL 


So, the minimised distance or more simply distance between A and / is 


2 
2x25 1 2 
2 =) -20(>-) +2- eS oe 
41 41 41 41 41 41 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Method 4 (Scalar Product). Let B = (0,1) + \4(9,1). Then AB = Op = 0A = 
Ge iy i, 


We have AB 11 or AB 1 (9,1) or AB- (9,1) =0 or 


0 = (9A, - 1, Ay-1)- (9, 1) = 8144-9 + Ay- 1 = 82Ay,- 10 Np = 10/82 = 5/41. 


So, the distance between A and / is 


|AB| = Vf (9X9 - 1)" + Cg - 1)? = 1/8202 - 2029 + 2 


2 
2(=] -20(=) + = Bee Seas eee 
Al Al Al 41 Al Al 
Method 5 (Vector Product). Recall that AB is rejyPA—the rejection of PA on v. 
Recall also Fact 149: rej, PAl = IPA x 4. So, 


[AB] = |rej PA = |PA x o| = |(1,1) x (9, D/Ve?+ P| 


= [1 x1-1 x 9] /V82 = 8/V82 x 0.883. 





Methods 1-4 don’t yield any neat and easy-to-remember general formulae, so we won’t 
bother writing down any general formal result for them. 


But Method 5 does, so let’s jot down this formal result: 


Corollary 28. Let A be a point. Suppose | is the line R= P+Av (AER). Then the 
=—- 
distance between A and I is |PA x «|. 





Another example illustrating all five methods: 


Example 835. XXX 


Exercise 244. In each of the following, a point A and line / are given. Let B be the foot 
of the perpendicular from A to /. Find B and also the distance between A and I. 


The point A The line / Answer on p. 


(a) (73) r = (8,3) + A(9,3) 1852. 
(b) (8,0) Contains the points (4,4) and (6,11) 1853. 
(c) (8,5) r = (8,4) + (5,6) 1854. 
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So far, we’ve looked only at two-dimensional (2D) space 
(or the cartesian plane). In the remainder of Part III, we’ll 
look instead at three-dimensional (3D) space. 


I will also often make use of Paul Seeburger’s CalcPlot3D.°°° 
Whenever you see a tiny version of this icon: 


a~ NN 





click/touch it??? and you’ll be brought to the relevant 3D 
graph, where you can pan, zoom, rotate, etc., so as to get a 
better sense of what the 3D graph looks like. 


(2021-11-06: Unfortunately Seeburger moved his web- 
site and all of these links are now broken. I’ll fix them 
when I get the time.) 





338T’ve examined dozens of 3D graphing software and all things considered (user-friendliness, accessibility, 
features, etc.), this is the best 3D graphing web app I’ve found so far. Please let me know if you know 
of any other better software/app. (I was gonna use GeoGebra, but it had too many critical flaws.) 

339Note that you'll be routed through TinyURL.com first. The reason is that the CalcPlot3D links are 


often thousands of characters long and were confusing my computer. 
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63. Three-Dimensional (3D) Space 


In 2D space, we had ordered pairs. In 3D space, we'll instead have ordered triples:*“° 


Definition 158. Given an ordered triple (a,b,c), we call a its first or x-coordinate, b its 
second or y-coordinate, and c its third or z-coordinate. 


Example 836. The ordered triple (Cow, Chicken, Dog) has 2-coordinate Cow, y- 
coordinate Chicken, and z-coordinate Dog. As with ordered pairs, the order of the 
coordinates matters. So for example, 


(Cow, Chicken, Dog) + (Cow, Dog, Chicken) # (Chicken, Cow, Dog) . 


In contrast, with a set of three elements, order doesn’t matter: 
{Cow, Chicken, Dog} = {Cow, Dog, Chicken} = {Chicken, Cow, Dog}. 


Example 837. The ordered triple (2,5,-7t) has x-coordinate 2, y-coordinate 5, and 
z-coordinate -7t. Again, order matters, so that for example, 


(2,5, -7) # (2,-7,5) # (5, 2,-71). 
Again, in contrast, with a set of three elements, order doesn’t matter: 


{2,5,-7} = {2,-7, 5} = {5, 2, -71} . 





In 2D space, a point was simply any ordered pair of real numbers. Now in 3D space, 


Definition 159. A point is any ordered triple of real numbers. 


Example 838. The ordered triple (Cow, Chicken, Dog) is not a point because at least 





one of its coordinates is not a real number. (Indeed, all three aren’t.) 


Example 839. The ordered triple (2,5, -71) is a point because all three of its coordinates 
are real numbers. 





40For the formal definition of an ordered triple (and n-tuple), see Definition 267 (Appendices). 
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In 2D space (the cartesian plane), we could depict points y 4 
(ordered pairs of real numbers) by drawing on a piece 














of paper. The x-axis went right and the y-axis up. A= (2,1) 
A e 
y A= (a1, a2, a3) 
> 
ue 
ig gap art en esn sees pr ecre cca ss 
re | Cereeeeee oe 

In 3D space, we can again depict points 
a (now ordered triples of real numbers) by 
i. oe e drawing on a piece of paper. Again, the 
! a x-axis goes right and the y-axis up. But 
A oe ere now, we also have the z-axis, which “comes 
out of the paper towards your face” and is 

O = (0,0,0) re] perpendicular to both the xz- and y-axes. 





We say that this coordinate system follows the right-hand rule. To see why, have the 
palm of your right hand face you. Fold your ring and pinky fingers. Have your thumb point 
right, your index finger up, and your middle finger towards your face. Then these three 
fingers correspond to the 2-, y-, and z-axes. (Try it!) 


(If instead the z-axis “goes into the paper away from your face”, then our coordinate system 
would instead follow the left-hand rule. Can you explain why?) 


In 2D space, the origin was the point O = (0,0) (Definition 37) and was where the x- and 
y-axes intersected. And the generic point A = (a), a2) was a, units to the right and a2 units 
above the origin. 


Analogously, in 3D space, 


Definition 160. The origin is the point O = (0,0,0). 


In 3D space, the origin is where the x-, y-, and z-axes intersect. And relative to the origin, 
the generic point A = (a1, d2,a3) is a, units right, a2 units up, and a3 units “out (towards 
your face)”. 
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63.1. Graphs (in 3D) 


In 2D space, a graph was any set of points, where points were ordered pairs of real numbers 
(Definition 38). 


This remains true in 3D space, where a graph is any set of points (the only difference being 
that points are now ordered triples of real numbers). We reproduce from Ch. 7.3 this 
definition: 

Definition 38. A graph (or curve) is any set of points. 

Example 840. XXX 


Example 841. XXX 


In 2D space, the graph of an equation was the set of points for which the equation was true 
(Definition 39). 


This remains true in 3D space (again, the only difference between that points are now 


ordered triples of real numbers). We reproduce from Ch. 7.4 this definition: 


Definition 39. The graph of an equation is the set of points for which the equation is 
true. 


Shortly, we’ll learn about the equations (and systems of equations) used to describe planes 
(and lines). For now, here are two quick examples: 
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Example 842. Consider the equation: 


The plane q 
gt+y+ze=l1. 


It turns out that this equation describes 
a plane q in 3D space. (We'll learn more 
about this in Ch. 70.) 


Specifically, q contains exactly those 
points (x,y,z) that satisfy 


et+y+ze=l. 
So for example, it contains the points 


(1,0,0), (0,1,0), and (0,0, 1). 
A little more formally, the plane gq is a set: 


q={(2,y,z):c2€R ye R,zeR r+yt+z=1}. es) 


In words, q is the set of ordered triples (x,y,z) such that x, y, and z are real numbers 
satisfy r+y+z=1. 


As with ordered pairs, we'll usually look only at ordered triples of real numbers, i.e. 
points. And so, we’ll be a little lazy/sloppy and not bother mentioning that x, y, and z 
are real numbers. That is, we'll usually more simply write 


q={(2,y,2)i@tytz=1}. 





In words, q is the set of points (x,y,z) such that x, y, and z satisfy r+ y+z=1. 
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Example 843. Consider this system of (two) equations (in three variables): 
C= Sand P=. 
Or equivalently and more simply, tha. 


It turns out that this system of (two) equations describes a line / in 3D space. (We’ll 
learn more about this in Ch. 67.) The line / contains exactly those points that can be 
written as (A, A, A), for some real number 1. © 


So, for example, it contains the points (1,1,1), O = (0,0,0), and (-1,-1,-1). 


A 
Yy 


The line 1 








A little more formally, the line / is a set: 


Lace) ea y= et. =) 


In words, I is the set of ordered triples (x,y,z) such that x, y, and z are real numbers 
that satisfy x = y = z. 


As per © above, we can also write 
l={Q,4,A):A€R} = {AC,1,1):A€R}. 


In words, | is the set of points that can be written as (A, A, A) or A(1,1,1) for some real 
number 4. 
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64. Vectors (in 3D) 


We now give the basic definitions and results concerning vectors in 3D space. Everything 
we learnt about vectors in 2D space finds its analogy in 3D space. (Indeed, we'll simply 
reproduce verbatim many of the definitions and results from before). 


Most of the time, the analogy is obvious. We will therefore go fairly briskly in this chapter. 


Definition 161. Given the points A = (a@1,a2,a3) and B = (b,, b2, bs), the vector from A 
— 
toB is Ab = (b; —- a1, by a az, b3 ae a3). 


Example 844. The vector from the point A = (1,5,0) to the point B = (-2,6,3) is 


AB =(-3,1,5)= 


(Observe that there are, again, at least four ways to denote a single vector.) 


A 
y 


AB = (-3,1,3) 
(The vector 
from A to B) 
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Definition 125. A scalar is any real number. 


We may again contrast vectors with scalars: In 3D space, vectors are three-dimensional 
objects, while scalars are one-dimensional. 


Definition 127. Given a point A, its position vector is the vector OA. 





—_> 
And so, the point A = (a1, a2,a3) has position vector OA =a = (a4, do, a3). 


Example 845. The point A = (1,5,0) has position vector Oia = (1,5,0). 





Once again, do not confuse a point (a zero-dimensional object) with a vector (a three- 
dimensional object). 


Definition 128. The zero vector, denoted 0, is the origin’s position vector. 





And so, in 3D space, the zero vector is 0 = OO = (0,0,0). 


Definition 129. Suppose a moving object starts at point A and ends at point B. Then 
—_ 
we call AB its displacement vector. 


And so, if a moving object starts at A = (a,,a@2,a3) and ends at B = (01, b2,b3), then its 
— 
displacement vector is AB = (b; — a4, b2 — a2, b3 — a3). 


Exercise 245. Let A = (2,5,8) and B = (0,1,1) be points. 


(a) What is the vector from A to B? 


(b) What are the position vectors of A and B? 


(c) If a particle starts at A, travels to B, then travels back to A and stops there, then 
what is its displacement vector? (Answer on p. 1855.) 





652, Contents www.EconsPhDTutor.com 


64.1. The Magnitude or Length of a Vector 


In 2D space, the length of a vector u = (uw, u2) was defined as |u| = \/u? + u2 (Def. 126). 


Our Definition of a vector’s length in 3D space is very much analogous: 


Definition 162. Given the vector u = (wu, U2, U3), its magnitude or length, denoted |ul, 
is this number: 


jul = \/u? + ue + ud. 


Example 846. If u = (1,2,3), then the length of u is jul = V1? + 22 + 3? = V14. 





y To see why the above definition makes 
D sense, pick any point A = (a1, a2, a3). 
a \ aa It is natural to define the length of the 
ei = 
B= (0,a9,03) vector OA to be the length of the line 


——_* ie (a1,02,09 segment OA, i.e. |OAI. 


Our goal then is to find |OA]. To do so, 
we first consider the point B = (0, a2, a3). 


x Observe that the line segment OB is the 
7 > hypotenuse of the right triangle ODB. 
Yay And so, by Pythagoras’ Theorem, 


JOB| = \/a3 + a2. 


Next, the line segment OA is the hypotenuse of the right triangle OBA. Moreover, |B A| = 
a,;. And so, again by Pythagoras’ Theorem, 


2 
JOA| = \/ |BA)? + |OB/ = \/ ait (Va + a3] =\/a?+a3+ ai. 


This completes our explanation of why the above Definition makes sense. 








He 


= (0,0, 0) 





As before, the length of every vector must be non-negative. Moreover, a vector has zero 
length if and only if it is the zero vector: 


Fact 117. Suppose v is a vector. Then |v|>0. Moreover, |v|=0 <> v=0. 


Exercise 246. Let A = (2,5,8) and B = (0,1,1) be points. What is the length of the 
vector from A to B? (Answer on p. 1855.) 
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64.2. Sums and Differences of Points and Vectors 


As before, 


Point + Point = Undefined. 
Point — Point = Vector. 
Point + Vector = Point. 
Point — Vector = Point. 


= 





And so, given the points A = (a1,@2,a3) and B = (by, bo, bs), their difference is 


B-A=AB = (by - a, by —a9,b3 — a3). 


Example 847. Given the points A=(1,5,0) and B = (-2,6,3), their difference is the 
vector B- A= AB = (-2-1,6-5,3-0) = (-3,1,3). 


A 
y 


CCl) 
ee eo ane, 


(The difference between 
the points A and B) 
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Definition 163. Given a point A = (a1,a2,a3) and a vector v = (v1, 2,03), their sum 
A+-v is this point: 


A+v = (a1 + U1, A2 + Vo, 43 + U3). 


Example 848. Given the point A= (1,5,0) and the vector v = (-3,1,3), their sum is 
the point A+ v = (1-3,5+1,0+3) = (-2,6,3). 


A 


(The sum of a point A y 


and a vector v) 


A+v = (-2,6,3) 
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Definition 164. Given a point B = (by, bo, b3) and a vector v = (v1, V2, v3), their difference 
B-v is this point: 


B-v = (b; — v1, be — v2, b3 — U3). 


Example 849. Given the point B = (—2,6,3) and the vector v = (-3, 1,3), their difference 
is the point B - v = (-2 - (-3) ,6-1,3-3) = (1, 5,0). 


A 
y 


B-v=(1,5,0) 
(The difference between a 
point B and a vector v) 





Exercise 247. Let A = (1,2,3), B = (-1,0,7), and C = (5,-2,3) be points. What are 
(a) A+B; (b) A-B; (c) 4+ (B+C); and (d) 4+ (B-C)? (Answer on p. 1855.) 
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64.3. Sum, Additive Inverse, and Difference of Vectors 


As before, 1. Vector + Vector = Vector. 
2 — Vector = Vector. 
3. Vector — Vector = Vector. 


Definition 165. Let u = (uj, u2,u3) and v = (v1, v2,v3) be vectors. Then their sum, 
denoted u + v, is the vector u+ v = (wu; + Uj, U2 + V2, Ug + U3). 


Example 850. Given the vectors u = (1,2,3) and v = (-1,0,1), their sum is the vector 
u+v=(1-1,2+0,3+1) =(0,2,4). = 


Ay 
v = (-1,0,1) 


Sl. 


u+v = (0, 2,4) Place the tail of v at the head of u. 
(The sum of two vectors) Then the sum u + v is the vector 
from the tail of u to the head of v. 








Definition 166. The additive inverse of u = (uz, U2, ug) is this vector: 


= (-u1, —Ua, —ti3). 


Example 851. The additive inverse of u = (1,2,3) is the vector -u=(-1,-2,-3). & 


Ay 
H=(1,2.2) 
Flip u in the exact 


opposite direction to get 
its additive inverse —u. 





—u = (-1,-2,-3) 
(The additive inverse) 
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Definition 135. Given two vectors u and v, their difference u-v is the sum of u and 
the additive inverse of v. That is, 


u-v=u+(-v). 


Fact 151. Jf u = (uy, U2, U3) and v = (v1, v2, U3) are vectors, then 


U- V = (tH — U1, Ue — V2, Ug — U3). 





Proof. By Definition 166, —v = (v1, -v2, -v3). And so by Definition 135, 











u-v=u+4 (-Vv) = (uy — U4, U2 — V2, Ug — U3) - 





Example 852. Given the vectors u = (1,2,3) and v = (-1,0,1), their difference is the 
vector u- v = (1-(-1),2-0,3-1) = (2, 2,2). Se 


Ay 
v =(-1,0,1) 


“= 12,3) 


Place the heads of u and v 
at the same point. Then the (The difference 
difference u — v is the vector of two vectors) 
from the tail of u to that of v. 








Exercise 248. Let u = (1,2,3), v = (-1,0,7), and w = (5,-2,3) be vectors. What are 
(a) u+v; (b) u-v; (c) u+(v+w); and (d) u+(v-w)? (Answer on p. 1855.) 
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Fact 119. Suppose A and B are points. Then OB -OA=AB. 


Proof. Let A = (a1, a2,a3) and B = (bj, bo, 63). Then 


—> 


AB= (b; — 41, bo — ag, bg = a3), OA = (a1, 42, 43), and OB - (b;, bo, b3). 














By Fact Le. OB = OA = (b; —- aj, by — a9, bs = a3). Thus, AB = OB = OA. 


Example 853. Let A= (1,5,0) and B = (-2,6,3) be points. <2] 
Then AB = B- A = (-2,6,3) - (1,5,0) = (-3,1,3), OA = (1,5,0), and OB = (-2,6,3). 
And indeed, OB - OA = (-2,6,3) — (1,5, 0) = (-3, 1,3) = AB. Y, 


A 
y 


B = (-2,6,3) 


-— 


Op204-468—e3 13) 
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Fact 120. Suppose A, B, and C are points. Then AB —- AC = CB and AB + BC = AC. 





Proof. Let A = (a, a2,a3), B = (by, 2,63), and C = (cy, c2,c3). Then 


AB = (b; —- a1, bo — ag, bs — a3), 
AC = (cr — aj, C2 — AQ, C3 —az), 
CB 


= (b; —¢1, by - a, b3 — cs), 














AB — AC = (by — a1, bz — ag, bz — a3) — (€1 — a1, C2 — G2, C3 — 3) 


7 (b, — CI, by — C2, bs = C3) = CB. 





Observing that _CB = BC and rearranging, we also have AB+ BC = AC. 











Example 854. Let A= (1,5,0), B = (-2,6,3), and C = (4,-2,1) be points. & 


Then ae = = 40-968) 0) i ae Sr SA 9 11 5 
fo til mand BO == 8 14-2 1 Ge 2). 


And indeed, AB - AC = (-3, 1,3) - (3,-7,1) = (-6,8,2) = -BC = CB. v 
Alan Ae BC 245 ayo 28 S02 = (oe 1) a J 


B= (-2,6,3) gob = 319) 


BO = (6, -8,-2) 


We =o 1) 





Exercise 249. Let A = (5,-1 9), B= (3,( 6, —5), and C= Q, 2 2,3) be points. Find AB, 
AC, and BC: and show that AB - AC’ = CB and AB + BC = AC. (Answer on p. 1855.) 
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64.4. Scalar Multiplication and When Two Vectors Are Parallel 


Definition 167. Given the vector v = (v1, v2, v3) and the scalar c € R, the vector cv is 
CV = (CU}, CV2, CU3). 
Fact 121. If v is a vector and ce R, then |cv| = |c||v|. 


Example 855. Let v = (-1,0,1) be a vector. Then 2v = (-2,0,2) and —3v = (3,0, -3). 


~3v = (3,0, -3) 
eee eee 





4 V= (-1,0,1) 


Now, lv] = \/ (-1)? + 02+ 2 = V2 


And so by Fact 121, we have 


2v}=2V2 and — |-3v|=3v2. 
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Definition 137. We say that two non-zero vectors u and v point in 


(a) The same direction if u = kv for some k > 0; 
(b) Exact opposite directions if u= kv for some k < 0; and 
(c) Different directions if u # kv for any k € R. 


Definition 138. Two non-zero vectors u and v are parallel if u= kv for some k € R and 
non-parallel otherwise. 


Example 856. Let u = (1,0,1). Then u points in 


e The same direction as v = (3,0,3) because v = 3u. 
e The exact opposite direction as w = (—2,0,-2) because w = -2u. 


¢ A different direction from x = (5,1,0) because x # ku for any k. 











So, u is parallel to both v and w, but not to x. As shorthand, we may write 


u || v,w and u tH x. 


Exercise 250. Continue to let u = (1,0,1), v = (8,0,3), w = (-2,0,-2), and x = (5, 1,0). 
State if each of the following pairs of vectors point in the same, exact opposite, or different 
directions; and also if they are parallel. (Answer on p. 1855.) 


(a) vand w. (b) vandx. (c) wandx. (d) uand 0. 
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64.5. Unit Vectors 


The following definitions and results about unit vectors are exactly the same as before: 


Definition 139. A unit vector is any vector of length 1. 


Definition 140. Given a non-zero vector v, its unit vector (or the unit vector in its 
direction) is 


(a) <= aandb point in the same direction; 
(b) = point in exact opposite directions; 


(c) b <= 
(d) 


Exercise 251. Find the length and unit vector of each vector. (Answer on p. 1856.) 


(a) aac (b) b=(4,5,6). (c) a-b. 
(d) 2a. (e) 3b. (f) -4(a-b). 
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64.6. The Standard Basis Vectors 


Recall*“! that in 2D space, the (two) standard basis vectors were the unit vectors that 
point in the directions of the positive x- and y-axes: 


i= (1,0) and j= (0,1). 


Analogously, in 3D space, the (three) standard basis vectors are the three unit vectors 
that point in the directions of the positive x-, y-, and z-axes: 


Definition 168. The standard basis vectors are 


i=(1,0,0), j=(0,1,0), and k=(0,0,1). 














Not surprisingly, every vector can be written as the linear combination of i, j, and k:3” 


Example 857. Let u = (7,5,3). Then u = 7i+ 5j + 3k. 


Exercise 252. Write each of the vectors v = (9,0,-1) and w = (-7,3,5) as a linear 
combination of the standard basis vectors. (Answer on p. 1856.) 








Ch, Bale, 

342You may also recall (Fact 125) that in 2D space, every vector can be written as the linear combination 
of two non-parallel vectors. It turns out that there is an analogous result in 3D space. 
For this result, we first define what it means for three (or more) vectors to be linearly independent: 


Definition 169. Three (or more) non-zero vectors are linearly independent if the first vector cannot 





be written as a linear combination of the other two vectors. 


We then have the following Fact (proof omitted). This Fact is beyond H2 Maths and isn’t something 
you need worry about. 


Fact 152. Every vector can be written as the linear combination of three linearly independent vectors. 
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64.7. The Ratio Theorem 
The Ratio Theorem is exactly the same as before and now reproduced: 


Theorem 16. Let A and B be points with position vectors a and b. Suppose the point 
P is on the line segment AB and divides it in the ratio A: uy. Then P’s position vector is 


_ pat Xb 
oe 


The point P has position vector: 


_ pa+ Ab 


A+ 
The point A has 
position vector a. 


The point B has 
position vector b. 





Let’s end this chapter with three more exercises: 


Exercise 254. Fill in the blanks. (Answers on p. 1856.) 


(a) Informally, a vector is an “arrow” with two properties: and 


(b) A point and a vector are entirely different objects and should not be confused. 
Nonetheless, each can be described by 


(c) Let A = (a1, a2,a3) be a point and a = (aj, @2,a3) be a vector. We say that a is A’s 


(d) The vector a = (a1, @2,a3) carries us from the to the point A = (a1, a2, a3). 


Exercise 255. Let A = (a1, a2,a3) be a point. Write down the vector from the origin to 
A in every possible way. (Answers on p. 1856.) 


Exercise 256. Let A = (a1, d2,a3) and B = (b1, ba, b3) be points. What are A+B, A+OB. 


OA+ OB, OA= OB, OA-BA? (Answers on p. 1856.) 
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65. The Scalar Product (in 3D) 


In 2D space, we defined the scalar product of u = (u1,u2) and v = (v1, v2) to be 
U'V= UYU + U2QV9. 
We define the scalar product in 3D space analogously: 


Definition 170. Given vectors u = (ui, U2, U3) and v = (v1, v2, v3), their scalar product, 
denoted u- v, is this number: 


U-V =H UV + U2QV2 + UZV3. 


Example 858. Let u = (5,-3,1), v = (2,1,-2), and w = (0,-4,3). Then 


=10-3-2=5. 


e123 = 15. 





Recall*#? that in 2D space, the scalar product was both commutative and distributive 
over addition. The same remains true of the scalar product in 3D space: 


Fact 153. Suppose a, b, and c are vectors. Then 


(a) a-b=b.-a. (Commutative) 
(b) a-(b+c)=a-bta-e. (Distributive over Addition) 





Proof. Let®“* a = (ay, a2,a3), b = (b,b2,b3), and ¢ = (c,,c2,c3). Then 
(a) a-b= a,b, + agby +E a3b3 = biay at bag + b3a3 =b-a. 


(b) a:(b+c) =a; (b1 + C1) + Gg (b2 + C2) + a3 (b3 + €3) 
= 0101 + dgby + a3b3 + acy + AoCo +0303 =a-bt+a-c. 

















33 Pact 129. 
344OQur proof here covers only the 3D case. For a more general proof, see p. 1620 (Appendices). 
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Example 859. Continue to let u = (5,-3,1), v = (2,1,-2), and w = (0, -4,3). 


To illustrate commutativity, we can easily verify that 


v-u=u-ve=)5) and w-u=u:'w=D. 


And to illustrate distributivity, we compute the following: 


2+0 
u-(v+w) a daa jeer os 1 = 20. 


(u+v)-w “| <4 | = 0+8-3 <5. 
3 





And again, a vector’s length is the square root of its scalar product with itself: 


Fact 131. Suppose v be a vector. Then |v| =\/v-v and |v| =v-v. 





Proof. By Definition 162, |v| = , foe + ue + ue By Definition 144, v-v = vyv1 + vove + U3U3 = 
2 
vs + vs + v3. Hence, |v| = /v-v and |v|" =v-v. 














Exercise 257. Compute (1, 2,3)-(4,5,6) and (-2, 4, -6)-(1,-2,3). (Answer on p. 1857.) 
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65.1. The Angle between Two Vectors 


Place the tails of two vectors at the same point. Then as before, the angle between 
these two vectors is, informally, simply the (smaller) “amount” by which we must rotate 
one of the two vectors so that both point in the same direction. 


Example 860. As shown in the figure below, a, b, c, and d are vectors. The angle 
between a and b is a, while that between c and d is (. 


A 
Y 








(Observe that here it so happens that a is acute, while 3 is obtuse.) 





And formally, we'll use the exact same Definition as before: 


Definition 145. The angle between two non-zero vectors u and v is this number: 





And as before, a simple rearrangement of Definition 145 yields: 


Fact 132. [fu and v are two non-zero vectors and @ is the angle between them, then 


u-v = |u| |v|cosé. 
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Example 861. The angle between the vectors (1,3,-2) and (0,2, 1) is 


tig de) 1 21) nee 4 
|(1, 3, -2)||(0, 2, 1)| 7 V 14/5 


cos x 1.072. 


A 
y (3D 





Example 862. The angle between (8,5,0) and (—2,—3,5) is 


yn 
cos”! (8,5,0)-(=2,-3,5) = cos” 


I(8, 5, 0)| (-2, =3; 5)| 


1 





A 
Y 











The following results and Definition are reproduced verbatim from before: 


Fact 133. (Cauchy’s Inequality.) Suppose u and v are non-zero vectors. Then 





Equivalently, —jul[v|<u-v<|ullvy] or (u-v)’ <|ul’ vp. 
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Fact 134. Suppose 6 is the angle between two non-zero vectors u and v. Then 


uv 
u : 


navi Ty i € (0,1) And thus, 


walvi = (i) u-v>0 <= @ is acute or zero; 
u| |v 


u-v 
——— e€ (-1,0) (ii) u-v =0 <=> 6 is right; 

jul [v| 

u-v — 
(iii) u-v <0 <= @ is obtuse or straight. 


ful fy] 


Definition 146. Two non-zero vectors u and v are perpendicular (or normal or orthog- 
onal) if u-v =0 and non-perpendicular if u- v # 0. 


Fact 135. Suppose u and v are non-zero vectors. Then 

(a) ——= <> u and v point in the same direction; 
<= u and v point in exact opposite directions; 
— 1 || Vv; 


—— ¢€ (-1,1) <= u andv point in different directions. 


Theorem 17. (Pythagoras’ Theorem.) /fu tv, then ju +-v|" = ful’ +|v|’. 
Fact 136. (Triangle Inequality.) Jf u and v are vectors, then \u+ v| < |u| + |v]. 


Exercise 258. Find the angle between each pair of vectors. Also, state whether each 
pair of vectors is parallel or perpendicular. (Answer on p. 1857.) 


(a) @= (12,3) and b=(4 5.0) (b) u=(-2,4,-6) and v = (1,-2,3) 
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65.2. Direction Cosines 


In 2D space, we defined the x- and y-direction cosines of the vector u = (wu, U2) to be u;/|ul 
and u2/ |u|. We shall define direction cosines in 3D space analogously: 


Definition 171. The x-, y-, and z-direction cosines of the vector v = (v1, U2, v3) are 


U1 V9 ae U3 
Iv fv Iv] 





Again, the direction cosines are so named because each direction cosine is equal to the 
cosine of the angle the given vector makes with each (positive) axis: 


Fact 154. Let v = (v1, v2,v3) be a non-zero vector. Suppose a, B, and y are the angles 
between v and each of i, j, and k. Then 


UI 
cos @ = — cos 3 = — 


Iv] 





Proof. Since a is the angle between v and i, we have 


v-i V1-1+v29-0+03-0 UI 
cos 2 = —_ = =F 
Iv] |i] Iv|-1 Iv| 


Similarly, since @ is the angle between v and j, we have 


cos 8 = vj _ Ur OF vg-1+ 03-0 _ V2 
Iv lil [v|-1 Iv] 


And since ¥ is the angle between v and k, we have 





vk _ uy-04+2-04+03-1 U3 
[v| [k| [v|-1 Iv] 














Example 863. Let v = (2,3,2). Compute |v| = V2? + 3? + 2? = V17. 
So v’s x-, y-, and z-direction cosines are 2/V17, 3/V17, and 2/V17. 


And the angles it makes with the positive x-, y-, and z-axes are 


2 3 
cos’! ~ 1.064, cos’! x (0.756, and cos” 


V17 V17 


Exercise 259. For each vector, write down its unit vector and z-, y-, and z-direction 
cosines. ‘Then compute also the angles it makes with the positive x-, y-, and z-axes. 


(a) (1,3,-2). (b) (4,2,-3). (c) (-1,2,-4). | (Answer on p. 1857.) 
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66. The Projection and Rejection Vectors (in 3D) 


Our definitions and results about the projection and rejection vectors carry over from 
2D space in the “obvious” fashion. Here are the same Definitions reproduced: 


Definition 152. Suppose a and b # O are vectors. Then the projection of a on b, 
denoted proj;,a, is this vector: 


projpa = (a b) b or equivalently, projpa = BFE 


Definition 153. Suppose a and b # 0 are vectors. Then the rejection of a on b, denoted 
rejpa, is this vector: 





rejp,a = a— proj,a. 


As before, the two key properties*”’ are that 





; : projpa | b 
rejpa = a- proj,a 





and 


rejpa 1 proj,a, b. 


projpa b 


The 2D figure above is simply reproduced from before. Here’s a figure depicting the pro- 
jection and rejection vectors in 3D: S&S 


A 





Y 


projp,a 


rejpa =a-—projpa 














345 Acain, these two properties must hold provided proj,a and rej,,a are both non-zero. 
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Example 864. Let a = (5,-2,3) and b= (0,1,2). Then b = (0, 1,2) /V5 and 


_ (5,-2,3) - (0, 1,2) ; 0-246 
7 5 a: 
2 


rejjpa = a — proj,a = (5, -2,3) — (0, 0.8, 1.6) = (5, -2.8, 1.4) 


A 
y 


projpa 





rejpa = a-—projpa 


We can easily verify that projya = kb for some k and hence that proj,a || b: 
proj,a = (0, 0.8, 1.6) = 0.8 (0, 1,2) = 0.8b. A 
We can also verify that rej,a-b =0 and hence that rejy,a 1 b (and also rej,a 1 projy,a): 


rej,pa-b = (5,-2.8, 1.4)- (0, 1,2) =0-2.8+2.8=0. 7 
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As before, the length of the projection vector is given by the scalar product: 


Fact 142. Suppose a and b ¢ 0 are vectors. Then 


|proj,al = la b| 


Example 865. Continue to let a = (5,-2,3) and b = (0,1,2). We already found 
b=(0,1,2)//5 and _ proj,a = (0,0.8, 1.6). 

Now, |proj,a| = |0.8 (0, 1, 2)| = 0.802 + 12 + 22 = 0.8/5. 

Also, la- 6] = (5,-2,3) -(0,1,2) /V5 = 4/V5 = 4V5/5 = 0.8V5. 


And so, it is indeed true that |proj,al = la b|. 





As before, the sign of a-b tells us whether proj,,a points in the same or exact opposite 
direction as b: 


Fact 141. Let a and b #0 be vectors and proj,a = (a b) b. 


(a) Ifa- b>0, then proj,a 7s a positive scalar multiple of b. 
(b) Ifa- b <0, then projp,a is a negative scalar multiple of b. 
(c) Ifa-b=0, then proj,a = 0 and rejpa =a. 


Example 866. Let a = (2,5,-1), b = 
(1,1,4), and c = (-2,-2,1). Then iS] 


(a) a-b=2+5-4>0, so that the angle 
between a and b is acute and proj,a 
points in the same direction as b. 

(b) a-c=-4-10-1<5, so that the angle 
between a and c is obtuse and proj,a 
points in the exact opposite direction 
as b. projpa 

(c) b-c = -2—2+4=0, so that the angle 
between a and c is right and proj,b = 0. 
Moreover, rej, b = b. 
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As before, if v || w, then the projections of any vector on each of v and w are identical: 


Fact 143. Let u, v, and w be vectors. If v || w, then 
proj,U = proj,,u. 


Example 867. Let u = (2,5,-1), v = (1,-2,1), and w = (-2,4, -2). Since v || w, by the 
above Fact, it should be that proj,u = proj,,u, as we now verify: 


1 
projyu = (u-V)¥ = 5 
1? + (-2)°+1? 


ee oe Cele ee) 


proj, U = projwu 








Exercise 261. Given a = (1,2,3) and b = (4,5,6), find proj,a, rej,a, |proj,al, and 


lrejpa|. Verify that projpa || b and rejpa 1 b. Does proj,a point in the same or exact 
opposite direction as b? 





(Answer on p. 1858.) 
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67. Lines (in 3D) 


Our definition of a line in 3D space is the general one given earlier and now reproduced: 


Definition 143. A line is any set of points that can be written as 


{R:OR=p+ dv (eR)}, 


where p and v # 0 are vectors. 





As before, the above Definition says that a line contains exactly those points R whose 
— 
position vector OR =r may be expressed as 


P1 U1 
OR=r= Pt+Av=] po |+Al vo for some real number 4. 
P3 U3 


Equivalently, a line contains exactly those points R that may be expressed as 


Pi U1 
R=] po | +A] ve for some real number 4X. 
P3 U3 


As before, here are what the vectors p and v and the number \ mean: 


* p=(p1,p2,p3) is the position vector of some point on the line; 
¢ v =(v1, V2, v3) is a direction vector of the line; and 


e The parameter takes on every value in R; each distinct value produces a distinct 
point on the line. 


Direction vectors are defined exactly as before: 





And as before, direction vectors are unique up to non-zero scalar multiplication. In 
other words, if a line has direction vector v, then that line’s direction vectors are exactly 
those that are parallel to v. Formally, 


Fact 126. Suppose v is a line’s direction vector. Then 


u is also that line’s direction vector 
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Example 868. Let / be the line described by this vector equation: 


0 
+A} 1 (A€R). 
1 


The line / contains the point P = (1,2,3) and has direction vector v = (0,1, 1). & 


As the parameter \ varies, we get different points of /. So for example, when \ takes on 
the values 0, 1, and —1, we get these three position vectors (and corresponding points): 


1 1 0 1 0 il 1 
2 |=] 2 7+] 1 J, 2 eat 1 21-114 
3 - 


ae 


x 








> 


Note that the direction vector v = (0, 1,1) has x-coordinate 0. Informally, one implication 
of this is that the line doesn’t “move” in the direction of the z-axis. 


A little more formally, the line is perpendicular to the z-axis. Indeed, we can easily verify 
that v is perpendicular to the first standard basis vector i = (1, 0,0): 


1 
0 }=0-14+1-04+1-0=0. 
0 





677, Contents www.EconsPhDTutor.com 


Example 869. Let / be the line described by this vector equation: 


r=OP+)\v= (\€R). 


The line / contains the point P = (0,0,0) and has direction vector v = (1,0,0). 2 


As the parameter varies, we get different points of /. So for example, when \ takes on 
the values 0, 1, and —-1, we get these three position vectors (and corresponding points): 


0 


0 
0 
0 





Note that the direction vector v = (1,0,0) has y- and z-coordinates 0. Again, this means 
that the line is perpendicular to the y- and z-axes—we can easily verify that v is perpen- 
dicular to both j = (0,1,0) and k = (0,0, 1). 


Indeed, this line actually coincides with the x-axis—it passes through the origin (0,0, 0) 
and its direction vector is parallel to i. 
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67.1. Vector to Cartesian Equations 


Suppose a line / is described by this vector equation: 


r = (pi, Po, p3) + A(U1, V2, V3) (A€R). 
Then for each point (x,y,z) on J, there is some \ € R such that 
L=p,t+Av1, y=potAvo, and z=p3+Avs. 


These three cartesian equations also describe /. 


In 2D space, we could eliminate the parameter 4 and thus reduce the two cartesian equations 
to one. Here in 3D space, we can also eliminate the parameter \, but this time we’ll merely 
reduce the three cartesian equations to two. 


We start with three examples where none of the direction vector’s coordinates are zero: 


Example 870. Let / be the line described by this vector equation: 
(A €R). 


That is, let 12( Rive (23) 46). 


In words, / is the set of points R whose position vector can be written as (1,2,3)+A(4, 5,6), 
for some real number 4. 


Write out = as the following three cartesian equations: 
1 2 3 
x=1+A4), y=2+5d, z=3+6A. 
Rearrange each equation so that is on one side: 


= 
24 


5 ) 
Eliminating \ leaves these two cartesian equations that describe the line 1: 


Lae 


4 5 6 


And so, we may also write 


y—2 — 
5 Go) 
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Example 871. Let /) be the line described by this vector equation: 
(A€R). 


Write out the three cartesian equations: 


g2-2+d, y254+5d, 220-2). 
Rearrange each equation so that is on one side: 


y-95 Zz 
A=—, \=—. 

5 —2 
So, lo may also be described by these two cartesian equations: 


E+2 


Example 872. Let /; be the line described by this vector equation: 


2 
+A} 3 (A €R). 
5 


Write out the three cartesian equations: 


e20+2\, y204+3A, 220+5). 
Rearrange each equation so that is on one side: 


a5 y Zz 
A== A == Aza. 
i a 5 


So, /; may also be described by these two cartesian equations: 
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We now look at examples where exactly one of the direction vector’s coordinates is zero: 


Example 873. The line ly is described by r = (1, 2,3) + A(0, 5, 6) (A€R). 
Write out the three cartesian equations: 2 Sole S( te ea 2 oS Les 


Observe that the direction vector (0,5,6) has x-coordinate 0. And so, lg must be per- 
pendicular to the x-axis. Indeed, the x-coordinate of every point on J, is fixed as x <1. 


1 2 3 
Leave = alone. But as before, rearrange = and = so that A is on one side: 


=a 
Xr Jae and 
ie 


oak 


Example 874. The line /3 is described by r = (1, 2,3) + A(4, 0, 6) (A€R). 
Write out the three cartesian equations: 7 Sie a 29+0A\= De ee eno 


Observe that the direction vector (4,0,6) has y-coordinate 0. And so, 13 must be per- 
pendicular to the y-axis. Indeed, the y-coordinate of every point on /3 is fixed as y 29. 


Leave y 22 alone. But as before, rearrange + and 2 so that \ is on one side: 


-] 
and 


Thus, /3 may also be described by these two cartesian equations: 





y=2 


Example 875. The line I, is described by r = (1, 2,3) + A(4, 5, 0) (A€R). 
Write out the three cartesian equations: 2 21+ Ar, y 2904 As 2 $34+0\=3. 


Observe that the direction vector (4,5,0) has z-coordinate 0. And so, l4 must be per- 
pendicular to the z-axis. Indeed, the z-coordinate of every point on l4 is fixed as z 33. 


2 1 2 ; 
Leave z = 3 alone. But as before, rearrange = and = so that \ is on one side: 


— 
34 


5 


Thus, /4 may also be described by these two cartesian equations: 


f= 
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We now look at examples where exactly two of the direction vector’s coordinates are zero: 


Example 876. The line I; is described by r = (1, 2,3) + A(0, 0, 6) (A €R). 


Write out the three cartesian equations: 
1 2 3 
yel+0A4 = 1, H=2+0A=2, Last On, 
Observe that the direction vector (0,0,6) has z- and y-coordinates 0. And so, /; must be 


perpendicular to both the x- and y-axes. 


Indeed, the x-and y-coordinates of every point of I; are fixed as x +1 and y 29. 


On the other hand, z is free to vary along with \. Unlike in any of our previous examples, 
there is no restriction on what z can be. And so we call z the free variable. 


Hence, in this example, there’s actually no algebra to be done. We simply discard 3 and 
say that /; may be described by these two cartesian equations: 


2 


Example 877. The line Ig is described by r = (1, 2,3) + A(0, 5, 0) (A€R). 


Write out the three cartesian equations: 
1 2 3 
e=1+04 = 1, Y= 2+ or pao OA = 3: 


Observe that the direction vector (0,5,0) has x- and z-coordinates 0. And so, lg must be 
perpendicular to both the x- and z-axes. 


Indeed, the x-and z-coordinates of every point of lg are fixed as x +1 and z33. 


Hence, the free variable is y. We simply discard 2 and say that /g may be described by 
these two cartesian equations: 


Example 878. The line I; is described by r = (1, 2,3) + A(4, 0, 0) (A€R). 


Write out the three cartesian equations: 
1 2 3 
ge l4 44, y=2+0A =2, £=3+0A=3. 


Observe that the direction vector (4,0,0) has y- and z-coordinates 0. And so, J7 must be 
perpendicular to both the y- and z-axes. 


Indeed, the y-and z-coordinates of every point of /7 are fixed as y 29 and z 33. 


Hence, the free variable is x. We simply discard + and say that /7 may be described by 
these two cartesian equations: 
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In total, we have seven possible cases, depending on which of the direction vector’s 
coordinates are zero. These seven cases are summarised in the following Fact (and were 
illustrated by the above examples). 

Fact 155. Suppose the line | is described by r = (pj, p2, p3) + A(V1, V2, U3) (A € R). 

(1) If v1, v2,v3 #0, then I can be described by 


ce Re es a ce) 
U1 V2 {0 





(2) Ifv, =0 and vy, v3 #0, then l is perpendicular to the x-axis and can be described by 





t= pi 


(3) If v2=0 and v1, v3 #0, then l is perpendicular to the y-axis and can be described by 


L-Pi _ %—P3 
Y = po and = 
U1 U3 


(4) If v3 =0 and v1, v2 #0, then | is perpendicular to the z-axis and can be described by 
z= P3 
(5) Ifv1,v2 =0, then | is perpendicular to the x- and y-axes and can be described by 


1G = fOh| Uf) = [25- 


(6) If v1,v3 =0, then l is perpendicular to the x- and z-axes and can be described by 


t= pi & = p3- 


(7) If v2,v3 =0, then l is perpendicular to the y- and z-axes and can be described by 


aS, Y= Po: 








Proof. See p. 1624 (Appendices). 











Exercise 262. Each vector equation below describes a line. Rewrite each into cartesian 
form. Also, state if each line is perpendicular to any axes. (Answer on p. 1859.) 


(a) r=(-1,1,1)+A(3,-2,1) (eR). (b) r=(5,6,1)+\(7,8,1) (eR). 


(c) r=(0,-3,1)+A(3,0,1) (A€R). (d) r=(9,9,9)+A(1,0,0) (A€R). 
(e) r=(0,0,0)+A(4,8,5) (AER). (f) r=(1,3,5)+A(0,-4,0) (A€R). 
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67.2. Cartesian to Vector Equations 
In the last subchapter, we started with a line’s vector equation, then wrote down its carte- 
sian equations. We’ll now go the other way round. 
To write down a line’s vector equation, all we need are a point that’s on the line and a 


direction vector of the line. 


Example 879. Let / be the line described by these cartesian equations: 


af=9  2y-8 2-1 
| re 





First, rewrite the above cartesian equations so that the coefficients on x, y, and z are all 
1. This is easily done by dividing the numerator and denominator of each fraction by the 
variable’s coefficient: 


Reading off, the line / contains the point (3,4,1) and has direction vector (2,1,3). So, it 
can also be described by this vector equation: 


Pe(a4 a as) (A €R). 


=e 0,0U toe 


Example 880. Let /, be the line described by 5 03 


Za): 


w= Ue eH 2 
. | 06 71 


Rewrite as 





Reading off, 1; contains the point (7,-2,2) and has direction vector (5,0.6,1). So, it can 
also be described by 


r = (7,-2,2) + A(5, 0.6, 1) (\€R). 


5 -12 3z-15 
Example 881. Let /2 be the line described by — = —— ee a. 


Rewrit 
ewrite as aT 


Reading off, /2 contains the point (0,12,5) and has direction vector (0.4,6,3). So, it can 
also be described by 


r= (0,12,5)+A(0.4,6,3) (eR) 
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Three examples where exactly one of the direction vector’s coordinates is zero: 
Example 882. Let /3 be the line described by 


oy-—12 2 3z-15 
ae 


By =. every point on ls has x-coordinate 17. Thus, /3 is perpendicular to the x-axis. 
Me 
Rewrite = as 


Reading off, 13; contains the point (17,2.4,5) and has direction vector (0,20,3). So, it 
can also be described by 


r = (17, 2.4, 5) + A(0, 20, 3) (A €R). 
Example 883. Let /, be the line described by 


1 —£ 2 z+10 


3 -5 
By = every point on l, has y-coordinate —2. ‘Thus, l, is perpendicular to the y-axis. 


Te i) 
= a oe 





2 
Rewrite = as 


Reading off, /, contains the point (0,-2,-10) and has direction vector (—3,0,-5). So, it 
can also be described by 


r = (0,-2,-10) + A(-3, 0, -5) (A €R). 
Example 884. Let /; be the line described by 


1 7x -6 » 2y+10 
4 = d = , 
‘ es 35 18 


By 2 every point on ls has z-coordinate 3/4. Thus, J; is perpendicular to the z-axis. 


n-6/7_y-(-5) 
o 9 


2 
Rewrite = as 


Reading off, J; contains the point (6/7, —5,3/4) and has direction vector (5,9,0). So, it can 
also be described by 


r = (6/7, -5, 3/4) + A(5, 9,0) (A €R). 





685, Contents www.EconsPhDTutor.com 


Three examples where exactly two of the direction vector’s coordinates are zero: 


Example 885. Let /g be the line described by x = 5 and z= 9. 


Every point on lg has x- and z-coordinates 5 and 9. Hence, the direction vector must 
have 0 as its x- and z-coordinates. (Equivalently, this line must be perpendicular to both 
the x- and z-axes.) 


The free variable is y. Altogether then, Jg contains exactly those points (5,y,9), for all 
real numbers y. For example, it contains the points (5,0,9) and (5,-100,9). Hence, for 
any non-zero k, (0,k,0) is a direction vector of Ig. 


For simplicity, we pick (0,1,0) as our direction vector and describe lg by 
r=(5,0,9)+2(0,1,0) (eR). 


Example 886. Let /7 be the line described by 3x = —-12 and y = 0. 


Every point on [7 has z- and y-coordinates —4 and 0. Hence, the direction vector must 
have 0 as its z- and y-coordinates. (Equivalently, this line must be perpendicular to both 
the x- and y-axes.) 


The free variable is z. Altogether then, /7 contains exactly those points (—4,0, z), for all 
real numbers z. For example, it contains the points (—4,0,0) and (—4,0,7t). Hence, for 
any non-zero k, (0,0,k) is a direction vector of lz. 


For simplicity, we pick (0,0, 1) as our direction vector and describe l7 by 
r= (-4,0,0)+A(0,0,1) (AeR). 


Example 887. Let lg be the line described by y = -11 and —4z = 52. 


Every point on lg has y- and z-coordinates —11 and —13. Hence, the direction vector 
must have 0 as its y- and z-coordinates. (Equivalently, this line must be perpendicular 
to both the y- and z-axes.) 


The free variable is x. Altogether then, /g contains exactly those points (#,-11,-13), for 
all real numbers x. For example, it contains the points (0,-11,-13) and (v2, —11, -13). 
Hence, for any non-zero k, (k,0,0) is a direction vector of lg. 


For simplicity, we pick (1,0,0) as our direction vector and describe lg by 
r = (0,-11,-13) + A(1,0,0) (A €R). 


Exercise 263. Each pair of cartesian equations below describes a line. Rewrite each into 
vector form. State if each is perpendicular to any axes. (Answer on p. 1859.) 


2-2 O3y—5 8z 
a = —, = 11 
(a) = 7 7 (d)  3y and 


(b) 26 3 = oe, (e) =e 


= 
5 


sya. 


(c) 1?7x-4= ae. (f) 182+5=0 and y=5z-2. 
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67.3. Parallel and Perpendicular Lines 


Our definition of when two lines in 3D space are parallel or perpendicular is exactly the 
same as before and now reproduced: 


Definition 150. Two lines are (a) parallel if they have parallel direction vectors; and 
(b) perpendicular if they have perpendicular direction vectors. 


Example 888. Suppose two lines are described by 
r= (1,01) + 4C1, 2,3) and r = (5,0,9) + us (-2, -4, -6) (A,u€R). 


Since (1, 2,3) || (—2,-4,-6), by the above Definition, the two lines are parallel. 


(A € R) 








Example 889. Suppose two lines are described by 
r = (5,-1,4) + A(8,2,-1) and r = (3,1,6) + (1, -2,4) (A, weR). 


We have (8,2,-1)-(1,-2,4) =8-4-4=0. So, (8,2,-1) 1 (1,-2,4) and by the above 
Definition, the two lines are perpendicular. 


Example 890. Suppose two lines are described by 


r= (3,2,-1)+2(1,0,0) and  r=(0,0,0)+y(1,1,0) (A,weR). 


Since (1,0,0) + (1,1,0) and (1,0,0) { (1,1,0), the two lines are neither parallel nor 
perpendicular. 
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The following Fact remains true in 3D space: 


Fact 139. Suppose two lines are ... 


(a) Identical. Then they are also parallel. 
(b) Distinct and parallel. Then they do not intersect. 


(c) Distinct. Then they share at most one intersection point. 


Example 891. As in Example 888, 
two lines are described by 


r = (1,0,1)+A(1, 2,3) and 
r= (5,0,9) + u(-2,-4,-6) (A,weR). 


Since (1,2,3) || (-2,-4,-6), the two 
lines are parallel. 


Observe that the point (1,0,1) is on 
the first line (plug in A = 0), but not 
on the second (to see this, observe that 
the only point on the second line with 
x-coordinate 1 corresponds to p = 2). 
Thus, the two lines are distinct. 





Since they are parallel and distinct, by 
Fact 139(b), they do not intersect. & 


Example 892. ‘Two lines are described by 
r = (3,6,9) + A(1, 2,3) and r = (0,0,0) + ys (-2, -4, -6) (A,u€R). 


Since (1, 2,3) || (-2,-4,-6), by the above Definition, the two lines are parallel. 
Observe that the point (3,6,9) is on both lines (plug in A = 0 and pz = -1.5). 


Since the two lines are parallel and do intersect, by Fact 139(b), they cannot be distinct. 
Equivalently, they must be identical. 





We will next learn how to determine whether two lines in 3D space intersect and if they 
do, how to find their intersection point. 
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67.4. Intersecting Lines 


Fact 139(c) says that two distinct lines share at most one intersection point. Here are two 
examples of when two distinct lines in 3D space intersect and how we go about finding their 
(only) intersection point. 


Example 893. ‘Two lines are described by 


These two lines are not parallel and hence distinct. And so, by Fact 139(c), they share 
at most one intersection point. 


Suppose they intersect. Then there must be real numbers \ and fi such that 


From 2 \=-1. Plug this into = to get fi=—-1. You can verify that these values of \ and 
{i also satisfy Z. Hence, the two lines do indeed intersect. 


To find their intersection point, plug \=-1 or ji = —1 into either line’s vector equation: 


0 
O J+ 
0 


a 
a 
-1 
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Example 894. ‘Two lines are described by 


(A, we R). 


These two lines are not parallel and hence distinct. And so, by Fact 139(c), they share 
at most one intersection point. 


Suppose they intersect. Then there must be real numbers \ and fi such that 
ae eee 
Le 2. 
ae é 
2 = 


34+\2244f. 


+ minus 2 yields -1 = 4-1 or f =0. Plug this into = to get N= -1. You can verify that 


these values of \ and {i also satisfy 3. Hence, the two lines intersect. 


To find their intersection point, plug \=-1lor fi = 0 into either line’s vector equation: 
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67.5. Skew Lines 


Recall**® that in 2D space, two non-parallel lines must intersect. In contrast, in 3D space, 
two non-parallel lines need not intersect. We have a special name for such lines: 


Definition 172. Two lines are said to be skew if they are not parallel and do not intersect. 


Example 895. ‘Two lines are described by 


+A} 2 (A, €R). & 
5) 


These two lines are not parallel and hence distinct. And so by Fact 139(c), they share at 
most one intersection point. 


To check if they intersect, suppose there are real numbers \ and fi such that 
1 zs 
A =1+4f, 
93 214+5f, 


3 224+ 6f. 


Now, 2x + minus 2 yields 0 = 1+ 3fi or fi = -1/3. Plug this back into = to get \= -1/3. 
But these values of \ and fi contradict Z. 


This contradiction means that there are no real numbers \ and ji such that = holds. In 
other words, the two lines do not intersect. And since they are not parallel either, by 
the above Definition, they are skew. 


AY 














346Fact 140. 
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Example 896. ‘Two lines are described by 


(A, R). 2 


These two lines are not parallel and hence distinct. And so by Fact 139(c), they share at 
most one intersection point. 


To check if they intersect, suppose there are real numbers \ and fi such that 


Now, + minus 2x 2 minus yields O25 eter = 2 Plug this back into = to ser = 1. 


But these values of \ and fi contradict Z. 


And so again, the two lines do not intersect. And since they are not parallel either, 
they are skew. 


Ay 
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67.6. The Angle Between Two Lines 


Example 897. As in 2D space, two inter- 
secting lines in 3D space form two angles 
a and 3 = 7- a at their intersection point. 
We define the smaller of these two angles 
to be the angle between the two lines. 


So in the figure on the right, the angle 
between the two lines is a and not £. 


Y 











In the figure on the left, the black and 
solid red lines do not intersect. But 
even so, we will still find it useful to talk 
about the angle between them. 


To do so, translate the red line upwards so that it intersects the black line. As usual, two 
angles a and 6 =7-a are formed at the intersection point. We then define the angle 
between the black and solid red lines to be the smaller of these two angles, namely a. 





Our formal definition of the angle between two lines is reproduced from before: 


Definition 149. Given two lines, pick for each any direction vector. We call the non- 
obtuse angle between these two vectors the angle between the two lines. 





And so, we have the same results as before: 


Corollary 22. The angle between two lines with direction vectors u and v is 


cos”! [ea 
Jul {v| 





Corollary 23. Suppose 0 is the angle between two lines l, and lz. Then (a) 0=0 <=> 
ly | lo; (b) O = 7) 2 == 1 lo. 
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First, two examples where we find the angle between two intersecting lines: 


Example 898. Two lines are described by r (3,3,3) + A(1,2,3) and 
r = (3,3,3) + 4(4,5,6) (A,ueR). 2 


Observe that these two 
lines intersect at (3,3,3). 
(And so, they aren’t skew.) 


The angle between these 
two lines is the non-obtuse 
angle between their direc- 
tion vectors and is given by 
Corollary 22: % 


=] |, 23) ° (4, 5, 6)| 


COS 
I(1, 2, 3)|1(4, 5, 6)| 








i |4+ 10+ 18] 
V 12 4+ 22 + 32\/42 + 52 + 62 


= COS 





oe 
a 
= cos ~ ——— 8 0..226 
V 14/77 


This angle is neither zero nor right. And so by Corollary 23, the two lines are neither 
parallel nor perpendicular. 


Example 899. ‘Two lines are described by 

r = (0,0,0) + A(1,0,1) and r=(6,1,3)+7(5,1,2) (A, weR). = 
Observe that these two lines intersect at (1,0,1). (And so, they are not skew.) 
Again, the angle between these two lines is given by Corollary 22: 


-1 0, 1) : (5, 1,2)| Seoss. [5 +0 + 2| feos. 7 
\(1, 0, 1) | ((5, 4,2) J12 +02 + 12/52 + 12 + 2? VO 


This angle is neither zero nor right. And so by Corollary 23, the two lines are neither 
parallel nor perpendicular. 


cos 70.442. %& 
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And now, two examples where we find the angle between two non-intersecting lines. 


Example 900. ‘Two lines are described by 


From z, N= 1. Plug this into 2 to get fi = 0. But now, these values of \ and fi contradict 
Z. Hence, the two lines do not intersect. 


Even though the two lines do not intersect, we will still find it useful to talk about the 
angle between them. This we can compute as usual:%* 


12,1) *(0,38,=2 0+6-2 
eels ’ ) (0, ’ ese | a | 


\(1, 2, 1)||(0,3,—2)| af 294 4 (2 02 + 32 + (-2)’ 7 


This angle is neither zero nor right. And so by Corollary 23, the two lines are neither 
parallel nor perpendicular. 


Since the two lines do not intersect and are not parallel, they are skew. 








The two lines do not intersect. Nonetheless, we can always translate one of the two lines 
so that they intersect. In the above figure, we’ve translated the black line so that it 
intersects the red line at the origin. 
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Example 901. ‘Two lines are described by 


r=(0,1,2)+A(9,1,3) and r=(4,5,6)+y(3,2,1) (AweR). 


If they intersect, then there are real numbers \ and fi such that 


9) £4432, 
144254 Zu, 
24+3\ 26+ [i. 
+ minus 3x 2 yields -6 = -14, which is a contradiction. Hence, the two lines do not 


intersect. Nonetheless, we can as usual compute the angle between them: 


=i (9, Lo) ; (3, 2, 1)| = cos! |27 ea 3 _ cos”! 32 
(9, 1, 3)||(3, 2, 1)| /02 + 12 + 32/32 4 22 4 72 J91V/14 


This angle is neither zero nor right. And so by Corollary 23, the two lines are neither 
parallel nor perpendicular. Since they do not intersect either, they are skew. 


Ay 


cos x 0.459. 











The two lines do not intersect. Nonetheless, we can always translate one of the two 
lines so that they intersect. In the above figure, we’ve translated the red line so that it 
intersects the black line at the point (0, 1,2). 


Exercise 264. Each of (a)—(d) gives a pair of lines in vector form. Find any inter- 
section points and the angle between the two lines. State if the two lines are parallel, 
perpendicular, identical, or skew. (Answer on p. 1860.) 
(a) re (0,11) s4(1-1,1) and. r=(,3,3)42 (00,2). 

(b) r=(-1,2,3)+A (0,1,0) and = r=(0,0,0)+ (8, -3, 5). 

(Cea i8 4 (83) and =r =(9,3,7)+ (3, -4, -3). 

(d) r= (0,0,1)+A(1,2,1) = ands r=(1,0,0)+p (-3, -6, -3). 
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67.7. Collinearity (in 3D) 
Our Definition of collinearity is the same as before: 
Fact 144 is reproduced from before and says that any two points are always collinear: 


Fact 144. Suppose A and B are distinct points. Then the unique line that contains both 
A and B is described by 


r= OA+)AB, (\€R). 


Example 902. Any two points A and B are collinear. 


ae r= OA+)AB (\€R) 











And as before, three distinct points could be collinear but won’t always generally be: 


Example 903. The points A, B, and C are collinear: 


A, B, and C are collinear. AB r=a+ AB (A €« R) 





A 
@ 





In contrast, the points D, E, and F' are not collinear. 


D, E, and F are not collinear. DE er ae \DE (\€R) 














We'll use the exact same procedure to check whether three points are collinear: 


1. First use Fact 144 to write down the unique line that contains two of the three points. 


2. Then check whether this line also contains the third point. 


Two examples: 
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Example 904. Let A = (1,2,3), B = (4,5,6), and C = (7,8,9) be points. 
To check if they are collinear, 


1. First write down the unique line that contains both A and B: 
— ——S L 
r=OA+AAB= (A€R). 


2. This line also contains C’ if and only if there exists \ eR such that 


T2149), 


1 3 
2/+A1 3], 9 or S209). 
3 3 


G2 23), 


As you can verify, \ = 2 solves the above vector equation (or system of three equations). 
So, our line also contains C’. 


We conclude that A, B, and C are collinear. & 


y » 
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Example 905. Let D = (1,0,0), E = (0,1,0), and F' = (0,0,1) be points. To check if 
they are collinear, 


1. First write down a line that contains both D and E: 
1 —1 


r=OD+\DE=| 0 |[+\ 1 (A€R). 
0 0 


2. This line also contains F' if and only if there exists \eéR such that 


From é, we have \ = 1. But this contradicts 2. This contradiction means that there is 
no solution to the above vector equation (or system of three equations). So, the line we 
wrote down above does not contain F’. 


We conclude that D, E, and F’ are not collinear. <>] 


E = (0,1,0) 


F =(0,0,1) 





Exercise 265. Determine if A, B, and C are collinear. (Answer on p. 1861.) 


(a) A=(3,1,2), B=(1,6,5), and C =(0,-1,0). 
(b) A=(1,2,4), B=(0,0,1), and C = (3,6, 10). 
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68. The Vector Product (in 3D) 


In 2D space, the vector product was simply a scalar (real number): 





In contrast, in 3D space, the vector product will be a vector (hence the name)! 


As we'll see later, we’ll often have the need to find a vector that’s perpendicular to two 
other vectors. It is this need that motivates the concept of the vector product (in 3D). 


Let a = (a1, a2,a3) and b = (61, be, bs) be vectors. Can we find some vector c = (C1, C2, ¢3) 
that’s perpendicular to both a and b? 


Well, if c 1 a,b, then a-c+Oand b-c20. Or, 


1 
(a1, a2, 43) - (C1, €2,€3) = a1C] + A2QCz + a3C3 = 0, 


(b1, bo, bs) + (C1, C2, €3) = bicy + bec2 + bgcs 20. 


Our goal is to find c that solves + and 2. Observe that bs x + minus 3X 2 yields 


3 
0 = a b3c; + dgb3cg +azb3C3 — ag3byc, — agbece = a3b3C3 


= (agb3 = a3b2) — (a3b1 = a,b3) : 
Now, notice that c, . agb3 — a3bo and co 2 a3b1 — a;,b3 solves s 


Next, get the corresponding value of c3 by plugging * and 2 into 2: 


Cl ce 
a S——_e 


j< ay (aabs - a3b2) + a9(a3b;=a7b3) + A3C3 = —ayagby + agagby + @§C3 OF C3Z= ayb al Ao. 


Hence, a vector that solves + and 2 (i.e. is perpendicular to both a and b) is 


al agb3 — agby 
C=] co [=] a3b1 —aj,b3 
C3 aybz — aby 


We will simply use the above as our Definition of the vector product: 


Definition 173. Let a= (a1, a2,a3) and b = (bi, b2,b3) be vectors. Their vector product, 
denoted a x b, is this vector: 


azb3 — agbo 
ax b= a3b1 — a;b3 


aybz — aad, 





By the way, no need to mug Definition 173, because it’s already on List MF26 (p. 4). 
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Example 907. The vector product of a = (1,2,3) and b = (4,5, 6) is 


4 2-6-3-5 -3 
axb= ee eee re ae (oa 8 
3 6 1-5-2-4 -3 
From our above discussion, we already know that axb 1 a,b. Indeed, this was the 


geometric property that motivated our definition of the vector product. Nonetheless, as 
an exercise, let’s go ahead and verify that (a x b)-a=0 and (ax b)-b=0: 


(ax b)-a = (-3,6,-3)-(1,2,3) = -3+12-9=0, J 
(ax b)ob = (-3,6.=3)- (48,6) -=19 4-30-1810: 7 
A 


Y 
b=(4 
a x b = (-3,6, -3) ( 0,6) 





Example 908. The vector product of u = (1,0,-1) and v = (3,-1,0) is 
pel) 
= = 14) 
1-(-1)-0-3 
Again, let’s verify that ux v 1 u,v: 
(uxv)-u =(-1,-3,-1)-(1,0,-1) =-1+0+1=0, 
(ux v)-v =(-1,-3,-1)-(3,-1,0) =-3+3+0=0. 


AY 
u = (1,0,-1) 


u x v = (-1,-3,-1) 
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Here’s the formal statement of the vector product’s key geometric property: 


Fact 156. Suppose a and b are vectors withaxb#0. Thenaxbta,b. 


Proof. See Exercise 266(c). 














Exercise 266. Let u = (0,1,2), v = (3,4,5), w = (-1,-2,-3), and x = (1,0,5). 


(a) Find ux v and verify that ux v 1 u,v. 
(b) Find w x x and verify that w x x | w,x. (Answer on p. 1862.) 
(c) Let a = (a1, a2,a3) and b = (6, 62,63). Prove that (ax b)-a=0 and (ax b)-b=0. 





All of our results about the vector product in 2D space continue to hold in 3D space and 
are now reproduced. First, it remains true that the vector product is distributive and 
anti-commutative. Moreover, the vector product of a vector with itself is zero: 


Fact 145. Suppose a, b, and c are vectors. Then 


(a) ax(bt+c)=axbtaxc (Distributive over Addition) 
(b) axb=-bxa (Anti-Commutative) 
(c) axa=0 (Self Vector Product Is Zero) 








Proof. See Exercise 267(a), (b), and (c). 











Next, we have the following “obvious” property reproduced from before: 
Fact 147. Suppose a and b are vectors andcée R. Then 


(ca) x b=c(axb). 

















Proof. See Exercise 267(d). 


Exercise 267. Suppose a = (a1, d2,a3), b = (bj, ba, 63), ¢ = (C1, C2,¢3), and de R. 


(a) Prove that ax (b+c)=axb+t+axce. 


(b) First verify that (4,5,6)x(1,2,3) =-(1,2,3)x(4,5,6). Then prove that axb = -bxa. 
(c) Prove that axa=0. 
(d) Let de R. Prove that (da) x b=d(axb). (Answer on p. 1863.) 





Also, from Fact 145(c), we again have the following result. The proof is exactly the same 
as before and is simply reproduced: 


Corollary 24. [fa || b, thenaxb=0. 


Proof. If a || b, then there exists c # 0 such that ca = cb. So, 











axb=ax (ca) =c(axa)=c:0=0. 
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And again, we have the converse of Corollary 24: 


Fact 146. Let a and b be non-zero vectors. Ifaxb=0, thena || b. 


Proof. See p. 1628 (Appendices). 














So again, together, Corollary 24 and Fact 146 yield 


Corollary 25. Suppose a and b are non-zero vectors. Then 


ax b=: <= a'||'b: 


Example 909. Let s = (1,2,3) and t = (2,4,6) be vectors. Since s || t, by Corollary 25, 
we must have s x t = 0, as we now verify: 


You can verify that c x d 1 c,d. 


Note that c and d “nearly” but do not point in exact oppo- 
site directions. If they pointed in exact opposite directions 
(and were thus parallel), then by Corollary 25, their vector 
product would have to be the zero vector, i.e. cxd = 0. 
Which isn’t the case here. 


z 
<4 





cxd= (-2, 4, =) 





The next result says that a vector is parallel to a x b if and only if it’s perpendicular to 
both a and b: 


Fact 157. Suppose a, b, and c are vectors, witha + b. Then 


c||axb —a €.l-a./b: 








Proof. See p. 1628 (Appendices). 
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68.1. The Right-Hand Rule 


Given two non-parallel vectors a and b, there are exactly 
two (unit) vectors that are perpendicular to both a and b. 
One is (the unit vector of) our vector product a x b. The 
other is (the unit vector of) the vector that points in the 
exact opposite direction—this, of course, is simply —a x b. 


The vector product a x b is defined so that it satisfies the 
right-hand rule. To see why, have the palm of your right 
hand face you. Fold your ring and pinky fingers. Have 
your thumb point right, your index finger up, and your 
middle finger towards your face. Then these three fingers 
correspond to the vectors a, b, and c. (Try it!) 








In contrast, —a x b, the other vector that’s perpendicular to a and b, satisfies the left-hand 
rule. (Can you explain why?) 


Fun Fact 


Why do we use the right-hand rule rather than the left-hand rule? One possible 
explanation might be that the right-handed majority is, as usual, being tyrannical. 


But more likely, this is simply an arbitrary convention, similar to how most of the world 
drives on the right (Remark 23). 


Indeed, according to Mitiguy (2009),°#* 


Until 1965, the Soviet Union used the left-hand rule, logically reasoning that the 
left-hand rule is more convenient because a right-handed person can simultane- 
ously write while performing cross products. 








“The short answer is that (i) we earlier adopted the convention that our coordinate system obeys the 
right-hand rule; and (ii) (a,b,a xb) is positively oriented with respect to (i,j,k) (what exactly 
positively oriented means is the bit that’s beyond the scope of this textbook). Had we instead adopted 
the convention that our coordinate system obeys the left-hand rule, then as currently defined, our vector 
product ax b would also obey the left-hand rule. 

348T can’t find any other sources for this story. So I’d treat it as apocryphal rather than fact. 
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68.2. The Length of the Vector Product 


As before, the vector product a x b has length |a||b|sin@. Formally, 


Fact 148. Suppose @ is the angle between the vectors a and b. Then 


|a x b| = |a| |b] sin @. 

















Proof. Exercise 268 guides you through a proof of this Fact. 


Exercise 268. Let 6 be the angle between the vectors a = (a1, a2, a3) and b = (bj, bo, b3). 


(a) Express |a|, |b], ja x b], and cos@ in terms of aj, a2, a3, b1, b2, and bs. (You need not 
expand the squared terms. ) 

(b) Since 6 € [0,7], what can you say about the sign of sin@? (That is, is sin@ positive, 
negative, non-positive, or non-negative?) 

(c) Now use a trigonometric identity to express sin # in terms of cos 6. (Hint: You should 
find that there are two possibilities. Use what you found in (b) why you can discard 
one of these possibilities. ) 


(d) Plug the expression you wrote down for cos@ in (a) into what you found in (c). 


(e) Prove the following algebraic identity.*“? (Hint: Fully expand each of LHS and RHS. 
Then conclude that LHS = RHS.) 


(a7 + a3 + a2) (b7 + B3 + 63) — (ayy + ayby + agb3)* 


= (agb3 = agby)” + (a3b, = aba te (a,b> = eo é 


(f) Use (a) and (d) to express |a||b| sin @ in terms of a1, a2, a3, b1, b2, and b3. Then use 
(e) to prove that 


|a x b] = |a| |b] sin @. (Answer on p. 1864.) 








*49By the way, this is again simply an instance of Lagrange’s Identity. 
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68.3. The Length of the Rejection Vector 


In 2D space, the vector product was a scalar (real number). In contrast, in 3D space, it is 
a vector (hence the name). 


Nonetheless and perhaps surprisingly, Fact 149—which says the rejection vector’s length is 
given by the vector product—remains true and is now reproduced: 


Fact 149. Suppose a and b are vectors. Then 





|rejpal = la x b| 














Proof. See p. 1627 (Appendices). 


Example 911. 1 The points A = a 5,=2),, 8 = (2.8, 3,1), and C’ = (2,7,-1) form a right 
triangle, with Ap= (1, -2,3), AC= (1,2,1), and BC = (0,4, -2). =) 


The lengths of the line segments AB and AC are simply 
AB =4/12+(-2)?+3?=V/14 and |AC| Ee 2/6, 


As an exercise, let’s verify that Fact 149 C = (2,7,-1) 
“works”. Observe that 


AB = -rej.BC and AC = rej BC. 
And so, by Fact 149, 


BC xAC| _\(s,-2,-4) a ae 
ey an 


S70 (9) a4) eee od 
ie i ae ey ee 
14224 7 6 Z 


JBC x AB] \(g,-9, -a)| 
|AB| |(1, -2, 3)| 


2a) aD A sd 
unas = —_ = V6. S. 
12 + (-2)° +32 14 


AB) - [BC x ae - 





[Ac = [BC x 4B] - 
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69. The Foot of the Perpendicular (3D) 


As in Ch. 62, we reproduce from Ch. 15.1 the following result and definition: 


Corollary 4. Suppose A is a point not on the line 1. Then there exists a point B that 
is both (a) the unique point on | that’s closest to A; and (b) the unique point on | such 
thatl 1 AB. 


Definition 174. Let A be a point that isn’t on the line 1. The foot of the perpendicular 
from A to lis the (unique) point B on / such that AB 1 I. 





In 3D space, this result from Ch. 62 still holds: 
Fact 150. Let 1 be the line R= P+2Av (AER). Suppose A is a point not onl. Then the 
point B that is (a) the unique point onl that’s closest to A; and (b) the unique point on 
l such that 1 1 AB is 


Bp Je 2p proj, PA. 





In Ch. 62, we already proved both of the above results generally (i.e. in 2D, 3D, and also 
higher dimensional spaces). 


Again, the distance between a point and a graph is the minimum distance between them: 


Definition 61. The distance between a point A and a graph G is the distance between 
A and B, where B is the point on G that’s closest to A. 


Corollary 27. Suppose B is the point on the line | that’s closest to the point A. Then 
—_ 
the distance between A and I is |AB . 
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Example 912. Let A = (1,2,3) be a point and / be the line r = OP + v = (0,1,2) + 
A(9, 1,3) (Ae R). Let B be the point on / that’s closest to A (B is also the foot of the 
perpendicular from A to 1). & 


As in Ch. 62, we’ll find B or the distance between A and / using four or five methods. 


A 
y Aa (l23) 








z 


Method 1 (Projection Vector). Compute Pe (0,1,2)=—(1,2,3)=(1,1,1) and 


(1 e193) 9+1+3 1 
9,1,3) = Gol) = (9, 13): 
92 + 12 + 3? a 91 ae 7(9,1,3) 





proj,PA = projig;3) (1,1,1) = 
Now simply apply Fact 150: 
— 1 1 
B=P+proj,PA = (0,1,2) + 79, La) = 7 (0.8, U7). 


And the distance between A and I is 


ere a 1 
[AB = |; (9,8, 17) oi (1,2,3) = F (2,-6,-4) 


=> 92 4 (-6) + (-4)° = =V5 = =Vix 7a yf? 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 
—2 


Method 2 (Quadratic-Cartesian). The line / has cartesian equations ; =y-l= i 5 


The distance between A and any arbitrary point R= (9r,r+1,3r+2) on / is 


V/ (9r - 1)? + (r+1-2)? + (8r +2-3)7 = V9lr? - 26r +3. 


As usual, this last surd expression is minimised when 


pie =i 7 1 


2a? 2x91 7 





I 
Hence, i= - (Osc), 


And the distance between A and | is 


D, 
yo1x(=) -26xh432y/8-Bss-yfb-nf? 
i 7 (a 7 7 


Method 3 (Quadratic-Vector). Pick any arbitrary point R = (0,1,2)+A(9, 1,3) on l. 
The distance between A and R is 


|AR| = [OR - OAl = |(0,1,2) + A(9, 1,3) - (1,2,3)| = |(9A-1,A-1,3-1)| 


=\/ (9-1) + (A-1) + (84-1) = V912 — 261 43. 


As usual, this last surd expression is minimised when 


) E Sipe —26 7 1 
a me ea 





i] 
Hence, B= (0, 1,2)+ 2 (9; 1,3) = =(9,8, 17). 


7 


And the distance between A and I is 


yoix(z pire VE reve NT 
7 | y a 


(Example continues on the next page ... 
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(... Example continued from the previous page.) 


Method 4 (Scalar Product). Let B = (0,1,2) + r4(9,1,3). Then AG = OB=0A= 
(ae aie esi 


We have AB 1 L, or ABL (91,3), or AB- (9,1,3) =0, or 


0 = (9A, - 1, Ay — 1, 3A, - 1) - (9, 1,3) = 819 - 9 + Ay — 14+ 9AQ- 3 = 912) - 18, 


1 1 
or Ay = 1/7. Hence, B = (0,1,2) + 79, 3) = 7 (9, 8,17). 


And the distance between A and | is 
AB S(t 1 


A oe x6 
49 49 “3° : 


Method 5 (Vector Product). Recall that AB is rejyPA—the rejection of PA on v. 
Recall also Fact 149: rej PAl = IPA x 4. So, 


[AB] = [rej PA] = |PA x o| = |(1,1,1) x (9, 1,8)/V9? + 12 + | 
= |(3-1,9-3,1-9)|/V91 = 1/22 + 62+ (-8)’/V91 = \/104/91 = /8/7. 


(Example continues on the next page ...) 





Formal statement of Method 5, reproduced from Ch. 62: 


Corollary 28. Let A be a point. Suppose | is the line R= P+2Av (AER). Then the 





———_> me 
distance between A and l ts IPA x |. 


Another example illustrating the four or five methods: 
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Example 913. Let A = (-1,0,1) be a point and I be the line described by 
r= OP v=, 125, 1,2) “OeR). 


Method 1 (Formula Method). First, PAS (-1,0,1) - (8, 2,1) = (-4,-2,0). So, 


Pie proj,PA = projs,1 2) (-4,-2,0) = a et 


(oye) 


—V-2-0 22 11 
2 5. 2) = 1) Se = 5, 1). 
30 ome) 39 (12) Ta 


And so by Fact 150, the foot of the perpendicular from A to / is 


_ 11 1 
B = P + projyPA = (3,2,1) - 7, (5,1,2) = 7, (10,19, -7). 


Y 


A= (-1,0,1) 





By Corollary 28, the distance between A and 1 is 


antes eed 512 -4,8,6 58 
[BA| = |PA x o| = (4-20 x aS = an = ve 
Method 2 (Perpendicular Method). Let B = (3,2,1) + (5, 1,2). Write down AB: 
AB = B- A= (3,2,1) + \(5,1,2) - (-1,0,1) = (5X +4,A 42,20). 
Since AB 1 1, we have AB Lv or, 
0=(5A+4,A+2,2A)-(5,1,2) =5(5A+4) +(A+2) +2 (2A) = 30d + 22. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Rearranging, \ = -22/30 = -11/15 and so, 
B=, 2. Ita = (5,1,2) =a (- 10,19, -7). 
Happily, this is the same as what we found in Method 1. And now, 
— il t] 
AB=B-A= is (-10, 19, -7) - (-1,0,1) = 7 (5,19, -22). 


Thus, the distance between A and I is 


AB] = = (6, 19, -22)| = —v/'5? + 192 + (-22)7 V'58+ 192 + (-22)? = - 


Method 3 (or the Calculus Method). Let R be a generic point on /, so that AR = 
(5A +4,A+2,2X) and the distance between A and R is 


JAR] = / (5A +4)" + (A +2)" + (2d) = VB002 + 44X +20. 


Again, first differentiate the expression 30? + 44 + 20 with respect to .: 


* (30)? + 44) + 20) = 60) + 44. 


Then by the First Order Condition (FOC), we have 


~ 44 11 
Ay ee A= -—, 
(60 + 44) |,_; =0 ae 
Happily, this is the same as what we found in Method 2. And now, as before, we can find 
B and |AB . Alternatively, we could simply have found \ by using “_b/2a”: 


44 ll 


i, eee ee 
be 59 


Exercise 269. For each of the following, use all three methods you just learnt to find 
the foot of the perpendicular from A to 1; and the distance between A and I. 

The point A The line / Answer on p. 
(a) (Go) r=(8,3,4) 4A.(9, 3,7) 1865. 
(b) (5,052) Contains the points (4,4,3) and (6, 11,5) 1866. 
(c) (8, 5,9) r = (8,4,5) + (5,6, 0) 1867. 
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70. Planes: Introduction 


In the remainder of Part III (Vectors), we’ll study planes. 


Informally, a plane is a “flat 2D surface” (like a piece of paper). A bit more formally, a 
plane is, like a line, simply a set of points. 


Let’s start by taking a quick look at some examples of planes. 


Example 914. Consider the plane q described by the cartesian equation x = 1. & 


It contains exactly those points whose x-coordinate is 1. So, it contains A = (1,0,0), 
B=(1,1,1), C=(1,3,1), and every other point with x-coordinate 1. 


In contrast, it does not contain (2,0,0), (7,1, 1), (v2.3, 1), or any other point whose 
x-coordinate isn’t 1. 


The plane g described by x = 1 


=iiio kh) 


=( tl) 











Formally, the plane g is a set of points: 


q={(z,y,2):2=1}. 


In words, gq is the set containing exactly those points (2, y,z) whose x-coordinate is 1. 


You should take a moment to convince yourself that the plane qg, which is the set of points 
whose x-coordinates are 1, does indeed form a “flat 2D surface”. 
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Example 915. Three planes are described by these cartesian equations: 


Z2 1, and  g= 5. es) 


Later on, we will learn what it means for two planes to be parallel and how to calculate 
the distance between two planes. But for now, we merely assert that “obviously”, 


e The three planes are parallel. 
e The distance between the first and second planes is 2. 


e The distance between the second and third planes is also 2. 





Exercise 270. The planes q and q are described by y = 2 and z =3. 
(a) Sketch the graphs of both planes in a single figure. 


Then find two points that are on 


(b) gq but not g; 
(c) go but not q; and 
(d) Both q and q. (Answer on p. 1868.) 
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More examples: 


Example 916. Consider the plane gq described by the cartesian equation y = 22. 


It is the set of points (2, y,z) that satisfies the equation y = 2x. Formally, 


Os {((2,y,z): y= 2r}. 


So for example, it contains 
The origin O (0, 0,0) because 
The point A (12.3) because 
The point B (-1,-2,0) because 

In contrast, it does not contain 

¢ The point C (iS) because 

« The point D (-1,2,0) because 


The plane 
v= 22 


z 





° C= (1,0,3) 


Also, the plane g contains the lines y = 2x7, z = 0 and y = 22, z =3. 





Indeed, for every k € R, the line y = 2x, z = k is contained in the plane q.*” 





°0Here’s a proof of this assertion. Consider the line y = 2x, z = k. Let P be any point on the line. Observe 
that P obviously satisfies the plane’s equation y = 27. So, Péq. We have just shown that any arbitrary 


point P on the line is also on g. Hence, qg contains the line. 
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Example 917. Consider the plane q described by the cartesian equation x + y = z. 


It is the set of points (x,y,z) that satisfies the equation x + y = z. Formally, 


q={(t,y,2)i ety =z}. 


So for example, it contains 
The origin O (0, 0,0) because 
The point <A (23) because 
The point B (-1,1,0) because 
In contrast, it does not contain 


«The ) pomt C= (2,03) because 
« The point D (-1,2,0) because 


N 


a 
B =(-1,1,0) \ 





Note that as depicted, D is “behind” the plane. 





As the above examples suggest, it turns out that in general, any plane gq is simply the 
graph of this cartesian equation: 


ax + by+cz=d, 


where a,b,c,d€R (and at least one of a, 6, or c is non-zero). 
In other words, the plane q is the set of points (x,y,z) that satisfy ax + by + cz =d. 


Formally, the plane gq is a set of points: 
q={(2,y,z):axt+byt+cz=d}. 
In the coming chapters, we will explain why a plane may be described by the above cartesian 


equation. We will also learn what the vector (a,b,c) and the number d mean geometrically. 
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70.1. The Analogy Between a Plane, a Line, and a Point 


In 3D space, 


A plane is a two-dimensional object and can be described by one equation(s). 


4 


and two 


and ' three e 


A line “ one-dimensional 


4 


A point “ zero-dimensional 


Example 918. We are given 3D space, the set of all points. 


We first impose the equation or constraint x = 4. That is, we keep only those points (in 
3D space) that satisfy the equation x = 4 and “throw away” all other points. This leaves 
us with the plane g. We say that the plane g is described by one equation: 

ga. 
“3-—1=2”: By imposing 1 constraint on 3D space, we end up with a 2D object. 


The plane q: © = 4 





Now suppose we also impose the constraint y = 5. That is, we take the plane gq, but keep 
only those points on q that satisfy the equation y = 5 (and “throw away” all other points). 
This gives us the line /. We say that the line / is described by two equations: 


r=A4 y=, 


“3-2=1”: By imposing 2 constraints on 3D space, we end up with a 1D object. 
Finally, we impose a third constraint z = 6. That is, we take the line /, but keep only 
those points on / that satisfy the equation z = 6 (and “throw away” all other points). 
This gives us the point P. Hence, the point P is described by three equations: 

¢=4, Ga, wel - 2 6. 


“3-3=0”: By imposing 3 constraints on 3D space, we end up with a OD object. 
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We observe that each additional equation (or constraints) “chops off” a dimension. This 
observation also generalises to spaces of other dimensions. For example, in 2D space, 


“2-1=1": <A line isa1D object that can be described by one equation(s). 
“2-2=0": A point “ OD “ two “ 


Example 919. In this example, we return to 2D space. 
We first impose the equation or constraint x = 4. That is, we keep only those points (in 
2D space) that satisfy the equation x = 4 and “throw away” all other points. This leaves 
us with the line /. We say that the line / is described by one equation: 

e=4, 


“2-1=1”: By imposing 1 constraint on 2D space, we end up with a 1D object. 


A 
y The line & ¢=4 








Now suppose we also impose the constraint y = 5. That is, we take the line /, but keep 
only those points on I that satisfy the equation y = 5 (and “throw away” all other points). 
This gives us the point P = (4,5). We say that the point P is described by two equations: 


z=A4 y=. 





“2-2=0”: By imposing 2 constraints on 2D space, we end up with a OD object. 


Similarly, in 4D space (not in H2 Maths): 


ge es as One equation describes a 3D space. 

“A-2=2”: A plane is a 2D object that can be described by one equations. 
“A-3=1": A line “ 1D i two - 
“4—-4=0”: A point “ OD 7 three " 


And so on and so forth in all higher-dimensional spaces as well. 
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71. Planes: Formally Defined in Vector Form 


Example 920. Let qg be the plane that contains the points A = (1,0,0), B = (0,1,0), 
and C' = (0,0,1). Informally and intuitively, a plane is a “flat surface”. 


Since it’s a “flat surface”, there must be The plane q 
some vector that is perpendicular to it. 

We will call any such vector a normal 

vector of the plane.*”! 


To find a normal vector of q, all we need do 

is pick any two vectors on g and compute 

their vector product. 

Let’s pick, — say, AB = (-1,1,0) and 

AC =(-1,0,1). Their vector product 

(which we'll call n) is ~ C= (10,0) 


As we learnt in Ch. 68, the vector product n= (1,1,1) must be perpendicular to both 


AB and AC . It turns out that n is also perpendicular to every vector on q (we'll formally 
state and prove this as Fact 164 below). And so, we call n anormal vector of g. & 


Now, let R denote a generic point on g. Then the vector AR is on gq. Which means 
AR in= lea Fala or equivalently, AG =o! 
Let’s manipulate this last equation a little: 
Geo lilaboh 20 
(OR-OA)-(1,1,1) =0 ( AR = OR-OA) 
OR 1, b= O71 a1) 20 (Distributivity) 


Oni) =O4-da 1) 
OR 1) SCO Oecd, 


We say that this last equation OR: (1,1,1) =1 is a vector equation that describes the 
plane q. To be a bit more formal and precise, we’d write q as this set (of points): 


q={R:OR-(1,1,1) =1}. 


In words, q is the set containing exactly those points R that satisfy OR- (1,4, 1).= 1, 





(Example continues on the next page ...) 





°1We’'ll formally define what a normal vector of a plane is in Definition 177. 
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(... Example continued from the previous page.) 


If we let r denote the position vector of the generic point R, then here is another vector 
equation that also describes q: 


r-(1,1,1)=1. 


Again, to be a bit more formal and precise, we’d write 


@=4 Aer (lye 1h 





In words, q is the set containing exactly those points R that satisfy r-(1,1,1) =1. 
The above discussion motivates the following formal definition of a plane: 
Definition 175. A plane is any set of points that can be written as 
= 
{R:OR-n=d} or {Arend}, 


where n is some non-zero vector and d¢€ R. 





In words, a plane is the set containing exactly those points R that satisfy 
— 
OR-n=d or r-n=d. 


We can also say that a plane is described by either of the above vector equations. 


By the way, in the above example, we spoke of vectors being on a plane. It’s probably a 
good idea to formally and precisely define what this means: 


Definition 176. A vector v is on a plane q if there are points S,T € q such that v = ST. 
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Example 921. The plane gq = {R :OR- (=3,0),2) = -5} contains 


¢ the point A= (1,0,-1) because A satisfies the plane’s vector equation: 


GA ae) (23. a 


e the point B = (3,1, 2): 


CEP Ge ee ee 


¢ but not the point C = (9,1, 1): 


COEUR Eo ere), 


The plane q 








Since g contains the points A and B, the vector AB is on q. (We could also say that “q 
— 
contains the vector AB,” but this is a bit sloppy—n. 109.) 


— SS 
Now, are the vectors AC and BC on q? As you may have guessed, the answer is, “No, 
they are not.” But to prove this, we’ll have to wait until Fact 163 below.?”” 





3°2Here’s an incorrect “proof”: “C' is not on g. Therefore, the vectors AC and BC not on g.” This proof 


—- —> |) Ol > 
is incorrect because in order to prove that AC’ is not on q, we need to prove that AC + PQ for any two 
points P and Q on q. The mere observation that C’ is not on q does not suffice. 
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Remark 110. Strictly speaking, this statement is wrong: 
“Let q be the plane OR: (=3,0,2)= 6,” x 


This is because g is not an equation but a set of points. So, strictly speaking, we should 
write 


2 “Let g be the plane {R :OR:- (-3,0,2) = Bh J 


Nonetheless, having to always write Statement 2 gets tiresome. So, we'll be lazy and 
sloppy and often write the (incorrect) Statement 1, with the understanding that what we 
really mean is the (correct) Statement 2. 


(We already made this same point earlier with lines—n. 323.) 


Exercise 271. Does the plane gq = {R -OR: (5, 7,9) = -1} contain any of these points: 
A= (5,-3,1), B = (1,-2,6), C = (-2,2,-3)? (Answer on p. 1869.) 
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Our first result about planes is simple and intuitively “obvious”: 





The plane q 
The line AB Since g contains A and B, 
A it also contains the line AB. 


B 














Proof. See p. 1630 (Appendices).?°? 


Example 922. In the last example, we verified that the plane gq = {R :OR- (-3, 0,2) = 5} 


contains the points A = (1,0,-1) and B = (3,1,2). By the above Fact then, qg also contains 
every point in the line AB. 


Remark 111. A couple of pages ago, we made the point that strictly speaking, a plane 
cannot contain a vector (Remark 109). 


Here we can make a similar remark: Strictly speaking, a plane cannot contain a line 
(equivalently, a line cannot be in or an element of a plane). 


This is because a line is a set of points (where a point is an ordered triple of real numbers). 
A plane is also a set of points. But one set of points cannot be in or an element of another 
set of points. 


Instead, what’s possible is for a set of points to be a subset of another set of points. In 
particular, a line can be a subset of a plane. 


Nonetheless, as with planes and vectors, we shall be sloppy with planes and lines: 


We will often go ahead and say that “a line is on a plane” or “a plane contains a line”, 
even though strictly speaking, either statement is incorrect. We’ll make such statements 
with the understanding that what we really mean is that “the line is a subset of the 
plane” or “the plane contains all the points in the line”. 


And so, continuing with the last example, we’ll often be sloppy and say that “the line 
AB is on q” or “q contains the line AB” (even though strictly speaking, either statement 
is incorrect). 








3°3T have relegated many of the proofs in this chapter to the Appendices even though they are not difficult 
and would usually have been in the main text. However, this chapter contains an inordinate number of 
results and I decided to do so to avoid overwhelming the student. 
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Example 923. Let q be the plane r- (4,1,5) = O and / be the line 
r=OA+ Av = (-1,-1,1) +A(5,0,-4) (Ac R). =) 


The plane g contains the line /. Here are two Ay 
ways to show this: The plane q 


Method 1. Observe / contains the points 
A=(-1,-1,1) and B= A+v=(-1,-1,1)+ 
(5,0, -4) = (4,-1,-3). 


But gq also contains A and B (as you should 
be able to verify). And so, by the above Fact, 
q contains the line AB, which is also the line The line / 
if 


Method 2. Let R= (-1,-1,1) + A(5,0, -4) 
be an arbitrary point on the line /. We show 
that R satisfies q’s vector equation: 





OR -(4,1,5) = Kiet ee + (5,0, -4)] -(4,1,5) 
=4(-1+5A)+(-1) +5(1-4A) 
=-4-20A\-14+5+20A =0. 


We’ve just shown that g contains the arbitrary point R on /. So, g contains every point 
on |. (Sloppily, we may say that q contains /.) 


Exercise 272. Suppose the plane q is described by r- (4, -3,2) = -10, while the line / is 
described by r = (7,3, 1) + A(8,6,-2). Determine if qg contains /. (Answer on p. 1869.) 





We next examine the normal vector in greater detail. 
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71.1. The Normal Vector 


A plane’s normal vector is perpendicular to every vector on that plane. Formally, 


Definition 177. A normal vector of a plane is a vector that’s perpendicular to every 
vector on that plane. 


More simply, instead of saying, “n is a normal vector of the plane g,” we'll also say, “n is 
normal to q”. And as shorthand, we’ll write n 1 q. 


Not surprisingly, the vector n used in Definition 175 of the plane is a normal vector: 


Fact 159. If q= {R : OR-n = d} is a plane, then n 1 q. 

















Proof. See p. 1630 (Appendices). 


Example 924. Let qg be the plane r-n =d. 


By the above Fact, n 1 q—i.e., n is 
perpendicular to every vector on q. 


So if the vectors u, v, and w are on q, 
then n 1 u,v, w. 





A plane’s normal vector is not unique: 


Example 925. Let q be the plane r-(1, 1,1) = 1, 
so that n = (1,1,1) is a normal vector of gq. Let 


m= (2.2.2) 
u = (-1.5, -1.5, -1.5) 
ie) 

The vectors m = 2n, u = -1.5n, and 


v = V5n are parallel to n. So, they are 
“obviously” also normal vectors of q. 





As the above example suggests, if n is a normal vector of the plane qg, then so too is any 
vector m that’s parallel to n: 


Fact 160. Let q be a plane and n and m be vectors. Supposen 1 q. Then 





m|n = mig. 





Proof. See p. 1630 (Appendices). 
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It turns out that the converse of Fact 160 is also true. That is, suppose n is a normal vector 
of the plane q. If m is also a normal vector of g, then m || n: 


Theorem 18. Let q be a plane and n and m be vectors. Supposen 1 q. Then 


mig => mn. 








Proof. See p. 1633 (Appendices). 











Fact 160 says that a normal vector n of a plane q is not unique: Every vector parallel to 
n is also a normal vector of gq. 


Theorem 18 then says the converse: Only vectors that are parallel to n are normal vectors 
of q. 


Combine these two results together: 


Corollary 29. Let q be a plane and n and m be vectors. Supposen 1 q. Then 


mig — min. 





In other words, given a plane’s normal vector n, that plane’s normal vectors are exactly 
those which are parallel to n. 


Example 926. Let ¢ again be 
the plane r-(1,1,1) =1. is} The plane q 


By Corollary 29 ( <= ), ev- 
ery vector that’s parallel to 
n = (1,1,1) is also normal to gq. 
For example, the following vec- 
tors are are parallel to (1,1, 1) 
and so also normal to gq: 





Conversely, by Corollary 29 (==> ), every normal vector of g must be parallel to (1, 1,1). 
For example, the vectors a = (1,2,3), b = (0,-1,2), ce = (1,1,0.9), and 0 are not parallel 
to (1,1,1) and so are not normal to q. 


The vector d = (1,-1,0) is on the plane q and, as depicted, d 1 n,u,v, but d f a,b. 


Exercise 273. The plane q is described by r-(1,-1,1) = -2. Determine if a = (2, -2,2), 
b=(2,2,=2), and c= (-v2, /2, -/2) are normal vectors of q. (Answer on p. 1869.) 
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Suppose the plane g can be described by r-n = d. If m = kn for some k # 0, then “obviously”, 
q can also be described by 


For future reference, let’s jot this down as a formal result: 


Fact 161. Suppose q is a plane with q={R:r-n=d}. 
If m=kn for some k #0, then we also have q={R:r-m = kd}. 


Example 927. The plane q described by r- (1, 2,3) = 4 can also be described by 


r-(2,4,6)=8, r+ (-1,-2,-8)=-4, or r-(V5,2V5,3V5) = 4V5. 


Example 928. Let ¢ be the plane that 
contains the points A = (9,0,5), B = 
(-2,1,1), and C = (3,-5,4). & The plane q 


Let ’s first write down 
two vectors on q: 


AB = (-11,1,-4), 
AC = 


So, a normal vector of q 


a 


C254) 


Compute d = OA-n = (9,0,5)- (-21, 13,61) = -189 + 0+ 305 = 116. 
Hence, g may be described by r- (—21, 13,61) = 116. 


Another normal vector of q is 2(—21, 13,61) = (-42, 26,122). Thus, q may also be de- 
scribed by r- (—42, 26, 122) = 2-116 = 232. 
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Exercise 274. The plane q contains the points A = (1,-1,2), B = (-2,3,0), and C = 
(0,-1,1). (Answer on p. 1869.) 
(a) Find a normal vector of gq. 

(b) Hence write down a vector equation that describes the plane gq. 

(c) Write down another normal vector of q. 

(d) Hence write down another vector equation that describes the plane q. 

Exercise 275. Let g, 1, v, and P be, respectively, a plane, a line, a vector, and a point 
(all in 3D space). Explain whether each statement (i) could be perfectly correct; or (ii) 


is only ever strictly speaking incorrect, but nonetheless written by us anyway (because 
we're sloppy and lazy): (Answer on p. 1869.) 


(a) qcontains/;  (b) qcontains P; (c) Icontains P; (d) gq contains v; 


(e) lisonoring; (f) Pisonoring; (g) Pisonorinl; (h) visonor ing. 
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Next, let g be a plane with normal vector n. Definition 177 says that if a vector v is on q, 
then it must also be perpendicular to n. 


It turns out that the converse is also true. That is, if v is perpendicular to n, then it must 
be on q. Formally, 


Fact 162. Suppose q is a plane with normal vector n. Then 


Vn = Vv 1s On q. 








Proof. See p. 1632 (Appendices). 











Putting Definition 177 and Fact 162 together, a plane’s vectors are exactly those that are 
perpendicular to its normal vector: 


Corollary 30. Suppose q is a plane with normal vector n. Then 
vin <— visongq. 


Example 929. Let n= (5,1,6) be a vector and q be the plane r-n = -3. As you can 
verify, g contains the points A = (-1,2,0), B = (-2,1,1), and C = (3,0, -3). 


Since A, B, and C are on q, so too are the vectors AB = (-1,-1,1) and AC = (4, -2, -3). 
Hence, we should have AB, AC 1 n, as we now verify: 

AB ek oo) = 1-6 = a7 

AC n= G2? 3) roo 2 18 


Now, is the vector u = (-3,3,2) on the plane q? 


Well, it’s not obvious. We could try to find two points S and T on q such that u = aie 
But this could be slow and laborious. 


A quicker method would be to use Corollary 30. That is, simply check if u 1 n: 
u-n = (-3,3,2)-(5,1,6) = -15+3+12=0. "A 


So yup, u is on q.*4 


Let’s now consider the vector v = (-7,2,3). Is it on g? Again, simply check if v 1 n. 
v-n=(-7,2,3)-(5,1,6) =-35+2+18=-15. x 
So nope, v is not on gq. 


Exercise 276. Let q be the plane described by r- (8,-2,1) = 5. Are the vectors a = 
(3, 7,-0), b= (16,4), and ¢= (3, 10, 1) are on G7 (Answer on p. 1869.) 








>4For anyone not convinced, let D = A+u = (-1,2,0) + (-3,3,2) = (-4,5,2). We can verify that D € q. 
_- > 
And now, since A, D € q, by Definition 176, the vector AD = u is on q. 
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Suppose the plane qg contains the point P. Then g contains exactly those points R for which 
—_ 
the vector PR is on q. Formally, 


Fact 163. Let gq be a plane and P and R be points. Suppose Pe q. Then 


= 
Req The vector PR is on q. 

















Proof. See p. 1631 (Appendices). 


Example 930. The plane q = {R :OR:- (0,-2.3) = 1} 
contains the point A = (0,1,1). The point B is such 
that AB = (1,3, 2). 


Here are two methods for showing that B € q: 


Method 1. First find B = A+ AB = (0,1,1) + 
(1,3,2) = (1,4,3). Then show that B satisfies the 
plane’s vector equation: 


Ob n= (h 43) O23) 026010 ae 
Method 2 (quicker). Simply check if AB Ln: 


An n=U,4 7), 3) =0=62620. = v 





Yup, AB Ln. And so by Fact 163, B eq. 
We now revisit Example 921: 


Example 921. We already showed that the plane g = {R :OR:- (=3,.0,2) = -5} contains 
the points A = (1,0,-1) and B = (3,1,2), but not the point C = (9,1,1). We also 
concluded that the vector AB is on q (because q contains A and B). 


However, we were unable to say if the vectors AC and BC are on q. But now, with Fact 
163, we know that they are not. 


Exercise 277. The plane q is described by r-(7,-1,3) = 19 and A = (1,4,-1) is a point. 
== 
The point B is such that AB = (7,3,-2). Is the point B on q? (Answer on p. 1869.) 
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Given two (non-parallel) vectors on a plane, their vector product is normal to the plane: 


Fact 164. Ifa and b are non-parallel vectors on a plane q, thenaxb 1 q. 


Proof. See p. 1636 (Appendices). 














Now, suppose a and b are (non-parallel) 
vectors on a plane q. If v 1 g, then by Defi- 
nition 177, v L a,b. 


It turns out that the converse is also true. 
That is, if v 1 a,b, then v 1 qg. This is 
because if v 1 a,b, then by Fact 157, v || 
axb. But by Fact 164, axb tq. And so 
by Corollary 29, v 1 gq. 


Let’s jot this down as a formal result: 


Corollary 31. Suppose a and b are non- 
parallel vectors on a plane q. Then 


via,b — vig 





vile — Via b: 


Corollary 32. Suppose a and b are non-parallel vectors on the plane q. Then 


claxb Cc 1s On q. 














Proof. By Fact 164, a xb 1g. And so by Corollary 30,c Laxb <=> cis on gq. 





Example 931. The plane q contains the points A = (0,0,1), B = (4,2,0), and C = 
(-5,0,4). 


Let v = (6,-7,10). Isv iq? 


First, write down two non-parallel vectors on g. ‘Two obvious candidates are 
— — 
AB = (4,2,-1) and AC = (-5,0,3). 


Then check if v 1 AB, AC: 


A Gn ee = 0 oe 
ve AC =(6, 7,40) (25,03) == 30 0s 0 ne a7 


Since AB tt AC and v 1 AB, AC, by Corollary 31, v 1 gq. 


Exercise 278. The vectors a = (1,-1,1) and b = (-2,2,-2) are on the plane q. Is 
n = (0,1,2) a normal vector of g? What about m = (1,3,2)? (Answer on p. 1869.) 
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72. Planes in Cartesian Form 


Example 932. The plane q is described by the vector equation r-n=r-(1,2,3) 24. 


The plane q contains those points R = (x,y,z) whose position vector satisfies =. 
r n 


FS 
We can also rewrite = as (a,9,2) (12,3) = 4. 


But, (a, y,2)* (12,3) =a + 29+ 32. 





So, q is also described by the cartesian equation x + 2y + 3z £4. 
In general, suppose g is the plane described by this vector equation: 
r-n=(a,y,z)- (a,b,c) =d. 
Then qg can also be described by this cartesian equation: 
ax + by+cz£d. 
Formally, 


Fact 165. Suppose (a,b,c) is a non-zero vector anddéR. Then 


y :an+byt+ez=d}. 


Example 933. The plane described by r-(5, 0, -1) = 3 may also be described by 5a-z = 3. 


Example 934. The plane described by r- (-1,7,2) = 0 may also be described by —x + 
Ty +2z =0. 


Example 935. The plane described by 5x + 6y + 7z = 8 may also be described by r- 
(o,6,7) = 8. 


Example 936. The plane described by y = 5 may also be described by r-(0,1,0) = 5. 
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Fact 166. The plane described by r- (a,b,c) =d contains the origin if and only if d= 0. 





Proof. The origin is the point (x,y,z) = (0,0,0) and satisfies the equation r- (a,b,c) =d or 
ax + by + cz =d if and only if d= 0. So, the plane described by r- (a,b,c) = d contains the 
origin if and only if d=0. 














Example 937. Consider the plane q described by 
ax + by+cz=0 or F(a,026)=), 
Even without knowing a, b, and c, we know gq contains the origin. 
Example 938. Consider the plane q described by 
ax + by+cz=8 or Fo (a,0ec)="8, 
Even without knowing a, b, and c, we know gq does not contain the origin. 


Exercise 279. Each of the following is a plane given in vector form. Rewrite each in 
cartesian form and state if each contains the origin. (Answer on p. 1870.) 


(ye 1a) ale (b) r-(-1,0,-2) =0. (c) r-(0,-2,5) =-3. 


Exercise 280. Each of the following is a plane given in cartesian form. Rewrite each in 
vector form and state if each contains the origin. (Answer on p. 1870.) 


(a) v+5=17y+z. (b) y+1=0. (c) @4+2sy-2. 
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72.1. Finding Points on a Plane 


Example 939. The plane q is described in vector or cartesian form by 
r- (1.2.3) =4 or x+2Qy+3z=4. 
Given a plane’s cartesian equation, we can use trial-and-error to find points on that plane: 


Simply try out values of x, y, and z that satisfy the cartesian equation. (Tip: As always, 
zero is our friend.) 


So for example, these points are on q (as you should verify yourself): 
A = (4,0,0), B= (0,2,0), and ~C =(1,0,1). 
In contrast, these are not: 
D=(1,.0,0), B=(0,1,1),0 and & =(-3,4,4), 
Example 940. The plane q is described in vector or cartesian form by 
r-(3,1,1) =-4 or Bu t+y+2=-A4. 
These points are on q: 
A = (0, -4,0), B=(0,0,-4), and C=(-1,-1,0). 
These are not: 
D= (10,0), H=(0,1,1)y- and = =(—3,4 5). 
Example 941. The plane q is described in vector or cartesian form by 
r-(-5,1,0) =1 or =5n by = It 
These points are on q: 
A = (0,1,0), B=(0,1,1), and C=(-1,-4,0). 
These are not: 
D= (10,0), H=({0,2,1), and F=(-—3,4, 5). 


Exercise 281. For each plane (given in vector form), rewrite into cartesian form. Then 
find three points it contains and another three it doesn’t. (Answer on p. 1870.) 


(a) r-(0,0,1) = 32. (b) r-(5,3,1) =-2. (oy may? 3) 0) 
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72.2. Finding Vectors on a Plane 


Example 942. The plane q is described by r- (1,2,3) =4 or r+ 2y+ 32 =4. 
It has normal vector n = (a, b,c) = (1, 2,3). 


Recall (Corollary 30) that a vector v is on q if and only if v 1 n. So, to find vectors on 
q, we need simply find vectors that are perpendicular to n—that is, vectors whose scalar 
product with n is zero. 


We will now construct one such vector u = (uj, U2,u3). That is, we’ll pick values of uy, 
uz, and uz so that u-n = 0. 


As always, zero is our friend. Let’s start by picking us = 0: So, u = (u1, ua, 0). 


We'll now play a simple little trick. Suppose we set ug = —a and uy, = b: 
u = (b, -a,0) = (2,-1,0). 
Then things nicely cancel -ant (b, —a,0)- (a,b,c) = ba-ab+0=0. 
Et voila! By construction, u is perpendicular to n and is thus on q. 
Using the same method, we can easily construct two more vectors that are also on q: 
v = (c,0,-a) = (8,0, -1) and w = (0,c,—-b) = (0,3, -2). 


(As you can easily verify, v-n =0 and w-n =0.) 


And of course, the additive inverses of u, v, and w are also on gq: 
—u = (-2,1,0) -v =(-3,0,1) and -w=(0,-3,2). 


That qg contains the six vectors u, v, w, —u, —v, and —w is formally stated as Fact 167 
below. 


Even without the above method, we can easily find vectors that are on g. For example, 
it is not difficult to see that d = (1,1,-1) is also on q because 


d-n =(1,1,-1)-(1,2,3) =1+2-3=0. 


It is equally easy to show that a vector is not on qg. For example, e = (3,2,1) and 
f = (1,-1,1) are not on q because e-n # 0 and f-n #0 (as you can verify). 


Fact 167. The plane with normal vector (a,b,c) contains points whose differences are 
these vectors: 


(b,-a,0), (c,0,-a), (0,c,-b), (—b,a,0), (-c,0,a), and (0, -c, b). 
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Example 943. The plane q is described by r- (3,1,1) = -4 or 3u+y+z=-4. It has 
normal vector n = (a,b,c) = (3,1,1). And so by Fact 167, q contains these vectors: 


(b, -a,0) = (1,-3,0), (c,0,-a) = (1,0,-3), (0,c,-b) = (0,1,-1), 
(-b,a,0) = (-1,3,0), (-c,0,a) = (-1,0,3), (0,-c,b) = (0,-1,1). 


We can also easily find other vectors on q. For example, d = (-1,1,2) is on q because 
d-n = (-1,1,2)-(8,1,1) =-34+1+2=0. 
In contrast, e = (3, 2,1) and f = (1,-1,1) are not on q because e-n #0 and f-n#0. 


Example 944. The plane q is described by r-(—5, 1,0) = —4 or -5r+y = 1. It has normal 
vector n = (a,b,c) = (-5,1,0). And so by Fact 167, g contains these vectors: 


(G,—a, 0) = (1,5,.0), (e,0, a) = (0,0.4). (0,c,-b) = (0,0,-1), 
CaOv2 Clee Teeter Olen Oe = Ou, 


Actually, here we can make another useful and important observation. Notice that n’s 
z-coordinate is 0. And so, if a vector u = (w1, ua, us) is perpendicular to n (and is hence 
on q), then so too is the vector (uj, U2, A) for any A\€R. 


So for example, since (1,5,0) is on qg, so too are these vectors: 
(13,0), Go51), (1,5,-v2), (1,5,999),  (1,5,7), ete. 


Moreover, for any  € R, the vector (0,0,A) must be perpendicular to n. In particular, 
the standard basis vector k = (0,0,1) is perpendicular to n and is thus also on gq. 


Example 945. The plane q is described by r- (0,0,1) = 5 or z =5. It has normal vector 
n = (a,b,c) = (0,0,1). Observe that n’s x- and y-coordinates are both 0. 

Following the observation made in the previous example, if a vector u = (u1, U2, Us) is 
perpendicular to n (and so is on q), then so too is the vector (A, 1, u3) for any A, € R. 
Here we also make a new observatio: Since only the z-coordinate of n is non-zero, if 
u = (U1, U2, U3) Ln or u-n=0, it must be that uz = 0. 

Altogether then, the vectors that are perpendicular to n (and so are on q) are exactly 
those vectors that can be written as (A, u,0) for A, € R. So for example, the vectors 
(9,0,0), (0,5,0), and (-n, v2,0) are perpendicular to n (and so are on q). 


In contrast, the vectors (9,0,1), (0,5,2), and (-n, V2, 3) are not. 


Exercise 282. Find three non-parallel vectors on each plane: (a) r-(1,-2,3) = 0; (b) 
Pe(o.01)=—2) (ec) (1.04 j= 5. (dj = (0, 7-0) =32 (Answer on p. 1870.) 
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73. Planes in Parametric Form 


In the last two chapters, we learnt to describe planes in vector and cartesian form. In 
this chapter, we’ll learn to describe planes in parametric form. 


To do so, we first introduce an important result. 


Recall (Fact 125) that in 2D space, if a and b are non-parallel vectors, then any vector c 
can be written as the linear combination (LC) of a and b. That is, given any vector c 
(in 2D space), there exist real numbers A and y such that 


c= Aa + pub. 


It turns out that the same is true of vectors on a plane in 3D space. That is, in 3D space, 
a vector is on a plane if and only if it can be written as a LC of two non-parallel vectors on 
that plane. Or equivalently, the vectors on a plane g are exactly those that can be written 
as a LC of any two non-parallel vectors on gq. Formally, 


Theorem 19. Let q be a plane anda and b be non-parallel vectors on q. Suppose c is a 
non-zero vector. Then 


cis avectoronq <> There exist A, ¢€R such that c= rat pb. 





Proof. First note that since a and b are non-parallel vectors on q, by Fact 164, ax bi q. 


We first prove <=. Suppose there exist A, 4 € R such that c = Aa+ pub. We show that 
axblic, so that by Fact 162, c is also a vector on the plane: 


(ax b)-c=(axb)-(Aa+ pb) = (ax b)- (Aa) + (a x b) - (ub) 
=\(axb)-at+p(axb)-b=0+0=0.o 











The proof of == is harder and relegated to p. 1636 (Appendices). 





As we'll see in Ch. 73.1, Theorem 19 will allow us to describe planes in parametric form. 


But first, let’s better acquaint ourselves with Theorem 19 with some examples: 
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Example 946. The plane g de- 
scribed by x+y+z = 1 has normal Pe Gone 
vector n = (1,1,1). _ 


The vectors u=(1,-1,0) and 
v=(1,0,-1) are both perpen- 
dicular to n (verify this!). So, 
by Corollary 30, both are on q. 


Observe moreover that u + v. Hence, by 
Theorem 19, the vectors on q are exactly 
those that can be written as a LC of u and 
Vv. 








For example, the vector w = (0,1,-1) is perpendicular 
to n and is on qg. So, by Theorem 19, we should be able 
to write w as a LC of u and v, as indeed we can: 


w = (0,1,-1) = (1,0,-1) -(1,-1,0) =v-u. 


In contrast, the vector a = (1,2,3) is not perpendicular to n and so is not on gq. So, by 
Theorem 19, we cannot write a as a LC of u and v. To verify this, write 


1 1 il il 
a= AU + Liv or 9 1=Al -1 +z 0 2 
3 0 -] 


Hoo [Im tle 


| 
= 


Take + plus 2 to get 3 = p, which contradicts 3. So, there are no A, € R such that 
a=Au+pv. Or equivalently, a cannot be written as a LC of u and v. 
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Example 947. The plane q described by x — y = 5 has normal vector n = (1,-1,0). 


The vectors u = (0,0, 1) and v = (1, 1,0) are perpendicular to n and are so on gq. Moreover, 
u + v. Hence, by Theorem 19, the vectors on q are exactly those that can be written as 
a LC of u and v. 


For example, the vector w = (1,1,1) is perpendicular to n and is on qg. So, by Theorem 
19, we should be able to write w as a LC of u and v, as indeed we can: 


we (1, 1,1)=(0,0,1)+0,L0)su+v. 
In contrast, the vector a = (0,1,0) is not perpendicular to n and is not on g. So, by 


Theorem 19, we cannot write a as a LC of u and v. To verify this, write 


a=Au+ [Vv 


1 
2 
3 


Clearly, + contradicts =. So, there are no A, w € R such that a = Au+pv. Or equivalently, 
a cannot be written as a LC of u and v. 
Example 948. The plane q described by z =7 has normal vector n = (0,0, 1). 


The vectors u = (1,0,0) and v = (0, 1,0) are perpendicular to n and so are on g. Moreover, 
u + v. Hence, by Theorem 19, the vectors on q are exactly those that can be written as 
a LC of u and v. 


For example, the vector w = (1,2,0) is perpendicular to n and is on qg. So, by Theorem 
19, we should be able to write w as a LC of u and v, as indeed we can: 


w = (1,2,1) = (1,0,0) +2(0,1,0) =u+2v. 
In contrast, the vector a = (0,1,1) is not perpendicular to n and is not on qg. So, by 


Theorem 19, we cannot write a as a LC of u and v. To verify this, write 


a=Au+ pV 


He Re © 
Hoo [Im tle 
Se 


Clearly, 3 is a contradiction. So, there are no A, € R such that a = Au+ pv. Or 
equivalently, a cannot be written as a LC of u and v. 
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73.1. Planes in Parametric Form 


Suppose g is a plane and P is a point on g. Then Fact 163 says that a point R is on the 
—> 
plane q if and only if the vector PR is on q. In formal notation, 


i =; 
Req —> PR is on q. 


Next, suppose also that a and b are non-parallel vectors on g. Then Theorem 19 says that 
a vector v is on the plane gq if and only if it is a LC of a and b. In formal notation, 


v is on q ss There exist A, 4 € R such that v = Aa + yb. 
' 1 2 
Now combine <> and <>: 
Req ee There exist A, 4 € IR such that PR=)a+t ub. 


In words, a point FR is on the plane q if and only if the vector PR is a LC of a and b. 
Equivalently, the plane g contains exactly those points R for which the vector PR isa LC 
of a and b. That is, 
4 —_— 
g2{R:PR=at+ub (,neR)}. 


We previously learnt how to describe planes in vector and cartesian forms. 


With = we now have us a third way to describe planes and is called the parametric form 
of a plane. 


Let’s summarise the above discussion as a formal result: 


Fact 168. Let q be a plane, P be a point, anda and b be non-parallel vectors. Suppose 
P,a, andb are on q. Then 


Req => there exist A, u€ IR such that PR ae ub. 





Or equivalently, q é {R 2/1) Ste: ub (A, we R)}. 


Since PR = OR- OP, we have 
PR =)at+pb => OR = OP + at pb. 
So, the above Fact can also be rewritten as 


Corollary 33. Let q be a plane, P be a point, and a and b be non-parallel vectors. 
Suppose P, a, and b are on q. Then 


Req <> _ there exist A,.€R such that OR Or ae pb. 


Or equivalently, q 2 {R (OR=OP as ub (A, pe R)}. 
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As before, instead of fully writing out the plane qg in set notation (as was done in * and 2). 
we can simply say that qg is described by the parametric equation 


Vec. Vec. 
== ars 
OR £&£ OP +\at+pb (A,n€R); 
Pt Pt 
or, R £4 P  +i\a+pb (A, eR). 
With £ we have a nice, informal geometric interpretation: A point R is on the plane q if 
and only if 


R can be reached from P through a LC of “steps” in the directions of a and b. 
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Example 949. Let qg be the plane described by r- (1,1,1) =1 or r+ y+z=1. & 


It contains the point P=(1,0,0) and the non-parallel vectors a=(-1,1,0) and 
b = (-1,0,1). So, by Corollary 33, 


:OR = OP +)\at pb (,eR)} 


slesg a = =} in 
:OR£] 0 [+A] 1 , (\,weR)}. 
0 0 LL 


More simply, we say that qg is described by the parametric equation gS. 


1 


d (A,€R). 
m 


As the parameters A and p vary, we get dif- 
ferent points on gq. For example, (A, 1) = 
(3,5) produces the point S: 


The plane q 
S = (1,0,0) +3(-1,1,0) +5(-1,0,1) = 
(-7,3,5). 


Starting from the point P, we can reach S 5b =5 (=1,0,4) 
by taking 3 “steps” in the direction of a, 
then 5 “steps” in the direction of b. 


Similarly, (A, 1.) = (0,0) produces the point 
fie 


T = (1,0,0) +0(-1, 1,0) +0(-1,0,1) = 


) ) 


Starting from the point P, we were already 
at the point T' = P. 
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Example 950. Let g be the plane described by z = 3. = 


It contains the point P = (0,0,3) and the non-parallel vectors a = (1,0,0) and b = (0, 1,0). 
So, by Corollary 33, 


:OR=OP +)\at pb (,€R)} 


i 0 1 0 r 
:OR® QO }+AlL 0 |t+y] 1 U (A,u¢€R)>. 
0 0 3 


More simply, we say that qg is described by the parametric equation g 


As the parameters and p vary, we get different points on the plane gq. For example, 
(A, 1) = (5,-1) produces the point S: 


5S = {(0,0,3) + 5(1, 0,0) + (-1)(0,1,.0) = (5,1, 3). 
Starting from the point P, we can reach the point S by taking 1 “step” in the direction 


opposite to b, then 5 “steps” in the direction of a. 


The plane q 


As another example, (\, /) = (0,1) produces the point T (not depicted): 
P2100 5) 00.0) 100.1 = 


Starting from the point P, we can reach T by taking 1 “step” in the direction of b. 
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Example 951. Let qg be the plane described by —x + 3y - 5z = 7. <I 


It contains the point P=(-7,0,0) and the non-parallel vectors a=(3,1,0) and 
b = (5,0,-1). So, by Corollary 33, 


:OR=OP +)at pb (,€R)} 


=i a 5 -7+3+ 5p 
O J+AP 1 [+4] 0 J= x (A, ueR)>. 
0 0 —] —[ 


More simply, we may say that q is described by the parametric equation g. 


As the parameters and jp vary, we get different points on the plane gq. For example, 
(A, 4) = (1,2) produces the point S: 


S = (-7,0,0) +1(3, 1,0) +2(5,0,-1) = (6, 1,-2). 


Starting from the point P, we can reach S' by taking 1 “step” in the direction of a, then 
2 “steps” in the direction of b. 


The plane q 


2b| = 2 (50, -1) 


(6, 1,-2) 


As another example, (A, ) = (-1,3) produces the point T (not depicted): 
Jape ses) ee Cols GV) Eee Gaines ee ayes 


Starting from the point A, we can reach T by taking 1 “step” in the direction opposite 
to a and 3 “steps” in the direction of b. 


Exercise 284. Rewrite each plane into both cartesian and parametric forms. (Answer 
on p. 1872.) 





(a) r-(-1,2,5)=5. (b) r-(0,0,1) =0. yy eu eae 
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73.2. Parametric to Vector or Cartesian Form 
Given a plane in parametric form, rewriting it into vector or cartesian form is easy: 


Example 952. We are given a plane in parametric form: 


7 8 7+8A+9u 
r=] 3 |+A] 3 [+u 3+3A+3u (A, we R). 
4 4 4+4\+ 7p 


This plane contains the vectors (8,3,4) and (9,3,7). So, a normal vector is 
(8, 3,4) x (9,3, 7) = (9, -20, -3). % 


The plane contains the point (7,3,4). Since (7,3,4) - (9, -20,-3) = 63 - 60 - 12 = -9, the 
plane may be described by 


r-(9,-20, -3) = -9 or 9x - 20y - 3z = -9. 


Example 953. We are given a plane in parametric form: 


ion 
Qu —-2 (A, wé€R). 


5A 
First, rewrite the above as 


o 2 
+\} O |+pu] 2 (A,ue€R). 
5 0 


This plane contains the vectors (3,0,5) and (-2,2,0). So, a normal vector is 
(3,0,5) x (2, 2,0) = (-10, -10,6). % 


Observe that (-10,-10,6) || (5,5,-3). Hence, by Fact 160, (5,5,-3) is also a normal 
vector of the plane. (It’s nice to “simplify” the normal vector as much as possible—this 
will usually make our subsequent calculations slightly easier.) 

The plane contains the point (17,-2,0). Since (17, -2,0) - (5,5,-3) = 85 -10+0 = -75, 
the plane may be described by 





r-(5,5,-3)=-7 ba + 5y—32=-T. 
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Example 954. We are given a plane in parametric form: 


A-p-2 
Po | 4 oi (A, € R). 
5+ut+7r 


First, rewrite the above as 


1 —1 
+AT 5 [+e] 3 Qa, we R). 
i 1 


This plane contains the vectors (1,5,7) and (-1,3,1). So, a normal vector is 


ia (a9, 1 CG, 38). 


Observe that (-16,-8,8) || (2,1,-1). So, (2,1,-1) is also a normal vector. 


The plane contains the point (—2,14,5). Since (-2, 14,5) -(2,1,-1) = -4+14-5=5, the 
plane may be described by 


r-(2,1,-1) =5 2n+y-2=5. 


Exercise 285. Rewrite each plane into both vector and cartesian form. 

(a) r=(1,2,3)+.2(4,5,6) + (7,8, 9) (A, WER). 

(b) r=(A-p,40+5,0) (A, u€R). 

(c) r=(1+y,14+)A,A +p) (A,wéR). (Answer on p. 1872.) 
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74. Four Ways to Uniquely Determine a Plane 


Recall that there are two ways to uniquely determine a line. A line can be uniquely 
determined by (a) two distinct points; or (b) a point and a vector. 


Similarly, there are Four Ways to uniquely determine a plane:*”° 


1. A point and a normal vector; 
2. A point and two vectors (that aren’t parallel); 
3. Two points and a vector (that isn’t parallel to the vector between the two points); or 


4. Three points (that aren’t collinear). 
More formally and precisely, 


Fact 169. Let A be a point and u be a vector. 


(a) The unique plane that contains A and has u as its normal vector is 
{R:OR-u=OA-u}. 


Let v be a vector that isn’t parallel to u. 

2. The unique plane that contains A, u, andv is{R:R=A+Aust+pv (Ape R)}. 
Let B+ A be a point such that AB Hu. 

3. The unique plane that contains A, B, and u is {R >R=A+Au+ uAB (A, We R)}. 


Let C be a point that isn’t collinear with A and B. 





4. The unique plane that contains A, B, and C is {R “ReAP ARS uAC (A, We R)}. 


Proof. (a) See Fact 279 (Appendices). 
(b) Apply Corollary 33. 
(c) Since AB t+ u, we can again apply Corollary 33. 


(d) Since A, B, and C aren’t collinear, AB tt AC. And now, we can again apply Corollary 
Gu 














We'll give two examples of each of the Four Ways. These examples will also serve as a 
summary of what we’ve learnt so far about planes. 


First, two examples where we’re given a point and a normal vector of the plane: 





®°°This is an assertion that we formally prove only in Ch. ?? (Appendices). 
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Example 955. A plane contains the point (1,2,3) and has normal vector (1, 1,0). 


Compute (1,2,3)-(1,1,0) = 1+2+0=3. So, this plane may be described in vector or 
cartesian form by 


re(L LOy=3 or Lay = 3. 


This plane also contains the non-parallel vectors (1,-1,0) and (0,0,1). So, it may be 
described in parametric form by 


1+. 
a a | (A, wé€R). 
3+ ps 


Example 956. A plane contains the point (0,0,1) and has normal vector (2,-1,1). 


Compute (0,0,1)-(2,-1,1) =0+0+1=1. So, this plane may be described in vector or 
cartesian form by 


r-(2,-1,1)=1 or 2r-ytz=l. 


This plane also contains the non-parallel vectors (1,2,0) and (0,1,1). So, it may be 
described in parametric form by 


(A, wé R). 





Second, two examples where we’re given one point and two vectors (that aren’t parallel): 


Example 957. A plane contains the point (1,2,3) and the non-parallel vectors (5, 4,3) 
and (1,-1,2). So, it may be described in parametric form by 


| 5 1 1+5A+ ps 
r=] 2 ]+A] 4 [+H] -1 |=] 24+4A-wp J, 
3 3 ; 34+3A+42u 


This plane has normal vector (5, 4,3) x (1,-1,2) = (11,-7, -9). a 


Compute (1, 2,3) -(11,-7,-9) = 11 - 14 - 27 = -30. So, this plane may be described in 
vector or cartesian form by 


r-(11,-7,-9) = -30 lla - Ty - 9z = -30. 
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Example 958. A plane contains the point (5,0,1) and the non-parallel vectors (1, 1,8) 
and (1,0,1). So, it may be described in parametric form by 


5+A+ p 
r 
1+8A+ py 


This plane has normal vector (1,1,8) x (1,0,1) = (1,7,-1). % 


Compute (5,0,1)-(1,7,-1) =5+0-1=4. So, this plane may be described in vector or 
cartesian form by 





p- (17-1) =4 or g+Ty-z=A4. 


Third, two examples where we’re given two points and a vector (that isn’t parallel to 
the vector between the two points): 


Example 959. A plane contains the points (0,0,3) and (1,4,5), and the vector (3, 2,1). 


The vector between the two points is (1,4,5) - (0,0,3) = (1,4,2) and isn’t parallel to the 
vector (3,2,1). So, this plane may be described in parametric form by 


3 1 3A + LL 
+\]} 2 |+yu 2 ae (A,ueéR). 
1 2 Bt+A+ 2p 


This plane has normal vectors (3, 2,1) x (1, 4,2) = (0,-5, 10) and hence also (0,-1, 2). * 


Compute (0,0,3)-(0,-1,2) =0+0+6=6. So, this plane may be described in vector or 
cartesian form by 


r-(0,-1,2) =6 —y+2z=6. 


Example 960. A plane contains the points (8, -2,0) and (3, 6,9), and the vector (0, 1,1). 


The vector between the two points is (3,6,9) — (8, -2,0) = (-5,8,9) and isn’t parallel to 
the vector (0,1,1). So, this plane may be described in parametric form by 


8- 5p 
=| -2+A+8p ], 
A+ 9 


This plane has normal vector (0, 1,1) x (-5,8,9) = (1,-5, 5). Sa 


Compute (8, -2,0)-(1,-5,5) =8+10+0=18. So, this plane may be described in vector 
and cartesian forms by r-(1,-5,5) = 18 and x -5y+5z=18. 





Fourth and last, two examples where we’re given three points (that aren’t collinear): 
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Example 961. A plane contains the points (1,2,3), (4,5,8), and (2,3, 5). 


The vector between the first two points is (4,5,8)-(1, 2,3) = (3,3,5), while that between 
the first and last is (2,3,5) - (1,2,3) = (1,1,2). Since (3,3,5) + (1, 1,2), this plane may 
be described in parametric form by 


1+3A+p 
=| 24+3A+p |, 
3+ 5A + 2u 


This plane has normal vector (3,3,5) x (1,1,2) = (1,-1,0). % 
Compute (1,2,3)-(1,-1,0) = 1-2+0=-1. So, this plane may be described in vector 
and cartesian forms by r-(1,-1,0) =-1 and x-y=-1. 

Example 962. A plane contains the points (1,0,0), (0,1,0), and (0,0, 1). 


The vector between the first two points is (0, 1,0)-—(1,0,0) = (-1, 1,0), while that between 
the first and last is (0,0,1) - (1,0,0) = (-1,0,1). Since (-1,1,0) + (-1,0,1), this plane 
may be described in parametric form by 


i LL 


aos 
r 
[L 


This plane has normal vector (-1,1,0) x (-1,0,1) = (1,1, 1). Sa 


Compute (1,0,0)-(1,1,1) = 1+0+0=1. So, this plane may be described in vector and 
cartesian forms by r-(1,1,1) =1l and r+y+z=1. 





As the above examples suggest, we can easily “go “from any one of to any other of the Four 
Ways. 


That is, starting from (a) a point and a normal vector (hence uniquely determining some 
plane q), we can easily find 


(b) two other non-parallel vectors on q; 


(c) another point and another vector on q (where the vector is not parallel to the vector 
connecting the two known points); and 


(d) another two points on q (where the three points are not collinear). (How?)?°° 


Similarly, starting from (b), we can easily “go to” (a), (c), or (d). (How?)?” 





8°6Say we're given (a) a point A = (3,2,1) and a normal vector n = (1,2,3) of a plane g. Then (b) two 
non-parallel vectors on g are u = (0,3,-2) and v = (3,0,-1); and (c)+(d) another two points on gq are 
B=A+uand C=A+v, where A, B, and C are not collinear. 

3°7Say we're given (b) a point A = (3,2,1) and two non-parallel vectors u = (0,3,-2) and v = (3,0,-1) 
on q. Then (a) a normal vector of g is ux v; and (c)+(d) another two points on g are B= A+u and 
C= A+v, where A, B, and C are not collinear. 
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Similarly, starting from (c), we can easily “go to” (a), (b), or (d). (How?)*® 


Similarly, starting from (d), we can easily “go to” (a), (b), or (c). (How?)*”? 


Exercise 286. In each of (a)-(c), three points are given. Describe the plane that 
contains all three points in vector, cartesian, and parametric form. 


(a) (7 34), @,3.4)and @47). (c) (8,5,9), (8,4,5), and (5,6,0). 
(by 0 2) 44 38) and (27,2). (Answer on p. 1873.) 








8°8Say we're given (c) two points A = (3,2,1) and B = (4,4,4) and a vector v = (3,0,-1) on q, where 
v + AB. Then (a) AB x v is a normal vector of q; (b) AB is another vector on q that isn’t parallel to 
v; and (d) C = A+v is another point on q such that A,B, and C are not collinear. 

3°9Say we’re given (d) three non-collinear points A = (3,2,1), B = (4,4,4), and C = (6,2,0) on g. Then 


(a) AB x AC is a normal vector of q; and (b)+(c) AB and AC are two non-parallel vectors on gq. 
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75. The Angle between a Line and a Plane 


Suppose the line / has direction vector v and the plane g has normal vector n. 


Let 6 be the non-obtuse angle between v and n. 






The line | 


The plane q 








We want A in the above figure to be the angle between the line / and the plane gq. 


There are two possible cases: The angle between v and n is (1) 6; or (2) 7-8. 


But in either case, A is the complement of #. That is, A = . — 6. 





By Fact 138, 6=cos ve al 
[v| [| 
el | mu 
By Fact 101, sin %+cos "x= 3" 
TT . . 
Altogether, A=—-cos? wl = sin? a 
2 v| [| [v| fn 


The above discussion motivates this definition: 


Definition 178. The angle between a line with direction vector v and a plane with 
normal vector n is this number: 


sin”! [ea 
[v| |n] 


Example 963. The angle between a line 
with direction vector (9,1,3) and a plane 
eS 


with normal vector (1,1,1) is The plane 


: oj (ovis) (ht) aa 


sin 


(9,1, 3) 14, 1, 1)| 
z I9+1+3] 
S92 +12 4 32/12 + 12 + 12 
[13 
V91V/3 
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= siti 


« 0.906. 





Example 964. The angle between a line 
with direction vector (1,0,1) and a plane 
with normal vector (—1,-1,0) is & 


The plane 
eee] |(1, 0, 1) ° (ee = 0) 


|(1, 0, 1)| Kew =, 0)| 
|-1+0+0| 


V1? +0? + 12/(-1) + (-1)* + 0? 


—l 
= sin! seu =<sin 


V2\/2 





Example 965. The angle between a line with direction vector (1,0,1) and a plane with 
normal vector (0,1,0) is 


(1,0, 1)-(0,1,0)] 4 JO+0+0 _ 1 _|0 


=i al 
sin sin re Siti —— =s1Iln 0 = 0. 
JV 12 +02 + 12\/02 + 12 + 02 J2V1 





I(1,0,1)||(0,1,0)| 


Exercise 287. Find the angle between the given line and plane. (Answer on p. 1874.) 
(a) Line: r=(-1,2,3)+.A(-1,1,0) (A€R). 

Plane: r-(3,4,5) =0. 
(b) Line: Contains the points (-1,2,3) and (-1,4,9). 

Plane: r= (2,0,0) + A(-3, 1,0) + 4 (0,5, -3) (A € R). 


(c) Line: Contains the points (-1,2,3) and (0,11, 11). 
Plane: Contains the points (1.5,0,0) and (0,0,1.5) and the vector (4,-1,0). 
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75.1. When a Line and a Plane Are Parallel, Perp., or Intersect 


Definition 179. Let @ be the angle between a line and a plane. The line and plane are 


said to be (a) parallel if 0 =0; and (b) perpendicular if 6 = 7/2. 





As usual, if a line / and plane q are parallel, write / || q. If not, write 1 + q. 


And if they’re perpendicular, write write / 1 q. 
If not, write | { q. 


The plane g has normal vector n, while the lines 
1, and J) have direction vectors v, and vo. 


“Obviously”, 1) Lq <> vj; || n. 
And ly ||q <> vol n. The plane 





Let’s state and prove these obviosities formally: 


Fact 170. Suppose the line | has direction vector v and the plane q has normal vector n. 
Then (a) l||q <> vin; and (b)llq <— vl n. 


Proof. Let @ be the angle between / and gq. By Definitions 179, 178, and 146, and Fact 135: 




















(a) I||q <> 6=0 es gi OA <— v-n=0 —vin. 
[y| [n| 
(b) ig 5S es et he eg <> v|n. 
2 iv|[n| 2 [v| |n| 


The next result is intuitively “obvious”: 


Fact 171. Given a line and a plane, exactly one of the three following possibilities holds: 
The line and plane are 


(a) Parallel and do not intersect at all; or 


(b) Parallel and the line lies entirely on the plane; or 
(c) Non-parallel and intersect at exactly one point. 








Proof. See p. 1638 (Appendices). 











Example 966. ‘The three lines /;, /2, and /3, and plane g 
illustrate all three possibilities given in Fact 171: 


1. 1, || q and does not intersect q. 
2. I, || q and lies entirely on q. 


3. ls 4 q and intersects q at exactly one point. 
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By Fact 171, the line / and plane q intersect at exactly one point <= > I + q. 


Example 967. The line / and plane gq are described by 


Vv n 
—— J 


—S 
r = (3,5,5) +A(9, 1,3) (Ae R) and ued @ Galle Base 


Observe that v-n = (9,1,3)-(1,1,1) # The line | 
0, so that v { n and hence, / + q. 


Thus, / and g share exactly one inter- 


section point. 1 (-51, 55, 35) 
To find this intersection point, simply The plane q 13 — 
plug in a generic point of / into the 


equation for q: 


(3, 5,5) +4(9,1,3)]-(1,1,1)=3 ieee wae 


x 10 1 
So, / and q intersect at (3,5,5) + (9, 1,3) = (3,5,5) - Te (9, 1,3) = 7G (-51, 55,35). 


Example 968. The line / and the plane q are described by 


Vv n 


a ——— 
r = (3,5,5)+A(9,1,3) (A€R) = and __r-(1,0,-3) = -6. 
Observe that v-n = (9,1,3)- (1,0,-3) = 0, so that The line | 
v i nand hence / || g. Thus, either 


l lies entirely on q; or 


{ and ¢g don’t intersect at all. 
x The plane q 


It’s easy to figure out which of these two possibilities holds: We already know that 1 
contains the point (3,5,5); so, simply check if this point is also on the plane gq: 


(3,5,5)-(1,0,-3) =3+0-15=-12 4-6. 


Since this point does not satisfy q’s vector equation, it is not on g. So, it cannot be that 
l lies entirely on gq. Hence, it must be that / and q do not intersect at all. 
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Example 969. The line / and the plane q are described by 


Vv n 
a aN 


a 
PIGS Gre) Gee sand) “rs Guns) 2=6. 


Again, I || g. So again, two possibilities: Either / lies entirely on q or they don’t intersect 
at all. 


To find out which, same trick as before: Check if the point (3,5,3) on / is also on gq: 
(3,5, 3)-(1,0,-3) =3+0-9 =-6 "A 


Yup, this time it is. Since / and q are parallel and share an intersection point, it must be 
that / lies entirely on q. 


The line / 


The plane q 


Exercise 288. In each of (a)—(c), a line / and a plane q are given. For each, determine 
which of the three possibilities given in Fact 171 holds. And if the line and plane intersect, 
find their intersection point. (Answer on p. 1874.) 
(a) I: r=(4,5,6)+A(2,3,5)  (\eR). 

re (10,0, 4) 6. 


Contains the points (5,5,6) and (3,2, 1). 
r = (3,0,1) +A(2,0,5) + 4 (2,1,5) (Ae R). 


Contains the points (4,5,6) and (6,8, 11). 
Contains the points (2,0,-2) and (2,1,-2) and the vector (3,0, 10). 
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76. The Angle between Two Planes 


The planes gq; and g2 have normal vectors n; and no. 





Let 6 be the (non-obtuse) angle between n, and ny. 
Then @ is also the angle between q; and qo. 


The above discussion motivates this definition: 


Definition 180. The angle between two 


planes is the non-obtuse angle between 
their normal vectors. 





Now apply Fact 138: 


Fact 172. The angle between two planes with normal vectors u and v is 


cos”! Lea 
ul [v| 


Example 970. The angle between the planes r- (2, 1,3) = 26 and r- (-3,0,5) = —25 is 


3 Cee) = cos! eelceas ~ 1.146. % 
(2, 1, 3)||(-3, 0, 5)| V 14/34 





6 = cos 


Example 971. The angle between the planes r-(1,1,1) = 12 and r-(-1,-1,0) =-1 is 


Bee ie 10) a2 Pe 
pee | a a) Pelee aot % 
~ (GLDICL-LO ava 


Exercise 289. Find the angle between the given planes. (Answers on p. 1875.) 


(a) r-(-1,-2,-3) =1 and r- (3,4,5) =2. 
(b) One plane contains the vectors (1, -1,0) and (3,5,-1). The other contains the vectors 
(0,10) and (10; 2,3). 


(c) One plane contains the points (1,1,0), (3,0,0), and (0,0,1). The other contains the 
points (1,-1,0), (1,0,-1), and (0,3, 1). 
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76.1. When Two Planes Are Parallel, Perp., or Intersect 


Definition 181. If @ is the angle between two planes, then the two planes are 





(a) Parallel if 0 = 0; and (b) Perpendicular if @ = 7/2. 


As usual, if two planes qg and r, write q || r. If not, write q } r. 
And if they’re perpendicular, write write q 1 r. If not, write q [ r. 


The next result is “obvious”: 


Fact 173. Suppose q and r are planes with normal vectors u and v. Then 





(a) q|lr — uly; and (b) gir — uly. 





Proof. See p. 1639 (Appendices). 











Example 972. The planes g), g2, and g3 have normal 
vectors Vj, V2, and v3. We have 


e Vv, L vo and q; 1 q. 
e vi 1 v3 and q 1 qs. 
° v2 || v3 and @ || @. 





Another intuitively “obvious” result: 


Fact 174. If two planes are parallel, then they are either identical or do not intersect. 











Proof. See p. 1639 (Appendices). 











Example 973. The planes q, and q are described by r-(3, -3, -1) = 1 and r-(-6,6, 2) = 5. 
Since (3, -3, -1) | (=6, 5,2), 71 | QQ. = q1 


Now pick any point on gq —for example, 
(0,0,-1). Check if this point is on qo: 


(0,0,-1)-(-6,6,2) =0+0-2=-2¢5. {2 





It isn’t. Since the two planes are not identical, by Fact 174, they do not intersect at all. 
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Example 974. The planes q and q are described by r-(1,0,2) = —3 and r-(—2,0, -4) = 6. 
Since (1,0, 2) | (3,08), 71 | q2. 
Now pick any point on q;—for example, (-1,0,-1). Check if this point is on qo: 


(-1,0,-1)-(-2,0,-4) =2+0+4=6. qi = 


Yup, it is. Since the two planes share at least one 
intersection point, by Fact 174, they are identical. 





Here’s another “obvious” result: 








Fact 175. If two planes are not parallel, then they must intersect. 














Proof. See p. 1640 (Appendices). 


Example 975. Since q, + q2, by Fact 175, they must intersect. 


q2 


In fact, and as should be intuitively “obvious”, g; and g. must intersect along a line. 





As the above example suggests, two non-parallel planes must intersect along a line. 


Now, what do we know about this intersection line? Well, “obviously”, its direction vector 
is parallel to both planes and is thus also perpendicular to both planes’ normal vectors. 


Let n and m be two planes’ normal vectors. By Fact 157, the only vectors perpendicular 
to both n and m are those parallel to n x m. 


Hence, their intersection line must have direction vector n x m. Altogether, 

















Proof. See p. 1640 (Appendices). 
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Example 976. The planes q; and gz are described by 
r-n, =r-(-1,2,-3) =4 and r-n)=r-(5,-6,7) =0. 


Since n; + no, by Fact 176, q and gq must intersect along a line with direction vector 
ny X Ny = (-1, 2, -3) x (5, -6, 7) = (-4, -8, -4) or (1, 2,1). * 


Recall that a direction vector and a point fully describe a line. We already have a direction 
vector. So, let’s find some point P that is on the intersection line. 


To do so, first write out the two planes’ cartesian equations: 
—-%+2y-3z=4 and 5a —-6y + 7z =0. 


The solutions to the above system of (two) equations gives us the two planes’ intersection 
points. Note that with three variables and two equations, this system of equations has 
infinitely many solutions and hence infinitely many intersection points. And of 
course, the set of all these intersection points is the intersection line. 


To find any one such intersection point, zero is as usual our friend. Let’s look for an 
intersection point whose x-coordinate is zero. In other words, let’s simply plug x = 0 into 
the above equations to get 


2y-3z24 and -6y + 7z 20. 


And now, we can easily solve this system of (two) equations (with two variables): 2 plus 
3x = yields —2z = 12 or z = —-6 and hence, y = —7. 
So, an intersection point shared by q; and q is P = (0,-6,-7). Hence, their intersection 
line is 

r = (0, -6,-7) + A(1, 2,1) (A €R). 


= ny = (-1,2,-3) 


i — (3, —6, 7) aval 


The intersection line has 
direction vector (1, 2,1). 
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Together, Facts 174 and 176 say the following: 


Corollary 34. Given two planes, exactly one of these three possibilities holds: They are 


(a) Identical and thus also parallel; 


(b) Parallel and do not intersect at all; or 


(c) Non-parallel and intersect along a line. 





Proof. If two distinct planes are parallel, then by Fact 174, they do not intersect at all. 
And if they aren’t parallel, then by Fact 176, they intersect along a line. 














The above result implies that two distinct planes intersect if and only if they are not parallel. 
Example 977. The planes q; and g2 are described by 
r-n, =r-(-3,7,1) =2 and rene = 9-4 17,1)=0. 


Since n; } no, by Fact 176, they must intersect along a line with direction vector 
(=3, 7,1) *(1,2,1)=(6,4,—13). Sa 


To find an intersection point, write out the two planes’ cartesian equations: 


—32 + Ty+z=2 and x+2y+z2=0. 
ny = (=3; G 1) 
Again, there'll be infinitely many in- qo 
tersection points. To find one, we 
again plug x = 0 into the above equa- 


tions to get P = (0,0.4, -0.8) 


Ty+z=2 and Qy+z20. 


Solving, y = 0.4 and z = -0.8. So, an 
intersection point shared by q, and 
q. is P =(0,0.4,-0.8). Hence, their 
intersection line is 


The intersection line has 


r = (0,0.4,-0.8) + 4(5,4,-13) direction vector (5,4,-13). 
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Example 978. The planes q; and gz are described by 
r-n, =r-(0,4,5) =0 and feig = t104,5) =1, 
Since n; + no, by Fact 176, they must intersect along a line with direction vector (0, 4,5) x 


(3, 4,5) = (0,15, -12) or (0,5, -4). 


To find an intersection point, write out the two planes’ cartesian equations: 


Ay + 5z =0 and 32+ 4y+5z=1. 


Again, there'll be infinitely many intersec- 
tion points. It turns out that this time, our 
“plug in x = 0 trick” won’t work so nicely. 
Let’s try it anyway and see what happens: 


P = (1/3,0,0) 


Ay +5z=0 and dy +5221, 


which are clearly contradictory. What this 
contradiction means is that the two planes 
do not share any intersection point whose 
x-coordinate is 0. 


No big deal. Instead of plugging in x = 0, 


let's try y = 0 instead: The intersection line has 


: : direction vector (0,5, —4). 
5z =0 and 3x +5z=1. 


Solving, we have z = 0 and x = 1/3. So, an intersection point shared by q; and q is 
P = (1/3,0,0). Hence, their intersection line is 


r = (1/3,0,0)+A(0,5,-4) = (Ae R). 


Exercise 290. In each of (a)-(g), a pair of planes q and q is given. For each pair, 
determine which of the three possibilities in Corollary 34 holds. If the two planes intersect, 
find the set of intersection points. (Answer on p. 1875.) 


(aj r-(4.9 2) ol and rei 12) = 10) 


(b) q, contains the point (1,3,-2) and the vectors (1,-1,0) and (1,-1,1); while q 
contains the point (2,3,5) and the vectors (6,-1,0) and (8,0,-1). 


(c) m@ contains the points (1,1,6), (7,7,0), and (5,3,3); while q contains the points 
(7,3,1), (5,5,1), and (3,5,2). 


(d) q contains the points (5,3,2), (1,5,3), and (10,0,1); while q. contains the points 
(5,-1,4), (8,8, -2), and (3,5, 2). 


(er (Ge laa? and ro) = 6. 
(fr 3)=0 and re (—15 1,3) =2. 
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77. The Distance between a Point and a Plane 


Earlier, from Ch. 15.1, we had these result and definition: 


Corollary 4. Suppose A is a point not on the line 1. Then there exists a point B that 
is both (a) the unique point on | that’s closest to A; and (b) the unique point on | such 
that 11 AB. 


Definition 182. Let A be a point that isn’t on the line 1. The foot of the perpendicular 
from A to lis the (unique) point B on / such that AB 1 l. 





In this chapter, we have, analogously, these result and definition: 


Fact 177. Suppose A is a point not on the plane q. Then there exists a point B that is 
both (a) the unique point on Q that’s closest to A; and (b) the unique point on q such 
that q 1 AB. 








Proof. See p. 764 (Appendices). 











Fact 177. Suppose A is a point not on the plane q. Then there exists a point B that is 
both (a) the unique point on Q that’s closest to A; and (b) the unique point on q such 
that q 1 AB. 





Proof. Let q be the plane r-n=d. Let | be the line R= A+An (A €R). 

Since | + q, by Fact 171, / intersects q at exactly one point—call it B. So, B= A+ pn for 
some A, € R. 

Since AB = Apn, we have q 1 AB (and AB is perpendicular to every vector on q). 


Let C+ B be any other point on g. Then BC is a vector on q. SO, AB 1 BC or AB-BC 2 


>. 
Also, Bc + 0 


0. 


Now, |AC| - |AB+BC| = (AB+ BC). (AB+BC) - [AB] +|BC] +248. BC + [ABl + 
[Bc] 5 |aBy. 

We’ve just shown that (a) B is closer to A than any other point on gq. 

Now, AC BC = (4B + BC) .BC = AB- BC+ all 2 Be 3 0. Hence, AC - BC +0 or 
AC { BC. Thus, q { AC. 

We’ve just shown that (b) besides B, there is no point C' on q such that q 1 AC. 
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Definition 183. Let A be a point that isn’t on the line g. The foot of the perpendicular 


from A to q is the (unique) point B on q such that AB 1 q. 





In Chs. 62 and 69, we learnt to find 

(a) The foot of the perpendicular from a point to a line; and 
(b) The distance between a point and a line. 

In this chapter, we’ll analogously learn to find 

(a) The foot of the perpendicular from a point to a plane; and 
(b) The distance between a point and a plane. 


We again reproduce from Ch. 15 this definition: 


Definition 184. The distance between a point A and a graph G is the distance between 
A and B, where B is the point on G that’s closest to A. 


Let A be a point, g be a plane, and B be the foot of the perpendicular from A to q. By 
Fact 177, B is also the point on q that’s closest to A. So, the distance between A and q is 


simply AB]. 
Example 979. Let A= (1,2,3) be a point, q be the plane r-n=r-(1,1,1) =3, and B 
be the foot of the perpendicular from A to gq. 
Find B and |AB| (the distance between A and q). 


Since B is the foot of the perpendicular, we have AB 1 q or AB || n or for some k ¢ 0, 
kn = AB =OB-OA. 
So, OB = OA+ kn = (1,2,3) + k(1, 1,1). 


A=(1,2,3)0 


Since Beg, OB-(1,1,1) =3 or [(1,2,3) + k(1,1,1)]- (1, 1,1) =3 or 6+ 3k =3. 


So, k=-1, B=A+kn= A-1n = (1,2,3)-1(1,1,1) = (0,1,2), and the distance between 
A and q is 





|ABl = [en] = || fn] = [-1]|(1, 1,1) = V3. ts 
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Example 980. Let A= (-1,0,1) bea point, q be the plane described by r-n = r-(0,2,5) = 
1, and B be the foot of the perpendicular from A to q. 


Find B and |AB| (the distance between A and q). 


For some k # 0, OB = OA+kn = (-1,0,1) + k(0, 2,5). 


A= (-1,0,1) 
? 


Not to scale. 


Since Beg, OB -(0,2,5) = 1 or [(-1,0, 1) + k(0,2,5)]-(0,2,5) = 1 or 5+ 29k=1. 


4 4 8 9 
k =-4/29, B= A+kn = A-—n = (-1,0,1) -—(0,2,5) = |-1,-—, — h 
So, k=-4/29, B= A+ hn = A-—n = (-1,0,1) -3(0,2,5) = (-1,-35,25), and the 
distance between A and q is 


— 4 4 4 4 
AB| = |kn| = |k S| Uy ee CU Se) ss 
FAB] = fn = el al = |-55]1(0.2.5)1 = 55 10.2.5) = 5g V 29 = 


Exercise 291. For each of the following, let B be the foot of the perpendicular from A 
to q. Use the Perpendicular Method to find B and the distance between A and q. 
The point A The plane q 
(a) (ae) r- (9,3, 7) = 109. 
(b) (oO. 2) re (2,7,2) =42, 
(c) (8, 5,9) r- (5,6,0) = 64. (Answer on p. 1877.) 





By mimicking the above examples, we can derive general formulae for B and AB}: 


Fact 178. Let q be a plane described byr-n=d, A be a point that isn’t on the plane q, 
and B be the foot of the perpendicular from A to q. 


Suppose k= ees 


In| 





Then (a) B=A+kn; and —_(b) [AB] = |x| |n]. 





Proof. See Exercise 292. 











Observe that if n = (a,b,c) and A = (z,y,z), then in the above result, we also have 


_ a= (Gas by +2) 
a+b?+e | 
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You may find this second formula for k easier to remember. 





Do not try mugging Fact 178. This is a waste of your brainpower. And disaster strikes if 
you're unable to remember the formulae. 


It’s far easier and better to understand the method used to derive Fact 178 and illustrated 
in the last two examples. By actually understanding what’s going on, you also easily 
remember how you can use the method. 


Nonetheless, as a quick illustration, let’s redo the last two examples using Fact 178: 


Example 981. Let A = (1, 2,3) bea point, q be the plane described by r-n = r-(1, 1,1) = 3, 
and B be the foot of the perpendicular from A to gq. 


Compute |n| = V1? + 12 + 1? = V3 and 
JOA ae Gosia) Bed ee 
a or Gene a os Sono 


So, by Fact 178, B = A+kn = (1,2,3)-1(1,1,1) = (0,1,2) and |AB| = [Al In| = 1- V3 = V3. 


Example 982. Let A= (-1,0,1) bea point, q be the plane described by r-n = r-(0,2,5) = 
1, and B be the foot of the perpendicular from A to gq. 


Compute |n| = V0? + 22 + 52 = /29 and 
,4-OA-n _1-(-1,0,1)-(0,2,5)_3-(0+0+5) 4 
* ie Ooo 29 i. 3 ae 
4 g 4 = 
So, by Fact 178, B = A+ hn = (-1,0,1)-3-(0,2,5) = (-1, =) and {AB| = [fl |n] = 


; ; 29° 29 

2/29 = —_—. 
29 V 29 

The next result is not one that students could reasonably have been expected to know. 

Which means, of course, that it made a sudden appearance in 2017 (Exercise 608). 





Corollary 35. Suppose a plane is described by r-n=d. Then the distance between the 
plane and the origin is 


ld 


fn 


Also, if n is a unit vector, then the distance between the plane and the origin is |d|. 








Proof. See Exercise 293. 
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Exercise 293. Prove Corollary 35. (Hint: Use Fact 178). (Answer on p. 1879.) 


We again revisit the same two examples: 


Example 983. Let qg be the plane described by r-n=r-(1,1,1) =3. Then by Corollary 
30, the distance between g and the origin is 


Le 
In| V3 ae 


Example 984. Let qg be the plane described by r-n=r-(0,2,5) = 1. Then by Corollary 


30, the distance between g and the origin is 


ld 


In| 4/29" 





Two new examples: 


Example 985. Let qg be the plane described by r-n=r-(0,5,1) =7, A=(-1,2,5) bea 
point, and B be the foot of the perpendicular from A to gq. 


For some k # 0, OB =OA+kn= (-1,2,5) +k(0,5,1). 
Since Be gq, OB -(0,5,1) =7 or [(-1,2,5) + k(0,5,1)]-(0,5,1) =7 or 15 + 26k =7. 


4 4 1 
So, k = -8/26 = -4/13, B = a = (=1,2,5) ~ 730052) = Tg (-18,6,61), and the 
distance between A and q is 


= 4 4/2 
AB| = |k||n| = —- V26 = ——. af 
FAB| = [kl hn| = 55 V26 = 


Again, you shouldn’t bother mugging Fact 178. Nonetheless, let’s use it to quickly check 
that our above results are correct: 


Compute |n| = V0? + 5? + 1? = V26 and 


,_d-OA-n _7-(-1,2,5)-(0,5,1) _7-15__ 4 
7 26 een 





4 i 
So, by Fact 178, B= A+kn = (-1, 2,5)-75(0,5, = ai (-13,6,61) and 


[AB] = |&| fn] = 4/2/18. 
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Example 986. Let q be the plane described by r-n=r-(1,2,3) = 32, A=(0,0,0) bea 
point, and B be the foot of the perpendicular from A to gq. 


For some k # 0, OB = OA+kn = (0,0,0) + &(1, 2,3). 
Since Be q, OB - (1, 2,3) = 32 or [(0,0,0) + &(1,2,3)]-(1,2,3) = 32 or 14k = 32. 
16 16 16 
So, k = 32/14=16/7, B= A+ zh = (0,0,0) + 7 (12,3) i (1,2,3). and the distance 


between A and q is 
—> 16 16/2 
AB| = |k| |n| = —-V14 = ——_. % 
JAB] = [kl | = = V14 = 
Again, you shouldn’t bother mugging Fact 178. Nonetheless, let’s use it to quickly check 
that our above results are correct: 


Compute |n| = V1? + 2? + 3? = V14 and 


,_d-OA-n _ 32-(0,0,0)-(1,2,3) _ 32-0 _ 16 
InP 14 ees a 


1 1 
So, by Fact 178, B= A+kn = (0,0,0) + = (1,2,3) = = (1,2,3) and 
|AB| = |A| In| = 16,/2/7. 


Exercise 294. Let S = (-1,0,7) and T = (3,2,1) be points and q be the plane described 
by r- (5, -3, 1) = 0. Use both methods you’ve learnt in this chapter to find (a) the feet of 
the perpendiculars from S and T to q. Then find (b) the distances from q to the points 
S and T; and also (c) the distance between gq and the origin. (Answer on p. 1879.) 





769, Contents www.EconsPhDTutor.com 


78. Coplanarity 


Definition 185. Two or more points are coplanar if some plane contains all of them. 


“Obviously”, given any line, there is a plane that contains this line. 
Recall (Fact 144): Any two points are collinear. So “obviously”, they must also be coplanar. 


Recall (Ch. 74): Three non-collinear points uniquely determine a plane. So, any three 
points must also be coplanar. 


In contrast, four points need not be coplanar. Given four distinct points, we’ll use these 
steps to check if they’re coplanar: 


1. Write down the plane that contains three of the points. 


2. Then check whether this plane also contains the fourth point. 


Example 987. Let A = (1,0,0), B = (0,1,0), C = (0,0,1), and D = (1,1,-1) be points. 
To check if they are coplanar, we’ll write down the (unique) plane q that contains A, B, 
and C’. We’ll then check if D € q. 


As we learnt in Ch. 74, given three non-collinear points, there are several ways to find q. 


Here we’ll first compute q’s normal vector AB x AC = (Cindy at 


We then also compute OA.- (1,1,1) =1+0+0=1. We conclude that q is r-(1,1,1) =1. 
We now check whether D € g: 2 


Oi = (0G Sibyes Geek = eels ae eenkeat y 
So yes, the four points are coplanar. 


Example 988. Let A = (2,3,5), B = (8,-1,0), C = (0,1,0), and D = (-3,-2,-1) be 
points. To check if they are coplanar, we’ll write down the plane q that contains A, B, 
and C’. We’ll then check if D € q. 


We can describe g in parametric form as 


ae eS ae 6A - 8u 
r=O0C+AAB+uBC = =| 1-442 JO,ueR). 
0 —5A 


Now check if Dé q by plugging D into the the above parametric equation: 


6A - 8 ~3 £6\-81, 
gece ry =) S14 is, 
=) =) 5 


From Z = 0.2, So, from é. yu = 21/40. These values of A and yz don’t satisfy Z. Hence, 
D¢q and the four points are not coplanar. 
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Here’s an occasionally useful shortcut: 


Fact 179. If three of four points are collinear, then the four points are coplanar. 


Proof. Given four points A, B, C’, and D, suppose A, B, and C are collinear. Some plane 
q contains A, B, and D. By Fact 158, ¢ contains the line AB and hence also the point C. 
Thus, g contains all four points. 














Example 989. Let A = (1,2,3), B = (4,5,6), C = (10, 11,12), and D = (9, 1,7) be points. 
The line AB is r = (1,2,3) + A(3,3,3) (A€ R). By setting \ = 3, we see that AB also 


contains the point C’. So, A, B, and C are collinear. 


Hence, by Fact 179, A, B, C, and D are coplanar. (Indeed, given any point E, it is 
similarly true that the four points A, B, C, and E must be coplanar.) 





Note though that the converse of Fact 179 is false (see Exercise 295). 


Exercise 295. In Example 987, we already showed that the points A = (1,0,0), B = 
(0,1,0), C = (0,0,1), and D = (1,1,-1) are coplanar. Now show that no three of these 
four points are collinear. (You will thus have produced a counterexample to the converse 
of Fact 179.) (Answer on p. 1880.) 


Exercise 296. In each of the following, determine if the four points given are coplanar. 


If they are, write down the plane that contains all four points. (Answer on p. 1880.) 
(aye A= (0,155), Be=C3.-11), @C=](@,7,5), and D = (6,6, 1). 

(b) A=(-1,3,-5), B=(0,0,0), C=(6,l-2).. and D=(4 7-12). 

(ce) ASCO.) (1h. 8), C =(2.3.4), and D = (19,0,-5). 
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78.1. Coplanarity of Lines 


Definition 186. Two or more lines are coplanar if some plane contains all of them. 








Example 990. The lines /; and /) are coplanar because the plane gq, contains both of 
them. Similarly, /) and /3 are coplanar because the plane g2 contains both of them. 


In contrast, /; and /3 are not coplanar because no plane contains both of them. 





Somewhat obviously and trivially, 


Fact 180. If two lines are identical, then they are also parallel and coplanar. 








Proof. Let the two (identical) lines be described by r = OP +)\u (A €R). 
They have parallel direction vectors. And so by Definition 150, they are parallel. 


Let v be any vector that points in a different direction from u. Then the plane r = 














OP +\u+ uv (A, 4 € R) contains both lines (to verify this, simply let p= 0). 
In the less trivial case of two distinct lines, there are three possibilities: 


Fact 181. Suppose l, and ly are distinct lines described by R= P+au and R=Q+ Bv 
(a,G¢€R). Then exactly one of the following three possibilities holds: The two lines are 


(a) Parallel and do not intersect; moreover, the unique plane that contains both lines is 
Gy = {R: R= P+du+pPQ Q.neR)}, or 


(b) Not parallel and share exactly one intersection point; moreover, the unique plane that 
contains both lines is described by m={R:R=P+dutpv (A,uweR)}; or 


(c) Skew (i.e. neither parallel nor intersect) and are not coplanar. 








Proof. See p. 1643 (Appendices). 
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Example 991. We illustrate Fact 181 using 
the last example: 


(a) The lines /) and /3 are parallel, copla- 
nar, and do not intersect. 

(b) The lines J; and /) are non-parallel, 
coplanar, and share exactly one inter- 
section point. 

(c) The lines J; and /3 are skew (i.e. aren’t 
parallel & don’t intersect) and not 
coplanar. 





It’s probably easier to remember this Corollary, which is immediate from the last result: 








Corollary 36. Two lines are coplanar if and only if they are not skew. 
And now, remember that when two distinct lines are not skew (either parallel or intersect), 
there are only two possible Cases: 


1. If they’re parallel, they don’t intersect; and 


2. If they’re not parallel, they intersect exactly once. 
And in either Case, Fact 181 says there’s a unique plane that contains both lines. This 
plane can be easily found using any of the Four Ways (Ch. 74): 

Example 992. ‘Two lines are described by 


u Vv 


ee ees ——— 
r=(8,1,1)+0(3,6,9) and r=(4,5,6)+6(1,2(a),8 eR). 


Since u || v, the two lines are parallel. Since (8,1,1) - (4,5,6) = 
(4,-4,-5) + u, the two lines are distinct and don’t intersect. 


Compute v - u = (-4,4,5) = w. Note that v + w. So, the unique plane 
that contains (8,1,1), v, and w (and hence also the unique plane that 
contains both lines) is 


Vv Ww 
—— = 


r= (8,1,1)+A(1, 2,3) + u(-4, 4,5) (A, we R). 
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Example 993. Two lines are described by 


u Vv 
_—— es = 


r=(0,0,0)+0(0,1,0) and r=(4,17,0)+8(1,0,@,8€R). 


Since u + v, the two lines are not parallel. To check if they intersect, 
write 


0 0 4 1 
Oo 1+] 1 ]=] 17 1+ 0 
0 0 0 0 


Solving, @ = 17 and B = -4. So, the two lines intersect at 
(0,0,0) +17 (0,1,0) = (4, 17,0) —-404,0,0) = (0, 17,0). 


Hence, the two lines aren’t skew. The unique plane that contains both lines is also the 
unique plane that contains (0,0,0), u, and v. And this plane is 


r = (0,0,0) + Au + pv, (A, u¢€R). 


Example 994. ‘Two lines are described by 


u Vv 
a ee 


— 
r = (0,1,2) + a(9, 1,3) and r = (4,5,6) + B(3, 2(d),8 € R). 


Since u + v, the two lines are not parallel. To check if they intersect, 
write 


94 244+38, 


+B 14425426, 
2+3a 26+ 8. 


Take + minus 3x 2 to get -6 = -14, a contradiction. So, the two lines do not intersect. 
So, they’re skew and not coplanar. 
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Part IV. 
Complex Numbers 
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What I cannot create, I do not understand. 


Know how to solve every problem that has been solved. 


— Richard Feynman’s blackboard at the time of his death (1988). 
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79. Complex Numbers: Introduction 


the historical sequence of extensions of the number system, from natural numbers to 
integers to rationals to reals to complex numbers, can with hindsight be interpreted 
as a quest to make more and more equations have solutions. 


— lan Stewart (2015). 


Here’s a brief and simplistic motivation of complex numbers: 
1. Solve x-1=0. Easy; the solution is a natural number: x = 1. 
2. To solve x +1=0, we must invent negative numbers. The solution is x = -1. 


3. To solve x? = 2, we must invent irrational numbers. The solution is x = +V2. 


4. To solve x? = -1, we must invent the imaginary unit: 


Definition 187. The imaginary unit, denoted i, is the number that satisfies i? = -1. 





Or equivalently: i= V-1. And so, the solution to 2? = -1 is x = 4i=4+V-1. 


In this textbook, we'll happily (and naively) assume that the “usual” rules of arithmetic 
also apply to the complex numbers. 


Any real-number multiple of the imaginary unit is called an imaginary number: 


Definition 188. An imaginary number is any ib, where be R. 


Example 995. Imaginary numbers: 21 = 2/V-1, -i = -V-1, 5.21 = 5.2V-1, —99i = 
—99V/-1. 


Example 996. The number i is both the imaginary unit and an imaginary number. 


Example 997. Perhaps surprisingly, 0 is also an imaginary number! This is because we 
can write 


0 = (i. 


And so, by Definition 188, 0 is imaginary. 


Indeed, 0 is both real and imaginary. 


This may seem weird, but you can treat it as just another arbitrary convention (that turns 
out to be sometimes more convenient than the alternative convention of not treating 0 
as an imaginary number). 





We can now define a complex number as the sum of any real and any imaginary number: 





3©0Tn this textbook, we’ve been neither clear nor explicit about what these rules are. We have simply 
assumed that everyone, including you the student, “knows” what they are. 
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Definition 189. A complex number is any a+ib, where a,béR. 


The set of all complex numbers is C = {a+ib: a,b € R}. 


Example 998. All of these are complex numbers: 0, 1, -5, 4/3, -2.881, V2, 7, i, 2i, 
2 my 3, 5 


Indeed, all numbers we’ll encounter in A-Level Maths (and this textbook) are complex 
numbers.°°! 


The last three of the above numbers (3 + 2i, -5 - i, -2 + 0.5i) are complex numbers that 
are neither real nor imaginary.*©? Such numbers are of the form a+ ib, where a,b + 0. 


Definition 190. A impure number is any complex number that is neither real nor imag- 
inary, i.e. any a+ib, where a,b¢€ R and a,b #0. 


Example 999. Impure numbers: 3+ 21, -5—i, -2 + 0.5i. 





Earlier Ch. 4.10 gave a Venn diagram of the real numbers. We now expand that diagram 
to produce this Venn diagram of the complex numbers: 


R = The set of all real numbers 


Irrationals 
852.99 


Rational 
non-integers 


Imaginary 


5.21 





Rationals Q 


C = The set of all complex numbers 


With one very special exception, every complex number is either real, imaginary, or 
impure (and not any two of these). The one very special exception is the number 0, which 
is both real and imaginary. 


We have C2 R2Q22Z. 





3©2Reminder: Infinity is not a number. In particular, infinity is not a complex number. 
Examples of numbers that aren’t complex: Quaternions, octonions, and hypercomplex numbers. ts 
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Example 1000. Each of the numbers i = /-1, -i = -V-1, 5i, 1 + 5i, ~/2i, 14/21 is 
either imaginary or impure. None is real, rational, irrational, positive, negative, or an 
integer. 


—4 — 2) 
Complex J J 








Imaginary 





Impure J 
Real 








An integer 





Positive 





Negative 





Rational 



































Irrational 




















®3Recall (Ch. 4.9) that we defined Q = {x ¢ R: x =a/b for some a,b¢ Z} and {Irrationals} = R \ Q. 
364 See SB (especially ajotaxte’s answer). 
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Exercise. Fill in the table. 


=e 


(Answer on p. 1882.) 


/5+0i | —200i 





Complex 





Imaginary 





Impure 





Real 





An integer 





Positive 





Negative 





Rational 




















Irrational 


Exercise 298. Explain if each statement is true or false: 


(a) “Zu {Rational non-integers} = Q”. 

(b) “Qn {Irrationals} = @.” 

(c) “OeC\R” 

(d) “{Imaginary numbers} n {Impure numbers} = 2.” 
(e) “Zc {Imaginary numbers}.” 

(f) “Zn {Imaginary numbers} = 2.” 


(g) “({Imaginary numbers} u {Impure numbers}) 1 R = @.” 
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(Answer on p. 1882.) 
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79.1. The Real and Imaginary Parts of Complex Numbers 


Definition 191. Given a complex number z = a+ ib, its real part is Rez = a and its 
imaginary part is Im z = 6. 


Example 1003. Let z=3+2i. Then Rez = Re (3 + 2i) = 3 and Imz = Im (8+ 2i) =2. 


Example 1004. Let w=7. Then Rew = Re7=7 and Imw = Im7=0. 


Example 1005. Let w = 19i. Then Rew = Re (19i) = 0 and Imw = Im (19i) = 19. 


“Obviously”, two complex numbers z and w are equal if and only if 





Rez = Rew and Im z = Imw. 


Example 1006. Let z =3+ib and w=a-17i. If z=w. then 


Pa, 


Exercise 299. Exactly two of a, b, c, and d are identical. Which? (Answer on p. 1882.) 


ee ampere 
Se esha 


3 
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79.2. Complex Numbers in Ordered Pair Notation 


It is also often convenient to write complex numbers in ordered pair notation, with the 
first term being the real part and the second term being the imaginary. 


Example 1007. z = 3+ 2i= (3,2). 


Example 1008. w =7 = (7,0). 


Example 1009. w = 19i = (0,19). 
In general, given a complex number z, we can also write 


z=Rez+ilmz=(Rez,Imz). 


Exercise 300. Rewrite each number in ordered pair notation. (Answer on p. 1882.) 





(a) z=33(1+ei) (b) w = (237 +m) -(V2-3)i (c) Rew=p, Imw=q. 
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79.3. The Usual Ordering Does Not Apply to Non-Real Numbers 


The “usual ordering” refers to <, <, >, or >. 


In this textbook, we haven’t precisely defined what exactly any of these four symbols means. 
We’ve simply relied on your informal and intuitive understanding of what each means: 


Example 1010. 5>4,3<8, 2>-6,1>0. 


The title of this brief subchapter says this—if a or 6 is not real, then the following statements 
are simply left undefined or meaningless: 


O<b. G20..0> 0; a2 0. 


Example 1011. These statements are left undefined or meaningless: 


3+51>4,i<5i, 0>-i, i>-1-i. 





That’s all you need know for the A-Levels. For a deeper discussion, see e.g. &%. 
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80. Some Arithmetic of Complex Numbers 


To repeat, we’ll happily (and naively) assume that the “usual” rules of arithmetic also apply 
to the complex numbers. In which case, addition and subtraction are especially simple: 


80.1. Addition and Subtraction 


Example 1012. Let z = -2 += (-2,1)i and w = 3i = (0,3). Then 


Z+w=-24+4i and Z-w=-2-2i. 


Or, z+w=(-2+0,1+3) = (-2,4) and z-w =(-2-0,1-3) = (-2,-2). 


Example 1013. Let z=7-i=(7,-1) and w =2+5i= (2,5). Then 
z+w=9+4i and Zi = 9 — O61, 


Or, z+w=(7+2,-1+5) =(9,4) and z-w=(7-2,-1-5) =(5,-6). 





In general, 


Fact 182. Suppose z=a+ib=(a,b) andw=c+tid=(c,d). Then 


(a) z+w=atcti(b+d) =(at+c,b+d); and 
(b) z-w=a-c+i(b-d) =(a-c,b-d). 


Exercise 301. For each, compute z+w and z-w. (Answer on p. 1883.) 





(a) z=-542i,w=7431. (b) 2=3=-i,w=1142i. (c) 2=142i, w=3=-V2i. 
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80.2. Multiplication 


Here are the powers of i: 


21, i? =ixi=-l, i =ix i? =-i, i =ixiP=1, 
iP =ixi* =i, if =ixi=-l, if =ixi? = -i, b=ixiP=1, 
9 «8 10 11 2 “12 3 


etc. 


Observe that it = 1. And so, the cycle repeats after every fourth power. 


The “usual” rules of multiplication hold: 


Example 1014. Let z=i and w=1+i. Then zw =i(1+i) =i(1) +i? =i-1. 


Example 1015. Let z= -2+i and w =3i. Then 
zw = (-2 + i) (3i) = (-2) (3i) +i (3i) = -6i + 3i? = -3 - 6i. G 


Google does the basic arithmetic of complex numbers as well as Wolfram Alpha, but 
much more quickly. So here in Part IV, whenever you see the G logo, click on it to go to 
the relevant computation done by Google. 


Example 1016. Let z=2-i and w=-1+i. Then 
= (2 (a ee 
Example 1017. Let z=3+2i and w =-7+4i. Then 


zw = (3+ 2i) (-7 + 4i) = -21 + 121i - 141 + 812 = -29 - 21. 





In general, 


Fact 183. If z=a+ib=(a,b) and w=c+id= (c,d), then 





zw =ac— bd +i(ad + bc) = (ac— bd, ad + bc). 





Proof. See Exercise 303. 
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Recall that (x+y)? = 2? + 2ey + y? and (x+y)° = 2° + 322y + 32y? + y?. Hence, 


(a+ib)° = a? + 2a (ib) + (ib)? = a? + 2iad — B?, 
(a +ib)® = a® + 3a? (ib) + 3a (ib)? + (ib)® = a? + 3ia2b — 3ab? - ib*. 


Let’s jot these down formally: 


Fact 184. (a) (a+ib)? = a2 + 2iab - 0. 
(b) (a+ ib)® = a3 + 3ia2b - 3ab? - ib?. 


Example 1018. Let z = 3+2i. To compute z?, we can use Fact 184(a): 
PaO P) 33 298 => 20 (a4. 
Instead of using Fact 184(a), we could do the usual multiplication: 
2° = (3+ 2i) (3 + 2i) =9+ 6i+ 61-4 =5 412i. 
And to compute z°, we can use Fact 184(b): 
2? = (34+2i)° = 33+ 3-32. 2i- 3-3-2? - 23] = 27 + 54i — 36 - 81 = -9 + 461. 

Again, instead of using Fact 184(b), we could do the usual multiplication: 

23 = 27z = (5+ 12i) (3 + 2i) = 15 + 10i + 36i — 24 = -9 + 46i. 


Exercise 302. For each, compute zw, 27, and z?. (Answer on p. 1883.) 
(a) 2=-5429i,w=743i1. (b) 2=3=i,w=1142i. (c) 2=142i, w=3-V2i. 


Exercise 303. Prove Fact 183. (Answer on p. 1884.) 


Exercise 304. If 2+i solves az® + bz? + 3z-1=0, then what are a and b?(Answer on p. 
1884.) 
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80.3. Conjugation 


Example 1019. Let z=1+i. Then the (complex) conjugate of z is z* = 1-i. 


We call z =1+i and z* =1-ia (complex) conjugate pair. 


Definition 192. Given the complex number z = a+ib, its (complex) conjugate is z* = a-ib. 


Also, z=a+ib and z* =a-ib are called a (complex) conjugate pair. 


Example 1020. The conjugate of w = -5 - (17+ v2); is w’ =-5+ (17+ v2) i; and w 
and w” are a conjugate pair. 


Example 1021. The conjugate of w = 10 is w* = 10; and w and w” are a conjugate pair. 





Example 1022. The conjugate of a = 2i is a* = -2i; and a and a* are a conjugate pair. 


“Obviously”, the conjugate of the conjugate of a complex number z is z itself: 
Example 1023. Let z=1+i. Then z* =1-i and (z*)" =1+i=z. 
Example 1024. Let w =-5-17i. Then w* = -5+17i and (w*)" =-5-17i=w. 


Example 1025. Let w= 10. Then w* = 10 and (w*)* = 10=w. 





Example 1026. Let a =2i. Then a* = -2i and (a*)" = 2i=a. 
Recall?’ that a+b and a—b were called a conjugate pair because 
(a+b)(a—b) 202-07. 


Recall also that when a denominator contained a surd, we could often use + to rationalise 
(“make rational”) the denominator: 


ee a eo) 
(eae t=/5 eS 


Here we can play a similar trick. Observe that if z = a+ib, then 


Example 1027. 





zz* = (a+ ib) (a— ib) 2 a? - (ib)” = a? - 20? 2 a? +B. 


2 ‘ 
We can often use = to realise (“make real”) a denominator that contains a complex number: 





SoC hi. See 
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Example 1028. Let z = 1+i. Consider the reciprocal of z: 


1 
ees 


1 
g 


In general, it is easier to deal with “simpler” denominators. We might thus like to rid 
the above denominator of any complex numbers. 


To do so, simply multiply by z*/z* = 1: 


* 
z 


(aN 
1 el eed oi ee et es 


Poe ey ee. 





Fact 186. If z=a+ib= (a,b), then 





(a) 2s =a +1) = (a 40): 








Proof. See Exercise 306. 











A few more examples: 
Example 1029. Let z = -3 + 5i = (-3,5). Then z* = -3 - 5i = (-3,-5) and 


gee See 


i 


z 3452 34 34 34 34 
Example 1030. Let w =1-i=(1,-1). Then w* =1+i=(1,1) and 


ia bel ee ae | 
uw (oe) 2 Te5 = S 





ee | 
i=—(1,1). 


Example 1031. Let w=1+i=(1,1). Then w* =1-i=(1,-1) and 


Exercise 305. In each, express z’s conjugate and reciprocal in the form a + ib. 


(a) z=-5 + 2i. (by 2-31. (c) z=14+2i. (Answer on p. 1884.) 


Exercise 306. Prove Fact 186. (Answer on p. 1884.) 
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80.4. Division 


We can now divide one complex number by another: 


Fact 187. If z=a+ib=(a,b) and w=c+id=(c,d) with w #0, then 


1 
7 = Ba gn (ae + bd, be - ad). 
C 





eS 2a PAT 1 
Proof. aa = ae = eae = 2a pe (ac + bd, be ad). 














Example 1032. Let z= -2+i and w =3i. Then 


2 _ (-2+i)(-3i) _3+6i 1 2, 
w 383i 02 +32 9 Neo wos 


—i. 
3 
Example 1033. Let z=3+iand w=1-i. Then 


cape en eee ee ee 
i Sela Clee 2 2 
Example 1034. Let z=1+iand w=3-2i. Then 


ele Zoe _ (+i +2i) _ 1+5i_ 1 5, 


320) | ae 13 osm 





Example 1035. Let z=2-iand w=-1+i. Then 


4 zw* — (2-i)(-1-i) -3-i_ 3 i 
. Be oe 


ve 
i ee (ee D. 9 


Example 1036. Let z=3+2i and w =-7+4i. Then 


2 342) 2u*> (84 2i)(-7-4) 7-13-2619 1 2. 
iy 27 eA eee 9 65 ie 65 oe 





Example 1037. Let z =-3+6i and w=2+i7. Then 


2d Ol eee (3 Ol) 2) Oe int ona M+4, 


= — = = = to ae 
w Q+in 22477 44+ 7 4+ 7 44+ 7 4+ 7 


Exercise 307. Express z/w in the form a + ib. (Answer on p. 1884.) 
(a)z=14+3i, w=-i. (b)z=2-3i, w=lti (ce) z=V2-mi, w=3-V2i. 
(d)Z=1142i, wei. (e) z =-3, m=24i. (d)ege732), webti, 
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81. Solving Polynomial Equations 


Recall (Ch. 14) that if the quadratic equation ax? +bxr+c = 0 has non-negative discriminant 
(ic. b? —4dac > 0), then it has two real roots, which are given by 


1 —b+ Vb? - 4ac 
7 2a 





x 


Now that we’ve learnt a little about complex numbers, we can more simply say that re- 
gardless of the sign of the discriminant: 


Fact 188. Every quadratic equation has two complex roots given by 2, 


Proof. See Theorem 20 below (the Fundamental Theorem of Algebra). 














Example 1038. Consider the quadratic equation x? — 2x + 2 = 0. 
Its discriminant is negative: b? — dac = (-2)?-4(1) (2) =-4<0. 


Nonetheless, like every quadratic equation, it has two complex roots: 


b+ VP -dac _2+V-4_ | V4xv-l 2i 


=l41—=14i. 
2 





2a 2 2 


In this case, both roots are non-real. 


Example 1039. The quadratic equation x?-3x+2 = 0 has positive discriminant: b?—4ac = 
(-3)?-4(1)(2) =1>0. Thus, both of its complex roots are real: 


b+ VP -dac 34V1_ 


i 2. 
2a 2 


Example 1040. The quadratic equation x? — 27 +1 = 0 has discriminant zero: b? — 4ac = 
(-2)?-4(1) (1) =0. As usual, its two roots are 


—b+Vb?-dac 2+V0_ 
2a a ap ad 


I 





Hmm ... this time there’s only one root, namely 1. Doesn’t this contradict Fact 188? 
Well, here we’ll cheat a little, by calling 1 a repeated or double root of the quadratic 
equation 27 -2xr+1=0. You can think of this as a sort of cheap accounting trick to 
ensure that Fact 188 (and later on also Theorem 20) are “correct”. 


Exercise 308. Solve each equation. (Answer on p. 1885.) 


(a) 2? +r+1=0. (b) 2?+2xr+2=0. (c) 327+3r+1=0. 








3°6This is a somewhat simplistic explanation. Repeated or multiple roots actually have greater significance 
than merely ensuring Fact 188 or Theorem 20 holds. 
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81.1. The Fundamental Theorem of Algebra 


By Fact 188, every quadratic equation has two (possibly repeated) complex roots. It turns 
out this is more generally true: Every nth-degree polynomial equation in one variable has 
n (possibly repeated) roots. This is the Fundamental Theorem of Algebra (FTA): 


Theorem 20. (The Fundamental Theorem of Algebra) Suppose ay #0. Then the 
following equation has n (possibly repeated) roots: 


il 2 


ane” + a,x" + Ane” ++++ + An-1L + An = 0. 








Proof. Omitted.>° 











Example 1041. The 2nd-degree polynomial (or quadratic) equation x? — 1 = 0 has two 
roots, namely 1 and -1. 


Example 1042. The 2nd-degree polynomial (or quadratic) equation x? + 1 = 0 has two 
roots, namely i and —2. 


Example 1043. By the FTA, the 3rd-degree polynomial equation (or cubic equation) 
x? —8=0 has three roots. Let’s find them using what we learnt in Ch. 38. 


Observe that 2? - 8 = 0. So one root is 2 and x —2 is a factor of x? —8. 


To find the other two factors, write 
xv? —8 = (x-2) (ax? + bar +c) =axr* + (b- 2a) 27+?a — 2c, 
Comparing coefficients, we have a = 1, b= 2, andc=4. (Note that ? stands for a coefficient 


we don’t bother to calculate because it isn’t necessary. We have three unknowns a, 8, 
and c; and so, it is only necessary to compute three of these coefficients.) Thus, 


P= S=(¢—2)\(e 42044), 


We can then further factorise x? + 27 + 4 using the usual quadratic formula: 


Py —b + Vb? —4ac © —2+,/2?-4(1) (4) - 14 JV3i. 


2a Zo 


Altogether then, the three roots of the 3rd-degree polynomial equation x° — 8 = 0 are 
2, =] 3i, Sy i ss 
And here is the cubic polynomial x? — 8 factorised into its three linear factors: 


x3 -8 = (2-2)(a+1- V3i)(e+1+ vi). 


Exercise 309. Verify that -1 + V3i solve x - 8 = 0. (Answer on p. 1885.) 








367See e.g. Schilling, Lankham, & Nachtergaele (2016, Ch. 3). 
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As noted earlier, there may sometimes be repeated or multiple roots: 


Example 1044. The 2nd-degree polynomial (or quadratic) equation 


x? -22+1=(r-1)? =0 


has two repeated or multiple roots, namely 1 and 1. 


Example 1045. The 3rd-degree polynomial (or cubic) equation 
xv - 62? + 12x -8 = (x -2)° =0 


has three repeated or multiple roots, namely 2, 2, and 2. 





The FTA can be useful even if we have no idea how to solve an equation. 


Example 1046. We may have no idea how to solve the 17th-degree polynomial equation 
a +3x4-2r+1=0. 


Nonetheless, the FTA gives us a useful piece of information, namely that this equation 
must have 17 roots or solutions (though some may possibly be repeated). % 


Exercise 310. Solve x° + 64 = 0. (Answer on p. 1885.) 


Exercise 311. You’re told that 1 solves both of the equations given below. Find the 
other roots of each equation. (Answer on p. 1885.) 


(a) 2? +2?-2=0. (b) xt-a2?-27+2=0. 
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81.2. The Complex Conjugate Root Theorem 


Example 1047. The equation x? — 2x +2 =0 has roots 1+i and 1-i. 


1 
Example 1048. The equation 7z? + x + 1 = 0 has roots aA 





The above examples suggest that if z = p+ig solves the quadratic equation ax? + br +c = 0, 
then so too does its conjugate z* = c—id. It turns out that this is generally true of any 
polynomial equation, provided the coefficients are real: 


Theorem 21. (Complex Conjugate Root Theorem.) Suppose co,ci,...,¢n € R. If 
z=a+tib solves Cyt” + Cp_10" | + Cy_ou"* +--+ +010 + Cy = 0, then so does z* = a-ib. 








Proof. See p. 1646 (Appendices). 











Example 1049. If z = 2-i solves x? -2?-7r+15 = 0, then by Theorem 21, the conjugate 
z* =2+/i also solves the same equation. % 


Example 1050. If both i and 0.5i solve 474 + 52?+1 = 0, then by Theorem 21, the 
conjugates —i and —0.5i also solve the same equation. % 


Example 1051. If -1+2i is a root of x? —- 3x? —5a-25 = 0, then what are the other two? 
Well, by Theorem 21, we know that —1- 2i must be another root. 
So, both e-(-1+ 2i) = 7+1-2i and x-(-1-2i) = 4+1+2i are factors of x? -32?-5x2-25. 


Compute (4-12 @ 14 2 Se 1) — Oi) =e" oF 5, 
Now write, 2° — 327-5 -25 = (x +25 + 5) (de =a 40 a op: 


Comparing coefficients, we have a = 1 and b=—-5. Thus, 


e382 br —2h= (2° 4 224 6) (25) = (24 1-21) (@ el + 21) @—4). 


Altogether then, the three roots of the cubic equation x? - 32? - 5a - 25 = 0 are 
—14 2i, —1 - 2i, Bi 


Example 1052. Let p,q¢R. If 3+ 2i solves x7 + px + q = 0, then what are p and q? 
Well, by Theorem 21, 3 - 2i also solves this equation. ‘Thus, 


peg =| Soe 2 ile = (3 — l= es) =) =e = eI, 


Comparing coefficients, we have p = -6 and q = 13. 
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Example 1053. We’re given that i solves «* + 2? - 5x? + 2-—6-=0. What are the other 
three roots? 


Well, by Theorem 21, we know that -i must be another root. 


So, both z-iand x +i are factors of c++ 2° -5x27+2-6. 


Compute (x-i)(2+i)=2?-P=074+1. 


Now, vi +23 -52?+2-6=(27+1) (ar? + br +c) =ax' + bar +2274? +c. 


Comparing coefficients, we have a =1, b=1, and c=-6. Thus, 
G2 2b; 4a—-0=> (2°41) (2° +2 =6)., 
By the quadratic formula or otherwise, we have 
a? +x2-6=(2+3)(x-2). 
Altoeetier phen ee a =o 4 == 0 iio) a 2 


4 


And the four roots of the quartic equation x* + 2° — 5x2? +2-6=0 are 





By the way, the condition that all coefficients co, c,...,¢p in the polynomial equation are 
real is important. If this condition is violated, then the Theorem’s conclusion may not hold: 


Example 1054. We are told that —2+i solves x? - (5 + 4i) x + (-17 +i) =0. 


Observe that not all of the coefficients in = are real. And so, Theorem 21’s conclusion 
may not hold. 


And indeed, it does not. As you should verify yourself, the conjugate —2-i does not solve 
Z. Instead, the other solution to 2 is 7+3i. 5 


Exercise 312. You’re told that 2-3i solves both of the equations below. Find the other 
roots of each equation. (Answer on p. 1886.) 


(la 6 le 17 0, (b) -2a* + 212° - 93a? + 229% - 195 = 0. 


Exercise 313. Let p,q¢R. If 1-i solves 2? +px+q=0, then what are p and qg?(Answer 
on p. 1886.) 
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82. The Argand Diagram 


In Ch. 7.2 (and also earlier in secondary school), we learnt that ordered pairs of real numbers 
can be depicted geometrically as points on the cartesian plane, where the horizontal or 
x-axis corresponds to the first coordinate of an ordered pair, while the vertical or y-axis 
corresponds to the second. 


We just learnt that the complex number z = a+ib may also be written as an ordered pair: 
2 = (0,0): 


And so, we shall also depict complex numbers geometrically as points on the the complex 
plane or Argand diagram. The horizontal or x-axis is now our real axis, while the 
vertical axis is now our imaginary axis. 


Example 1055. The Argand diagram below depicts seven complex numbers. 
The real numbers -3 = (-3,0) and 4 


2 = (2,0) are on the horizontal axis. 3 a (-3,2) 4 § 21 = (0,2) 


The imaginary numbers 2i = (0,2) and 
—3i = (0,-3) are on the vertical axis. 6 lei=(i41) 

The number 0 = (0,0) is both real and 0 = (0,0) . 
imaginary. And sure enough, it is on 
both the horizontal and vertical axes. -3 = (-3,0) 2 = (2,0) 


@ @ @ > 





The impure imaginary numbers —3 + 
2i = (-3,2), 1+i= (1,1), and 1-3i= 
(1,-3) are not on either axis. —3i = (0, -3) 





eas (he.) 


Exercise 314. Draw 2, -1, 2i, 1+2i, -1-3i on an Argand diagram. (Answer on p. 1886.) 
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83. Complex Numbers in Polar Form 


So far, we’ve written complex numbers in cartesian (or standard or rectangular) form, 
i.e. as either 


z=atib or ¢= (a,b). 


In this chapter, we’ll learn to write down a complex number in polar form. (And in the 
next chapter, we’ll learn how to do so in exponential form.) 


To write down z =a+ib= (a,b) in cartesian form, we need two pieces of information: 
its real part Rez =a; and its imaginary part Im z = b. 
To write down z in polar form, we likewise need two pieces of information: 


its modulus, denoted |z]; and its argument, denoted arg z. 
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83.1. The Modulus 


We just learnt to depict a complex number z on an Argand diagram. Now consider z’s 
position vector (i.e. the vector from 0 to the point that is z). The modulus of z, denoted 
|z|, is simply the magnitude or length of its position vector: 


A z 






|z| = Length of 
position vector 
> 





x 





Definition 193. The modulus of the complex number z = a+ib is denoted |z| and defined 
as this number: 


z| a a2 62. 





Or equivalently, Jz| = \/ (Rez)? + (Imz)’. 


Example 1056. Let 


eae) 2 =3+42i = (3,2) 


w=-2i= (0, -2), \2| 2 Via 
we 2-9) = (-2,-2). 


ee eee ake 


jeu] = 1 0? + (-2)" = 2, 





= (2)? + (-2)? = 2v2. 





Exercise 315. Compute the moduli of these numbers: (Answer on p. 1887.) 


(on = Sei 
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83.2. The Argument: An Informal Introduction 


The argument of a complex number z is denoted arg z. 


Informally, it is the angle that z’s position vector makes with the positive x-axis, with some 
qualifications: 


e If z is on or above the z-axis (i.e. Imz>0), then arg z € [0,71]. 


¢ If z is below the z-axis (i.e. Imz <0), then arg z € (-71,0). 


Example 1057. Let z= 3+ 2i = (3,2), w = -2i = (0,-2), and w = -2 - 2i = (-2, -2). 


O7t 
Then arg z = tan! — 0.588, argw=7, and argw=-—. 


Notice argz and argw are measured anti- 
clockwise from the positive x-axis, while arg w 
is measured clockwise from the positive x- 
axis. We'll adopt these informal rules: 


¢ If z is on or above the z-axis (i.e. Im z > 0), 
then arg z € [0,71]. 

¢ If z is below the z-axis (i.e. Imz < 0), then 
arg z € (-71,0). 





Altogether, for any z # 0, we have argz ¢ 
(-7, 7]. (We'll leave arg 0 undefined.) 





Exercise 316. Find the arguments of 2, -1, 2i, 1+ 2i, and -1 - 3i.(Answer on p. 1887.) 


Exercise 317. Depict z = 2-i and w = -3+ 2i on an Argand diagram. What are moduli 
and arguments? (Answer on p. 1887.) 
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83.3. The Argument: Formally Defined 
A 
We now work towards a formal definition of the y 
argument. We’ll do so using what we learnt 
about vectors. 


Let z= (a,b) and w = (c,d) be non-zero com- 
plex numbers, where z is on or above the x-axis 
(i.e. b> 0), while w is below it (i.e. d< 0). 

Let z = (a,b) and w = (c,d) be the correspond- 
ing position vectors. Let i= (1,0) be the unit 
vector that points in the direction of the posi- 
tive x-axis. 











Let 6 be the angle between z and i; and pz be the 
angle between w and i. Following our informal ee 

w=ctid 
discussion on the previous page, we want 


arg w = —p 





aree2=¢ and arg w = —jL. 


Recall (Definition 145 in Ch. 145) that the angle between z and i is 
7 Ze 4 (a, b)+(1,0) e a+0 : a 7 
G=cos = tog?) og * te ite * 
[2| |i I(a, 6) ||, 0)| Var+b-1 Var+ 0 2 
Similarly, the angle between w and i is 


fad = cos’! ails = cos”! (c, d) * (1, 0) _ =i i 0 = Cc “ 


; = cos | ——————_ = cos *’ —————. = cos 
[w|i I(e, 4) ||, 0)| Ve+d-1 Ver + a jw 





a C 
Thus, arg z=0=cos~ — and arg w =—p=—cos | —. 


jw 


The above discussion motivates this formal definition of a complex number’s argument: 


Definition 194. Let z =a+ib #0. The argument of z is denoted arg z and is the number 
defined by®® 


Lino 


i Oe 0) 





R 
cos? a if Im z > 0, 
Zz 
Or equivalently, arg z= 
R 
~cos? a if Imz <0. 
Ps 





3°8We could also think of arg: C \ {0} > (-7, 71] as the function with the mapping rule 2, 
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As previously noted, arg 0 is undefined. 


Example 1058. Let z = 3+ 2i = (3,2), 
w =-2 = (-2,0), and w = -2 - 2i = (-2,-2). 
We have |z| = V13, |w| = 2, and |w| = 2V2. And 
so by Definition 194: a 








Note that there’s a negative sign before arccosine for w (because w is below the z-axis). 


In contrast, there isn’t one for either z or w (because they are on or above the z-axis). 





The next result is immediate from the Argand diagram and Definition 194: 
Fact 189. Let z be a non-zero complex number. Then 


ane , : : 7 
(a) z is imaginary <=> z is on the y-aris <> argz= £5: 
(b) z is a positive real number <> z is on the positive x-aris <> argz=0. 


(c) z is a negative real number <=> z is on the negative x-avis — > argz=T. 


Exercise 318. Find each number’s argument, but this time using Definition 194. (Check 
that your answers are the same as before.) (Answer on p. 1887.) 


12), 923). eeo=i. wes e0r 
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83.4. Complex Numbers in Polar Form 


Suppose z = a+ib = (a,b) is a complex number with 











a,b>0. Let @=argz and r =|z|. Then , y 
A a O bd 
@=—=-— d in@=—=-. z=atib=(a,b 
cosO = T= an sin =F = 7 (a,b) 
Rearranging,a = rcos@ and b=rsiné. : 
0b 
Thus, we may also write z in polar form: 
z=a+ib=rcos0+irsin@ =r (cosé + ising). > 
a x 





Let’s jot this down as a formal result: 


Fact 190. Suppose z is a non-zero complex number with |z|=r and argz=@. Then 





z=r(cos@+isin@). 


Proof. We already proved this result above, but only in the case where both a and b are 
positive. For a complete proof, see p. 1646 (Appendices). 














Example 1059. Let z = 5 - 2i = (5,-2). Then 
5 
r= |z| =\/52 + (-2) = V29 and =. =o. 
V 29 
So, z may also be written in polar form as 
z=r(cos@+isin@) » V29 (cos —0.381 + isin -0.381). 
Example 1060. Let z = 14+3i= (1,3). Then 


iL 
p=|el=¥ 12-32=4/10 and p= are27= cos rs] On op) 
V 10 


z=r(cos@+isin@) » V10 (cos0.322 + isin 0.322). 
Example 1061. Let z = -4+7i = (-4,7). Then 


—4 
r= |2)=\/(-4)+ 7 =V65 and @=arez = cos Je ~ 2.090. 


z=r(cos@+isin@) » 65 (cos 2.090 + isin 2.090). 


Exercise 319. Rewrite each complex number in polar form. (Hint: We already computed 
their moduli and arguments in earlier exercises.) (Answer on p. 1887.) 


10), P23 220i. ayes 
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84. Complex Numbers in Exponential Form 


In this chapter, we introduce Euler’s Formula, then use Euler’s Formula to write down 
complex numbers in exponential form. 


Theorem 22. (Euler’s Formula) Suppose 0¢R. Then e" = cos6 + isin 0. 


Proof. See p. 1647 (Appendices). 














Richard Feynman called the above “the most remarkable formula in mathematics”.* 


Plug 6 = 7t into Euler’s Formula to get 
e™ = cosm+isin7a =—-1+0-=-1. 


Rearrange to get Euler’s Identity: 


Corollary 37. (Euler’s Identity) e'™+1= 0. 


Euler’s Identity is one of the most extraordinary and beautiful equations in all of math- 
ematics. It links together five of the most fundamental mathematical constants: 


€, i, ot, 1, and 0. 


Fun Fact 


Leonhard Euler (1707-83) was a stud. There are so many mathematical results and 
objects named after him that there is even a Wikipedia entry just to list the things that 
have been named after him! 


This can sometimes result in confusion. For example, what we call Euler’s Formula is 
called Euler’s Identity by others and vice versa. 


As another example, Euler’s number e = 2.718... is different from Euler’s constant 
= Ona iia 


Even if we ignored his output before he turned blind at around 60, he’d still figure amongst 
the greatest mathematicians in history. 








3°9 The Feynman Lectures on Physics (1964, p. 22-10). 
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84.1. Complex Numbers in Exponential Form 
Let z be a non-zero complex number with r = |z| and 6 = arg z. Then by Fact 190, 
ze r(cos@+isin@). 
By Euler’s Formula, cos @ +isin@ 2? Now plug 2 into = to get z in exponential form: 
z=reo. 
Let’s jot this down as a formal result: 


Fact 191. Suppose z is a non-zero complex number with r = |z| and 0 = arg z. Then 


Example 1062. Let z = 5 - 2i = (5,-2). Then 


5 
r= |z| = 1/524 (-2)7 = /29 and 6 =argz=-cos? x —0.381. 


So, we can write z in exponential form as 
ye re? Ly /F9e~9-381i, 


Example 1063. Let z = 14+3i= (1,3). Then 





1 
: ~ 0.322. 


r=|z)=V12 +3? = V10 and @ =arg 2 = cos” 


ee es 
Example 1064. Let z = -4+7i = (-4,7). Then 


—4 
r= |z|=/(-4) +7 = V65 and f= are2 = cos Ve « 2.090. 


z= re? = /65 02-9905, 


Exercise 320. Rewrite each complex number in polar form. (Hint: We already computed 
their moduli and arguments in earlier exercises.) (Answer on p. 1887.) 


to; 2128 22044, S234 9% 
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Altogether, we’ve learnt to write complex numbers in three ways: 


Cartesian (or standard or rectangular) form 


z=atib=Rez+ilmz 


Polar form 


z=r (cos +isin@) = |z| (cosarg z + isinarg z) 


Exponential form 


iarg z 


z=re =|zle 
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84.2. Some Useful Formulae for Sine and Cosine 


Fact 192. Suppose 0¢€ R. Then 


id -i0 


eae 
2 


cos 6 = 





Proof. By Theorem 22 (Euler’s formula: e! = cos@ + isin@), 


e+e cos +isiné + cos (-0) + isin (-0) 


























2 2 
cos@+isin@+cos@-isin@ 2cosé 
= = = cos 6; 
2 
e@—e? cos +isin 6 — [cos (-0) + isin (-0)] 
py 2i 
cos?+isind-—cosé+isind 2isind . 
———  — —=sind. 
Zi 2i 
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85. More Arithmetic of Complex Numbers 


Now that we know how to write complex numbers in polar and exponential forms, their 
arithmetic becomes easier. Consider multiplication: 


Fact 193. Let z and w be non-zero complex numbers. Then 


(a) |zw] = |2| |w; and (b) arg (zw) =argz+argw + 2kn, 


if argz+argw > 1, 
where in (b), if argzt+argw€ (-7,71], 


if argz+argw < —-7. 








Proof. For (a), see Exercise 322. For (b), see p. 1648 (Appendices). 











The additional term 2k7t in Fact 193(b) is to ensure that arg (zw) € (-71, 71], as is required 
by the definition of the argument. A few examples will make this clear: 


Example 1065. Let 
z= 5-2i = (5,-2) » V29 (cos—-0.381 + i sin-0.381) = V/29e70384, 


w=1+3i= (1,3) ~ V10 (cos1.249 +i sin 1.249 ) = V10e!?%, 


By Fact 193, = (a) |zw| = V29V 10 = V 290; and 
(b) arg (zw) = argz+argw + 2k7 x -0.381 + 1.249 + Ot » 0.869. 


Now, how did I know to choose k = 0 here? Well, by definition, the argument of any 
complex number is in the interval (—71, 7]. And so, when applying Fact 193(b), we always 
simply choose k to ensure that arg (zw) € (-71,71]. Here we already have arg z + argw * 
—0.381 + 1.249 ~ 0.869 € (-71,71]. And so, we simply choose k = 0. 


With (a) and (b), we can write zw down in both polar and exponential forms: 


zw x 290 (cos 0.869 + isin 0.869) = V290e9%", 


To write zw down in cartesian form, we can use |zw| and arg (zw) to compute 
Re (zw) * V 290 cos 0.869 = 10.994; and Im (zw) » V290sin 0.869 = 13.005. 


Of course, since the real and imaginary parts of both z and w are all integers, so too 
must be the real and imaginary parts of zw. And so we have in fact Re (zw) = 11 and 
lm (20) = 13. Vhs; zw = + 131. 


Alternatively, we can do the usual multiplication, which yields us the exact value of zw: 


zw = (5-2i)(1+3i) =5 + 15i- 214+ 6 = 114131. % 
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Example 1066. Let 


z=-4+Ti = (-4,7) » V65 (cos 2.090 + isin2.090) = V65e2%, 
w= 1+9i = (1,9) » V82(cos 1.460 + isin 1.460) = V82e"4. 


By Fact 193, (a) |zw| = V65V82; and 


(b) arg(zw) = argz+argw + 2k7 » 2.090 + 1.460 - 27 x -2.733. 


Since arg z + arg w * 2.090 + 1.460 > 71, we choose k = -1. 


With (a) and (b), we can write zw down in both polar and exponential forms: 


zw » 65/82 (cos -2.733 + isin -2.733) = V65V82e 2-732, 


To write zw down in cartesian form, we can use |zw| and arg (zw) to compute 


Re (zw) * V65V 82 cos -2.733 » -67; and Im (zw) » V65V 82 sin —2.733 x —29. 


Thus, zw = -67 — 29i. 

Alternatively, we can do the usual multiplication, which yields us the exact value of zw: 
zw = (-4+ 7i) (1+ 9i) = -4 - 361 + 7i — 63 = -67 - 29. % 

Example 1067. Let 


e221 = (2 1) 5 (cos 2 678 Ficin-2 678) = 508 
w= 1-3i= (1,-3) * V10(cos—1.249 +isin—-1.249) = /10e 17". 


By Fact 193, (a) Jzw| = V5V10 = 5 V2; and 


(b) arg(zw) = argz+argw + 2k7 » -2.678 — 1.249 + 27 x 2.356. 


Since arg z + arg w ® —2.678 — 1.249 < 1, we choose k = 1. 


With (a) and (b), we can write zw down in both polar and exponential forms: 
zw » 5V2 (cos 2.356 + isin 2.356) = 5V2e? 6, 
To write zw down in cartesian form, we can use |zw| and arg (zw) to compute 
Re (zw) * 5V2.cos 2.356 » —5; and Im (zw) * 5V2sin 2.356 * 5. 
Thus, zw =-5+ i. 
Alternatively, we can do the usual multiplication, which yields us the exact value of zw: 


zw = (-2-i) (1-3) =-2+6i-i-3 =-5+5i. % 
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Multiplying a complex number by a positive real number leaves the argument unchanged: 


Corollary 38. Suppose z is a complex number and a>0O. Then arg (az) = arg z. 


Proof. By Fact 193, arg (az) = arga+argz+2k7 = argz + 2k = arg z, where we choose 
k =0 because arg z € (-7, 71]. 














Example 1068. Let z=7-9i, so that 5z = 35 - 451. Then arg z = arg(5z) » -0.910. = 2 





In contrast, multiplying z by —1 changes the argument by either —7t or +71: 


Corollary 39. (a) Jf argz>0, then arg (-z) =argz-7. 





(b) Jf argz <0, then arg (-z) =argz+7. 


Proof. By Fact 193(b), arg (-z) = arg (-1-z) =arg(-1) + arg z+ 2k = + argz + 2k7. 
(a) If arg z > 0, then k = -1 and thus, arg (-z) 2m +argz- 2m =argz- 2m. 











(b) If arg z <0, then k = 0 and thus, arg (—z) tnmt+arez—On=arez+7 





Example 1069. Let z=7-9i, so that -z = -7+9i. Since arg z x —0.910 < 0, by Corollary 
39, we have arg (-z) =argz +7» -0.910+ 7 2.232. % 


Example 1070. Let a=1, b=1+i, c=i, and d=1-\V/3i. Then 


IA 
S 


arg a= arg | 


ae Bil 


- 
N 
\ 


V 


argb= arg(1+i) 


V 


argc= argi 


co| 4 NDIA AIA CO 


IA 


arg d= arg (1 - V3i) = 0. 








And so by Corollary 39, 


arg(-a)= arg(-ll) =arga+7=7, 


= 
arg(-b)= arg(-l-i) =argb-7= =, 


—T 
arg(-c)= arg(-i) =argc-m=— 





9 ’ 
27 
arg (—d) = arg (-1 - v3i) =argd+7= me 
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Combining the last two Corollaries, we have that multiplying a complex number by a 
negative real number changes the argument by either —7 or +7: 

Corollary 40. Suppose z is a complex number and a> 0. 

(a) If argz>0, then arg (-az) = arg (-z) =argz-7. 

(b) Jf argz <0, then arg (-az) = arg (-z) =argz+7. 


Example 1071. Let z=7-9i, so that -5z = -35+45i. Since argz » -0.910 < 0, by 
Corollary 40, we have arg (-5z) = argz +7» —0.910+ 7 ® 2.232. x 


Example 1072. Suppose a = 1, b=1+i, c=i, and d=1-V3i. Then 
arg a= arg t= 0, argc = argi 


argb= arg (1+i)=—>0, argd = arg (1- V3i) = <0. 


And so, by Corollary 40, 


arg (-3a) = arg (-33) = arg (-a) =arga+7=T7, 


—3 
arg (—3b) = arg (-3 - 3i) = arg (—b) =argb-7 = =, 


arg(~Be) =ang(-3i) = ang (-c) = arge—m = =. 


2 
arg (-3d) = arg (-3 + 3V3i) = arg (-d) =argd+7= = 


Exercise 321. For each, find |zw]|, arg (zw), |-2zw]|, and arg (-2zw). Then express both 
zw and —2zw in polar, exponential, and cartesian forms. (Answer on p. 1888.) 


(ajar i==3). (bee =2 wa lec (c) z=-1-3i, w =3+4+4i. 
(d) z=-245i, w =i. (e) z=-l-i,w=-1-2i1. (f) z=-5-3i, w =5-i. 


Exercise 322. This Exercise guides you through a proof of Fact 193(a). Let r = |z|, 
6=argz, s=|w|, and d=argw. (Answer on p. 1889.) 


(a) Express z and w in polar form. 


(b) Expand zw. Then use a trigonometric identity to show that 


zw =rs(cos(6+¢) +isin(@+¢)]. 


(c) Now show that |zw| = |z||w]. 
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85.1. The Reciprocal 


Fact 194. Suppose w is a non-zero complex number. Then 


@ al” ak 


il 
(b) If w is not a negative real number, then arg — = -—argw. 
w 





1 
(c) If w is a negative real number, then arg — = argw =T. 
w 





Proof. (c) is “obvious”. For a proof of (a) and (b), see p. 1649 (Appendices). 











Example 1073. Let 


z= 5-2i = (5,-2) » V29 (cos—0.381 + isin -0.381) = V29e 084, 
w=143i= (1,3) » V10(cos1.249+isin1.249) =+/10e'74%. 


By Fact 194, (a) = and 


il 
and (b) arg— = -argz x 0.381. Also, (a = —— 
(b) arg = = -arg (@) | |= a 


aE 1 i 1 
, ZI 29 
(b) arg — = -argw » -1.249. So, 

w 


l 
x ——(cos0.381 + isin 0.381) 
V 29 


if 
z 
1 
Ww 


i 
x (cos -1.249 + isin -1.249) » 
V 10 


Example 1074. Let w =—-5, so that 1/w =-1/5. Hence, argw = arg (1/w) =7. 
Exercise 323. Find the moduli and arguments of each number and its reciprocal. Then 
write down the latter in exponential, polar, and cartesian form. (Answer on p. 1889.) 


(ae 4a (b)) a= 27. (ce), 2a aly (d) w=-8i. 
(e) z=-2+ 5i. (ft) we-l=i (2). zel=3i. | (a) we3+4, 
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85.2. Division 


Fact 195. Suppose z and w are non-zero complex numbers. Then 


_ Fl. 


and (b) arg —_ arg z-argw + 2k, 
w 


ful 


if arg z—argw > Ti, 


where in (b), if arg z—argw € (-7,71], 





if arg z-—argw < —-7. 














Proof. For (a), see Exercise 325. For (b), see p. 1650 (Appendices). 


Example 1075. Let 


z=5-2i = (5,-2) » 29 (cos—0.381 + isin—0.381) = V/29e 984, 


w=1+3i= (1,3) » V10(cos1.249+isin1.249) = V10e!74%, 


By Fact 195, we have 


Zz 29 
(a) 7 ie 


(b) arg— =argz—-arew + 2km~ —0.381 — 1.249 + 0 » -1.630, 
W 


where we chose k = 0 because arg z — arg w € (-7, 71]. 
With (a) and (b), we can write z/w down in both polar and exponential forms: 


2 4s VJ2.9 (cos 1.630 + isin-1.630) = /2.9e-1 530, 
W 


To write z/w down in cartesian form, we can use |z/w| and arg (z/w) to compute 


FO Aa 2 oicos= ean a0Gin) Nando yim Any 2GGnn = fe snret 700: 
W W 


Thus, Ale Ti. 
W 


Alternatively, we can simply do the usual division, which yields us the exact value of zw: 


pe ign ie pie tae aig 


eee ee Se eee % 
Ta ea) Pes 12432 10 : 
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Example 1076. Let 


z= -447i = (-4,7) » V65 (cos 2.090 + isin 2.090) = V/65e2.0%, 
w= 1+9i = (1,9) » V82(cos1.460 + isin 1.460) = V/82e!4%, 


V65 
By Fact 195, (a) as = Wh 
Ww 


(b) arg— =argz —arew + 2km 2.090 - 1.460 + 0 » 0.630, 
W 


where we chose k = 0 because arg z — arg w € (-71, 71]. 


With (a) and (b), we can write z/w down in both polar and exponential forms: 


e a ae 
x wee (cos 0.630 + isin 0.630) = V'65 0.6301 
V82 


vs, 
w JB2 


To write z/w down in cartesian form, we can use |z/w| and arg (z/w) to compute 


x —— cos 0.630 x 0.719 and Im — x» —=sin 0.630 » 0.525. 


2 /65 2 V65 
i ay/ 89 iu? 4/89 


Thus, # 0.719 + 0.5251. 


Zz 
w 
Alternatively, we can simply do the usual division, which yields us the exact value of z/w: 


2 4+ Ti 4+ 711-91 -44+361+71+63 594431 59 AS. 


ee a ee ce ts 
w 149i 1+9i 1-91 124+ 92 82 82 82 
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Example 1077. Let 


z= -2-i= (-2,-1) » V5 (cos—2.678 + isin-2.678) = V5e72-6%, 
w=1-3i= (1,-3) = V10(cos-1.249 +isin-1.249) = /10e17. 


5 1 
By Fact 195: (a) |= ae 


wl” Vio” V2 


(b) arg— =argz—-arew + 2k» -2.678 + 1.249 + 0 » -1.429, 
W 


where we chose k = 0 because arg z — arg w € (-71, 71]. 


With (a) and (b), we can write z/w down in both polar and exponential forms: 
a 
9/2 


To write z/w down in cartesian form, we can use |z/w| and arg (z/w) to compute 


1 ' 
Cos = 1420 isin 1.190) =] ar, 
( ) ee 


cos -1.429 x 0.100 and Im — # ze sin —1.429 x —0.700. 


i] 


oo 

J2 D 
Peat 
W 


Alternatively, we can simply do the usual division, which yields us the exact value of z/w: 


ie ie lee 


= a= 5 2 = = =().1=—0.71 x 
(a ee ES eee 12 + 32 10 





Exercise 324. For each, find |z/w| and arg (z/w). Then express z/w in polar, exponen- 
tial, and cartesian forms. (Answer on p. 1890.) 


(ay aaa we=3. (b)2S2) wel. (c) z=-1-3i, w =3+4i. 
(d) z=-2+45i, w =i. (e) z=-l-i,w=-1-2i1. (f) z=-5-3i, w =5-i. 


Exercise 325. Use Facts 193 and 194 to prove Fact 195(a). (Answer on p. 1890.) 
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Part V. 
Calculus 





Revision in progress (November 2021). 


And hence messy at the moment. 
Appy polly loggies for any inconvenience caused. 
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The calculus was the first achievement of modern mathematics, and it is difficult 
to overestimate its tmportance. 


— John von Neumann (1947). 


if you forget about understanding what’s going on and concentrate on mechanical 
manipulations, you'll forget how to do even the mechanical manipulations. 


— Timothy Gowers (2012). 


Zudem ist es ein Irrtum zu glauben, daf die Strenge in der Beweisfthrung die 
Feindin der Einfachheit ware. An zahlreichen Beispielen finden wir im Gegen- 
teil bestatigt, daB die strenge Methode auch zugleich die einfachere und leichter 
faBliche ist. 


Besides it is an error to believe that rigor in the proof is the enemy of simplicity. 
On the contrary we find it confirmed by numerous examples that the rigorous 
method is at the same time the simpler and the more easily comprehended. 


— David Hilbert (1900, 1902t). 
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86. Limits 


The idea of limits isn’t on your syllabus.?” But it is fundamental to calculus. And it 
really isn’t all that difficult, especially if presented in informal and intuitive terms (as I try 
to do here). It is therefore worthwhile investing a little time in it, just so things become 
that much clearer.?”! 


Ch. 25 already briefly introduced you to limits (in the context of asymptotes—vertical, 
horizontal, and oblique). This chapter will look at limits again, but at greater depth. 








3Tonoring the Central Limit Theorem, the word limit appears on your syllabus only once (p. 9, “concept 
of definite integral as a limit of sum”), almost in passing, and solely in relation to the definite integral. 

3"1To keep things simple, we discuss only functional limits (and not sequential limits). 

3”2We already discussed this “phenomenon” in my Preface/Rant—see p. xlix. 
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86.1. Limits, Informally Defined 


Example 1078. Define f :R > R by f (2) = 27-1. 


Ne 








Informal observations: 


For all values of x that are “close” but not equal to 2, 
f (x) is “close” (or possibly even equal) to 3. 


on By making x “sufficiently close” but not equal to 2, 
f (x) can be made as “close” as we like to 3. 


373 


Either of the above two informal statements’'’ may be written formally as 


lim f (2) = 3, 
which we read aloud as, The limit of f at 2 is 3. (2) 


Or, As x > 2, f (x) > 3, (3) 


which we read aloud as, As x approaches 2, f (7) approaches 3. (4) 


These last four statements are equivalent and formal. You need not know their precise 
meaning. An intuitive and informal understanding will suffice. 





You may be thinking, “The above example is quite silly or useless. We already know that 
f (2) = 3. So what have we added by also writing lim f (7 )eg7" 


And you'd be right. The above is just our very first introductory example to limits. To see 
why limits aren’t so silly or useless, we’ll have to look at more examples: 


Subtle Point 1: The condition not equal to is important. When considering the limit of 
a function g at a, we do not care about g(a), the value of the function at a. We only care 





373Tn particular, the terms “close” and “sufficiently close” are informal and hence in scare quotes. 
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about the values of x that are “close” to a: 


Example 1079. Define g: R > R by 


= 10. aor es 2) 


g(x) =4" 
0 


tory = 2: 


The function g is very similar to the function f (last example), except that now g (2) = 0 
and we have a “hole” in our curve. As we’ll learn later, this is an example of a removable 
discontinuity. 





Observe though that these two informal statements remain true: 


For all values of x that are “close” but not equal to 2, 
g(x) is “close” (or possibly even equal) to 3. 


By making x “sufficiently close” but not equal to 2, 
g(x) can be made as “close” as we like to 3. 


So, each of these four equivalent and formal statements remains true: 


(1) lim g (a= a3: (3) Asx —>2, g(x) >3. 
(2) The limit of g at 2is3. (4) Asx approaches 2, g(x) approaches 3. 





Subtle Point 2: Actually, the condition “not equal to” goes even further—even if h(a) is 
undefined, lim h(a) may still be well-defined: 


819, Contents www.EconsPhDTutor.com 


Example 1080. Define h: R \ {2} > R by h(x) = 27-1. 
The function h is very similar to g, except that h (2) is now simply left undefined. 








Observe that these two informal statements remain true: 


For all values of x that are “close” but not equal to 2, 
h(a) is “close” (or possibly even equal) to 3. 


By making x “sufficiently close” but not equal to 2, 
h(a) can be made as “close” as we like to 3. 


So, each of the following four equivalent and formal statements remains true: 


(1) lim h (a) = 3; (3) Asx>2, h(r) 33. 
(2) The limit of h at 2is 3. (4) As x approaches 2, h(x) approaches 3. 


Definition 195 (informal). These four statements are equivalent and formal: 
(1) lima (ee (3) Asxz—a, f(r) > L. 
(2) The limit of f atais LZ. (4) Asx approaches a, f (x) approaches L. 


Informally, each of (1)—-(4) says,?“ 


For all values of x that are “close” but not equal to a, 
f (2) is “close” (or possibly even equal) to L. 


By making x “sufficiently close” but not equal to a, 
f (x) can be made as “close” as we like to L. 








34For a formal definition of limits, see Ch. 146.2 (Appendices). 
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Exercise 326. The function 7: R > R is graphed below. Find the limits of 2 at 0, 1, 2, 
and 3. (Answer on p. 1891.) 
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86.2. Examples Where The Limit Does Not Exist 
To better understand limits, we now look at examples where the limit does not exist: 


Example 1081. Define 7 : R > R by 


0 for x <0, 
j(x) = 
fore >0: 





Consider lim 7 (x), the limit of 7 at 0. 


Could it be that lim j (x) =-1? No, because there are values of x that are “close” to 0 


but for which j (x) = 1. Hence, it is not true that for all values of x that are “close” but 
not equal to 0, 7 (x) is “close” to -1. So, 


lim j (x) #-1. 
r0 


Next, could it be that lim j (x) =1? No, because there are values of x that are “close” to 
Cc 


0 but for which 7 (x) = -1. Hence, it is not true that for all values of x that are “close” 
but not equal to 0, 7 (x) is “close” to 1. So, 


lim j Gaye ae 


We’ve just argued that lim j (x) does not equal either -1 or 1. Let’s now also argue more 
Cc 


generally that lim j (x) cannot be equal to any real number L. This is because for values 


of x that are “close” but not equal to 0, there is no single real number L that j (x) is 
“close” to. 


So, we conclude lim j (x) does not exist. 
[> 


As we'll learn later, this is an example of a jump discontinuity. 


(Example continues on the next page ...) 
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We now introduce also the concepts of a left-hand limit and a right-hand limit. These 
concepts are simple and will improve your understanding. 


(... Example continued from the previous page.) 

These four statements are equivalent, formal, and true: 

iL jim j (x) = -1. 

2. The left-hand limit of 7 at 0 is -1. 

3. Asx>0°7,j(2) > -1. 

4. As x approaches 0 from the left, j (x) approaches -1. 


Informally, each of the last four statements says this: 


For all values of x that are “close” to but less than 0, 
j (2x) is “close” (or possibly even equal) to —1. 


Or By making x “sufficiently close” but less than 0, 
7 (x) can be made as “close” as we like to -1. 


Similarly, these four statements are equivalent, formal, and true: 


1. lim J Cc ee & 


2. The right-hand limit of 7 at 0 is 1. 
3. Asx >0*, j(2) > 1. 
4. As x approaches 0 from the right, j (x) approaches 1. 


Informally, each of the last four statements says this: 


For all values of x that are “close” to but more than 0, 
j (x) is “close” (or possibly even equal) to 1. 


By making x “sufficiently close” but more than 0, 
7 (x) can be made as “close” as we like to 1. 
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Definition 196 (informal). These four statements are equivalent and formal: 


eulitahy (3) =a. 

. The left-hand limit of f at a is L. 
-Asz>a,f(r)>L. 

. As x approaches a from the left, f (x) approaches L. 


Informally, each of the above four statements says,?” 


For all values of x that are “close” but less than a, 
f (2) is “close” (or possibly even equal) to L. 


By making x “sufficiently close” but less than a, 
f (x) can be made as “close” as we like to L. 


Definition 197 (informal). These four statements are equivalent and formal: 


. lim f (x) = ZL. 

. The right-hand limit of f at a is L. 

-Asz>a’, f(z) >L. 

. As x approaches a from the right, f (x) approaches L. 


Informally, each of the above four statements says,>” 


For all values of x that are “close” but more than a, 
f (2) is “close” (or possibly even equal) to L. 


By making x “sufficiently close” but more than a, 
f (x) can be made as “close” as we like to L. 





Not surprisingly, the limit of f at a is L if and only if the left- and right-hand limits of f 
at a are both also L: 


Fact 196. Suppose f is a nice function. Then 





lim f (x) = = Ligaen lim, f (x). 











Proof. See p. 1655 (Appendices). 











For a formal definition of left-hand limits, see Definition 320 (Appendices). 
3’6For a formal definition of right-hand limits, see Definition 320 (Appendices). 
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Example 1082. Continue to define 7: R > R by 


, USS itor 4-00) 
j(x)= 
b fora 0: 


NY 


i 
1eo— oO 








The left- and right-hand limits of 7 at 0 do exist: 


lim J (x) =-1 and lim. J Caper 


However, they are not equal. And so by Fact 196, the limit of 7 at 0, lim j (x), does not 
exist. 
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1 
Example 1083. Define k:R \ {0} > R by k(x) =. 
2 


IX 
Y 








x 


The limit of k at 0—or lim ki (x)—does not exist because there is no number L such 
Cc 
that 


For all values of x that are “close” but not equal to 0, 
k (x) is “close” (or possibly even equal) to L. 


Nonetheless, we are allowed to say that “the limit of & at 0 is infinity” and write 
Iba Wig ee oe 
r>0 


(As we learnt in Ch. 25, we’ll also say that the vertical line x = 0—also the y-axis—is a 
vertical asymptote for the function k.) 


Here you may be confused—we’ve just made two seemingly contradictory statements: 
« “The limit of k at 0 does not exist.” (Or, “lim k (x) does not exist.”) 

¢ “The limit of & at 0 is infinity.” (Or, “lim k (2) = ¢o,”) 

But strangely enough, both of the above statements are true. How can this be? 


The key here is to recall the point made and emphasised on p. 52—oo is not a number. 
Instead, it is merely an occasionally convenient symbol. 


We’ve written, “The limit of k at 0 is infinity,” or, “lim k (to 
Ope 


Importantly, = is simply convenient shorthand for this informal statement: 
As x “approaches” 0, k (2) “grows” without upper bound.?” 


Importantly, = does not say that lima k (x) is equal or identical to some object called oo. 
ia 





Indeed, by writing =, we do not even commit to the existence of an object called co. 
‘ O a! VW V On O 


Example 1084. Define m: Rv. {0} > R by m(az) = 1/z. 








The limit of m at 0—or lim m («)—does not exist, because there is no number L such 
Cc 
that 


For all values of x that are “close” but not equal to 0, 
m(z) is “close” (or possibly even equal) to L. 


It is likewise true that the left- and right-hand limits of m at 0—or lim. m(a) and 


lim m(a)—do not exist, because there is no number L such that 
x—0* 


For all values of x that are “close” For all values of x that are “close” 
to but less than 0, m(zx) is “close”. or ~~ to but more than 0, m(z) is “close” 
(or possibly even equal) to L; (or possibly even equal) to L. 


Yet at the same time, we may write 


lim m(z)=-0o and lim m(x)20. 
xz>0- xz>0t 


(So again, x = 0 or the y-axis is a vertical asymptote of m.) 


Again, neither = nor 2 acknowledges the existence of objects called —oo or oo. Instead, 
informally, each simply says, 


1. As x “approaches” 0 “from the left”, m(x) “keeps growing towards —00”. 


2. As x “approaches” 0 “from the right”, m (x) “keeps growing towards 00”. 


By the way, in contrast with the previous example, these two statements are false: 


lim m (2) = 00. x lim m (a) = —00. x 
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Example 1085. Recall that the natural logarithm function In has domain R’*. 


IN 
Y 








Consider the limit of In at -1 or lim Inz. 


Lo 
One key motivation for the concept of limits is to help us understand how a function 
behaves “near” a point. 


If a function is undefined “near” a point, then there is nothing to understand. In which 
case, it makes sense to simply say that the limit does not exist at that point.?® 


So, here for example, since In is undefined “near” -1, it makes sense to simply say that 
lim Inz does not exist. 


a= 
Indeed, given any negative number a, it is likewise true that limInx does not exist. 
wa 


Again, the reason is that In is undefined “near” any negative number a. 


By the way, what is the limit of In at 0? Following our previous examples, we observe 
that the right-hand limit of In at 0 is -oo. We can write this as 


: 1 
lim In wz = —oo 
xz->0* 


Or informally, as x “approaches” O “from the right”, Inz “keeps growing towards —0o”. 
(By the way, does lim Inzx exist?)?” 
x—0t 


It is also true that for all values of x that are in the domain of In, as x “approaches” 0, 
Inz “keeps growing towards —oo”. So—and this is a bit subtle—it is also true that 


: 2 
lim In x7 = —oo., 
xz-0 


pee 





(Again, does lim In x exist? 
7 





3"8This sentence is a little informal, imprecise, and incorrect. More formally, precisely, and correctly, we 
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We now revisit the Dirichlet function (last examined on p. 236): 


Example 1086‘The Dirichlet function 
d:R— R is defined by: 

1 for xeQ, 

0 forxr¢Q. 


The graph of d contains the 
point (7,1) for every xe Q. 


The graph of d contains the 
point (z,0) for every x ¢Q. 


x 





Observe that for every real number a, limd(), does not exist. 
wa 


To see why, consider any rational number, say 2. Since 2 is rational, we have d(2) = 1. 


However, for values of x “near” 2, there is no number L that d(a) stays “close” to. 
Instead, “near” 2, d(a) takes on the values 0 and 1 “infinitely often”. And so, 


lim d (a) does not exist. 


=> 


Next, consider any irrational number, say V2. Since V2 is irrational, we have d (v2) 


However, for values of « “near” 2, there is no number L that d(x) stays “close” to. 
Instead, “near” V2, d (x) takes on the values 0 and 1 “infinitely often”. And so, 


lim_d(a) does not exist. 
r>y/2 








should instead say that it makes sense to define limits only at points that are “near” other points in the 


function’s domain (see Appendices, in particular Definitions 317 and 319). 
TENG; lim Inz does not exist. Again, this does not contradict 2, which is merely shorthand for a more 


precise and formal statement whose meaning we’ve given informally as, “as x ‘approaches’ 0 ‘from the 
right’, Ina ‘keeps growing towards —0o’”. 
380 A gain, no, lim Ina. And again, this does not contradict 2. 
@ 


829, Contents www.EconsPhDTutor.com 


Our next example is perhaps even stranger: 


Example 1087. Define f :R— R by 


il 
sin — for a = (), 
f(z)=, f 
0 Lon wy =). 


This is a very strange function indeed. Like sin, f takes on values between -1 and 1. 
But as x gets “closer” to 0, f (a) fluctuates ever more rapidly between -1 and 1. (Why?) 


Indeed, when we’re very “close” to 0, it’s impossible to accurately depict the graph of f. 


for 2 = (), 


oe 
f (x)= x | 
0 tor 2 =O): | 


| 


Observe that for all values of x that are “close to” but not equal to 0, there is no number 
L that f (x) is “close to”. Instead, when x is “close to” 0, f (a) takes on every value in 
[-1,1] “infinitely often”! So, “near” 0, there is no number L that f (x) stays “close to”. 


In other words, lim f (x) does not exist. 
Exercise 327. Consider the function f : IR > R defined by 


Oe for ars (), 


for 7 > 0): 





What are lim. f (Zz). lim f (x), and lim f (x)? (Answer on p. 1891.) 
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86.3. Rules for Limits 


Happily, the usual arithmetic operations are preserved when we take limits, so that we have 
the following simple, “obvious”, and predictable Rules for Limits: 


Theorem 23. (Rules for Limits) Let f and g be nice functions; and k,L,M ¢€ R. 
Suppose lim f (x) = L and limg (x) = M. Then 


(a) lim [kf (x)] (Constant Factor Rule for Limits) 


(ob) lim | fF ieyegia) = (Sum and Difference Rules for Limits) 


(c) lim Lea en) (Product Rule for Limits) 


(d) lim a (for M #0) (Reciprocal Rule for Limits) 






(e) im (for M +0) (Quotient Rule for Limits) 


(Constant Rule for Limits) 


(Power Rule for Limits) 











Proof. See p. 1660 (Appendices). 





Example 1088. Let k =5. Define f,g: R—R by 
f(v)=27+1 and G@javz. 
Let’s find the limits of f and g at 1 using the above Rules: 
lim f (2) = lim (2? +1) = lim 2° + lim 1 = lim 2” +121241=2. 
lim g (a= lim Vx £95 2], 


Also by the above Rules, we have 


1 
a) lim|kf (a £5x2=10 im —— =1 (since limg (2) #0 
Holl well 


(b) lim [f (v) +9 (x)]=2+1=3 li ees ey, (since lim g (x) + 0) 


lim [ f(a) -g(#)J=2-1=1 





lim [/ (x) 9(x)] #2 «1 =2 
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87. Continuity, Revisited 


Ch. 18 briefly discussed the concept of continuity and stated that a function whose entire 
graph can be drawn without lifting your pencil is continuous. 


Now that we have an intuitive grasp of limits, we can formally and precisely define conti- 
nuity: 


Definition 198. Let f be a nice function whose domain contains the point a. We say 
that f is continuous at a if 


lim f («) = f (a), 
or if a is an isolated point of poe 


Example 1089. Define f :R—> R by f (x) = 27-1. 


Vs 








The limits of f at -1, 0, and 2 are 


lim, f (a) =), lim f (z)=-1, and lim f (ia a: 


The values of f at -1, 0, and 2 are 


7 (1) =0, 7 (0) ==.) cand! 47,2) = 3: 


Conclude: f is continuous at -1, 0, and 2, because 


lim f(e)=F(-1), lim f(x) = (0), and lim f (e) = f (2). 








382Vou can treat the fine print here as an annoying technicality that doesn’t matter for A-Level Maths. Ch. 
87.7 (optional) briefly discusses this technicality. 
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Example 1090. Continue with the last example. 


Ny 








Above, we concluded that f is continuous at -1, 0, and 2, because 


lim f(@)=/), lim f(@)=F@, and lim f(@)= f2). 


It turns out that for every a € Domain f = R, it is also true that 
lim f (a) = f (a). 


So, f is continuous at every point a in its domain R. 


By Definition 199 then, f is (a) continuous on R; and (b) a continuous function. 
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87.1. Functions with a Single Discontinuity 


Definition 200. Let f be a nice function and a ¢ Domain f. If f is not continuous at a, 


then we say that f is discontinuous at a (or f has a discontinuity at a). 
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Example 1091. Define g:R->R by, 
Yy 


—1, fOr 2, 
g(x) = 
for =. 








The function g is almost identical to f in the last example. The only difference is that 
whereas f (2) = 3, we now have g (2) =0 and we have a “hole” at (2,3). 


It is again the case that g is continuous at —1 because 
jim g Ga 0, g(-1)=0, and hence jim g (oe) =o (1). 
Similarly, g is continuous at 0 because 
lim g (x) =-1, g(0)=-1, and hence lim g (a\= 0 (0). 


However, g is not continuous at 2 because 


lim g (2) =, but” @(2)=0) so that lim g (eo) 


We say that g is discontinuous (or has a discontinuity) at 2. 
It turns out that g is continuous at every point in its domain R except at 2. 
Since g has a single discontinuity (namely at 2), it is not a continuous function. This 


is because the existence of a single discontinuity disqualifies a function from being called 
continuous. 


Nonetheless, we may say that g is continuous on Rv {2} or (—00, 2)U(2, 00). (Equivalently, 
g is continuous everywhere except at 2.) 

It turns out that this discontinuity is an example of a removable discontinuity. Informally, 
a removable discontinuity is a “hole” at (2,3). 


Informally, we could “patch” this “hole” by simply moving the “black dot” at (2,0) up 
to (2,3) (where the “hole” is)—if we did this, then the new function thus created would 
be continuous. 
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Two more examples of removable discontinuities: 


Example 1092. Define h: R > R by 


f 
ina) = te, or 2 2-0), 


dt for 7 =, 


Figure to be 
inserted here. 


The function h has a removable discontinuity at 0. 


Example 1093. Define 7: R > R by 


—2, for x < 0, 
i(z)=41 tor =, 


ee for a > 0. 


Figure to be 
inserted here. 





The function 2 has a removable discontinuity at 0. 


We've just looked at three examples of removable discontinuities. We now look at examples 
of jump discontinuities: 
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Example 1094. Define 7 : R > R by 


—1 for x < 0, 
j(x)= 
1 ton ae 20), 


NY 


j 
1eo OO 








We have j (0) = 1. However, lim j (x) does not exist. 


Since lim j (x) #7 (0), 7 is discontinuous (or has a discontinuity) at 0. 

4 ba 
Like the last three examples, 7 is again continuous everywhere except at a single point 
(namely 0). And so again, 7 is not a continuous function. 


But unlike the last three examples, here we have instead a jump discontinuity. Infor- 
mally, a jump discontinuity is where the function “jumps”. 


A jump discontinuity is “worse” than a removable discontinuity because it’s “harder to 
fix”. A removable discontinuity can be “fixed” by simply “patching a hole”. 


Here we see (at least) two ways to “fix” the jump discontinuity: Shift (i) j’s left half 
upwards; or (ii) its right half downwards. 





Two more examples of jump discontinuities: 
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Example 1095. Define 7: R > R by 


roe 3 for p< 1, 


se force 2 


Figure to be 
inserted here. 


The function 2 has a jump discontinuity at 1. 


Example 1096. Define k : R > R by 


k(a)= 4") 
le for 2 >, 


for <0, 


Figure to be 
inserted here. 


The function & has a jump discontinuity at 0. 





We’ve just looked at removable and jump discontinuities. There’s also a third type of 
discontinuity that’s simply a catch-all category for “everything else”—if a discontinuity is 
neither removable nor jump, then we call it an essential (or infinite) discontinuity. 


You don’t need to know the formal definitions for A-Level Maths. But they aren’t difficult 
to understand, so [ll just put them here: 


Definition 201. Suppose the function f has a discontinuity at a. Then this discontinuity 
is called a 


(a) Removable discontinuity if lim f (2) exists; 


(b) Jump discontinuity if lim f(a) and lim f (x) exist but are not equal; or 
r>a7 rat 


(c) Essential (or infinite) discontinuity if it is neither removable nor jump. 





Example of an essential (or infinite) discontinuity: 
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Example 1097. Define f :R — R by 


il 
in — Lora = () 
ae 


f(x)=\" 
0 tor % =U; 








As explained in Example 1087 (previous chapter), 


lim f (x) does not exist. 
C— 


So, f is discontinuous at 0 and is a discontinuous function. 


Nonetheless, f is continuous everywhere except at a single point (namely 0). That is, f 
is continuous on (-00,0) U (0, ©) or R x {0}. 


It is possible to prove that the discontinuity at 0 is neither removable nor jump: 


Since lim f (x) does not exist, by Definition 201, it is not a removable discontinuity. 
Cc 


It is also possible to show that lim f (x) and lim f (x) do not exist,*®* so that again by 
r>0- oe 
Definition 201, it is not a jump discontinuity either. 


Since this discontinuity is neither removable nor jump, by Definition 201, it is an essential 
(or infinite) discontinuity. 





From Definition 201, it is not difficult to prove this result: 


Fact 197. Suppose f is a nice function whose domain contains the point a. Then 


f has an essential discontinuity at a 





At least one of lim f (x) and lim f(x) does not exist. 


Proof. ( <= ) If at least one of lim f(x) and lim f(x) does not exist, then f has a 
tag zat 
discontinuity at a that is neither removable nor jump and is hence essential. 


(= = by contrapositive) If both lim f (x) and lim f (x) exist, then f is either continuous 
r>aq zat 





383We do so in Example 1583 (Appendices). 
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at a or has either a removable or jump discontinuity at a. 





Two more examples of essential discontinuities: 


Example 1098. Define g: R > R by 


g(x) = 


Figure to be 
inserted here. 


We have g (0) =0, but lim g (x) does not exist. 

Moreover, lim g(x) and lim, g(x) do not exist (though we can write lim. g(x) = -0o0 and 
limg' (4) = eo}. 

xz>0t 


And so by Fact 197, g has an essential discontinuity at 0. 


Example 1099. Define h: R > R by 


for 7 <0), 


h(x) =" 
4G = 
sin — for x > 0. 
£ 


Figure to be 
inserted here. 


We have h (0) = 0. 
Here, the left-hand limit of h at 0 does exist: lim ni{z)=0, 


However, the right-hand limit of h at 0—or lim, h (a2)—does not exist. 





And so by Fact 197, h has an essential discontinuity at 0. 
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A329. XXX 
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87.2. An Example of a Function That Is Discontinuous Everywhere 


Each function examined in the previous subchapter had exactly one discontinuity. Here 
now is an example of a function that is discontinuous everywhere: 


Example 1100. We revisit the Dirichlet function d.**4 


The Dirichlet function “ay 


d: IR -— R is defined by: Z 


LS tor 3 eG), 
0 forxr¢Q. 


aa) -= 


The graph of d contains the 
point (2,1) for every xe Q. 


The graph of d contains the 
point (7,0) for every x ¢Q. 


x 





Observe that for each a € R, d(a) is defined (i.e. is equal to a real number); but limd (x) 


does not exist; so, d(a) # limd(z). 


Hence, d is discontinuous at every point in R. Equivalently, d is discontinuous everywhere. 


Using Definition 201, it is possible to prove that none of these discontinuities is either 
removable or jump. So, each is an essential (or infinite) discontinuity. 


Hence, we may say that d is essentially (or infinitely) discontinuous everywhere. 
Or equivalently, d has an essential (or infinite) discontinuity at every point a€R. 








3847 ast examined on p. 829. 
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87.3. Functions That Seem Discontinuous But Aren’t 
Subtle point: 
Continuity and discontinuity are defined only on a function’s domain. 


That is, a function may be said to be continuous or discontinuous only at points in its 
domain. If a point isn’t in the function’s domain, then the function is neither continuous 
nor discontinuous at that point. 


In each of the next three examples, we are tempted to say that the function has at least 
one discontinuity. It turns out though that each function is continuous at every point in 
its domain and is therefore a continuous function! 


Example 1101. Define the function h:R\ {2} > R by h(x) = 27-1. 


We are tempted to say that h is discontinuous at 2. But this would be wrong because 2 
is not in the domain of h (equivalently, h is not defined at 2). 


And so, hf is neither continuous nor discontinuous at 2. 


4a 








Indeed, perhaps surprisingly, h is a continuous function because h is continuous at every 
point in its domain. 
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Example 1102. Define the function f:R\. {0} > R by f (#) =-. 
z 


We are tempted to say that f is discontinuous at 0. But this would be wrong because 0 
is not in the domain of f (equivalently, f is not defined at 0). 


And so, f is neither continuous nor discontinuous at 0. 


Figure to be 
inserted here. 


Indeed, perhaps surprisingly, f is a continuous function. That is, f is continuous at every 
point in its domain. 


Example 1103. As noted in the previous chapter, lim, Inz does not exist or is un- 
tw 
defined. 


We are tempted to say that In is discontinuous at —1. But this would be wrong because 
—1 is not in the domain of In (which is R*). (Equivalently, In is not defined at -1.) 


And so, In is neither continuous nor discontinuous at —1. 


IN 
Y 








Indeed, given any a € (—oo,0], In is neither continuous nor discontinuous at a. 


Nonetheless, In is a continuous function because it is continuous at every point in its 
domain R*. 
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Exercise 330. State whether each of the following graphed functions is continuous. If 


not, identify any discontinuities and their type. (Answer on p. 846.) 
A330. XXX 
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87.4. Every Elementary Function Is Continuous 
We reproduce from Ch. 37 this definition: 


Definition 202. An elementary function is 


a polynomial function, a trigonometric function, 


an inverse trigonometric function, a natural logarithm function, 


an exponential function, a power function, 


any arithmetic combination of two elementary functions, 


or any composition of two elementary functions. 





Recall that most functions we'll encounter in H2 Maths are elementary. And very happily, 


Theorem 24. Every elementary function is continuous. 


O 


Proof. See Ch. 87.6. 


In sum, 


e« Most functions we’ll encounter in A-Level Maths are elementary. 
e All elementary functions are continuous. 


e« Therefore, most functions we’ll encounter in A-Level Maths are continuous. 


Example 1104. XXX 
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87.5. Continuity Allows Us to “Move” Limits 


Example 1105. Suppose we are asked to evaluate lim sin 27. 
Bie 


We are tempted to do this: 
lim sin x? = sin (1im 2°) =sinW=W. 
x0 x0 


It turns out that the above 7s correct. 


However, = requires justification. Why is it that we can simply “move” the limit in? 





One of the (many) nice things about continuity is the next result: When taking the limit 
of a composite function, we can “move” the limit in if the “outer” function is continuous. 
This justifies = in the above example—since the “outer” function sin is continuous, we can 
“move” lim in. 
x20 
Theorem 25. Let f and g be nice functions such that the composite function fg is 
well-defined. Let be R. Suppose lim g (x) =b and f is continuous at b. Then 


lim f (9 (2)) = f (@). 

















Proof. See p. 1667 (Appendices). 
In the previous subchapter, we saw that all elementary functions are continuous. Since 
(almost) all functions we’ll ever encounter in H2 Maths are elementary, this means that 
happily enough, we can (almost) always apply the above result: 
Example 1106. Consider lim in, 

£—> 
By the Power Rule, lim x = 1. 

ct 

Since In is continuous at 1, by Fact 25, we can “move” the limit in: 


lim Inz@l1ln (tim r] =n io=0): 


Example 1107. Consider lim [sin (In) ]. 
The previous example already showed that lim ie =) 


Since sin is continuous at 0, by Fact 25, we can “move” the limit in: 


lim [sin In )] @1sin (tim In r] =i) =), 





848, Contents www.EconsPhDTutor.com 


Example 1108. Consider lim (x+Inz)’. 


Using the Sum Rule for Limits and what we found in above two examples, 
lim (x+Inz) = lim a + lim In « =140=1, 
Ly ieee r— 


Since the squaring function is continuous at 1, by Fact 25, we can “move” the limit in: 


2 
lim (c+Inz)’ @1 im (x + Inz)| ate a 





Note that if the “outer” function is not continuous at the given point, then Fact 25 does 
not apply and we may not be able to “move” the limit in: 


Example 1109. Define f,g: R>R by g(a) =x and 


1 fore +), 
f(z) = ; 


for x = 0. 


Figure to be 
inserted here. 


Then the composite function f og:R- R is defined by 


for x #0, 


(fog) (x)= f(g(2)) =f (2) = 


tor ¢= (0. 


Now, observe that 
lim f (g(#)) = lim f (x) = 1. 
However, if we try to “move” the limit in as usual, we get something else: 
kim F (9(0)) 22, (lim 9 (x)) = F (0) =2 


The step @? is wrong and does not work, because f is not continuous at lim g (=O) 
«= 





Happily, most functions we’ll encounter in H2 Maths are continuous. So the above example 
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is not likely to be a problem you'll ever encounter. 


Exercise 331. Find (a) lim (v +1)° and (b) lim sin (cos 2”). (Answer on p. 1892.) 
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87.6. Every Elementary Function is Continuous: Proof (Optional) 


In this optional subchapter, we’ll try to prove Theorem 24, i.e. that every elementary 
function is continuous. 


“Obviously”, 


Fact 198. Every constant function is continuous. 


Proof. This result may seem “obvious”, but still requires a proof. This is given on p. 1665 
(Appendices). 














Figure to be 


inserted here. 





Also “obviously”, 


Fact 199. Every identity function is continuous. 


Proof. Again, this result may seem “obvious”, but still requires a proof. This is given on 
p. 1665 (Appendices). 














The next result, which you may also find “obvious”, says that continuity is preserved? 
under the four basic arithmetic operations and scalar multiplication: 


Theorem 26. Suppose f and g are continuous ataéR. Then so too is each of these 
functions: 


(a) feg — (b) fg (©) (provided g(a) +0) (A) ef 














Proof. See p. 1666 (Appendices). 





Using the above “obvious” results, we can easily prove that 


Fact 200. Every polynomial function is continuous. 


Proof. Let D be a set of real numbers; cp, C1, ..., C, be real numbers; and n be a non- 
negative integer. 

Define f : D > R by f (x) = co + x + Cox? +--+ +c,2". Our goal is to show that f is 
continuous. 





385Qr closed. 


851, Contents www.EconsPhDTutor.com 


Define g,h: D>R by g(x) =2 and h(2) = cp. 
By Fact 198, fh is continuous. 

By Fact 199, g is continuous. 

By Theorem 28(b), (9) = g-g is also continuous. 


Similarly, for any k = 1,2,3,..., the repeated application of Theorem 28(b) shows that 
(g)" =a) tase g is also continuous. 


Now, observe that 


f (x) = h(a) + erg (a) + 02 (9) (x) +--+ + en (g)" (2). 


Since f may be written as the sum of n+1 continuous functions, by the repeated application 
of Theorem 26(a), f is also continuous. 














Example 1110. Define f,g:R—R by f (x) = 2? +1 and g(x) = -972" + 10x? — x7 +5. 


Figure to be 
inserted here. 


Both f and g are polynomial functions. And so, by Fact 200, both f and g are continuous 
functions. 


Fact 201. The sine and cosine functions (sin and cos) are continuous. 


Proof. We will prove this on p. 1013. 

















Figure to be 


inserted here. 





Fact 202. The natural logarithm function, ln, is continuous. 


Proof. We will prove this in Ch. 111. 














The next result says that the inverse of a continuous function defined on an interval 
is also continuous: 
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Theorem 27. Let D be an interval and f : D > R be a one-to-one function. 


continuous, then so too is its inverse f-!. 








Proof. See p. 1670 (Appendices). 











Example 1111. The function f : IR > R defined by f (x) = x +1 is continuous. 


Figure to be 
inserted here. 


It is also one-to-one, with inverse f~': IR > R defined by f7!(x) = 2-1. By Theorem 27 
then, f-! should be continuous, as indeed it is. 


Example 1112. The function g: IR > R defined by g(z) = x° is continuous. 


Figure to be 
inserted here. 


It is also one-to-one, with inverse g™! : R > R defined by g™' (x) = Wx. By Theorem 27 
then, g~! should be continuous, as indeed it is. 





Note the subtle but important requirement in Theorem 27 that the function’s domain be 
an interval. If f is continuous and one-to-one but its domain is not an interval, then its 
inverse f-! may not be continuous, as this counterexample shows: 


853, Contents www.EconsPhDTutor.com 


Example 1113. Define h: [0,1) vu [2,3] > [0,2] by 


f 1 
Ge x or x €[0,1), 
x—-1 for @e|/2,.3). 


Figure to be 
inserted here. 


The function h is continuous. Moreover, it is one-to-one, with inverse h7! : [0,2] > 
[0,1) U[2,3] defined by 


ho (2) = x for x €[0,1), 
r+1 fora e (1, 2]. 
Note though that the domain of h is not an interval, so that the conclusion of Theorem 


27 may not hold—i.e., h>! may not be continuous. And indeed, here we clearly see that 
h- isn’t continuous. 





Facts 201 and 202 said that sin, cos, and In are continuous. Now, recall that 


™ 71 
¢ sin! is the inverse of sin restricted to the interval E |. 


« cos’! is the inverse of cos restricted to the interval [0, 7]. 


e exp is the inverse of In, which is defined on the interval (0, 00). 


And so, by Theorem 27, 


1 


, cos !, and exp are continuous. 


Corollary 43. The functions tan, cosec, sec, cot, and tan are continuous. 





Corollary 42. The functions sin™ 














Proof. See Exercise 332. 


Exercise 332. Prove Corollary 43. (You should carefully specify any definitions and 





results used at each step of the way.) (Answer on p. 1892.) 
The composition of two continuous functions is also continuous: 


Theorem 28. Let f and g be nice functions with Rangeg ¢ Domain f. If g is continuous 





ata and f is continuous at g(a), then fg is also continuous at a. 
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Proof. See p. 1665 (Appendices). 





Simple example to illustrate: 


Example 1114. Define f:R- R by f(z) =sin(cosz). 





Since sin and cos are both continuous functions, by Theorem 28, f is also continuous. 


Through the repeated application of Theorems 26 and 28, we can easily tell that even a 
complicated-looking function is continuous: 


Example 1115. Define g: R> R by g(2) =sin [exp Us +a+ 1)| — 32. 


The function g is built from the arithmetic combinations and compositions of continuous 
functions. And so, by Theorems 26 and 28, g is continuous. 





Together, the results presented in this subchapter prove®®° Theorem 24: Every elementary 
function is continuous. 





386 Actually, there is one more thing we need to do: We need to also show that any function created from 
a continuous function through a domain restriction is itself also continuous. This is done in Theorem 
52 (Appendices). 
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87.7. Continuity at Isolated Points (optional) 


In Definition 198 of continuity, the fine print contained an annoying technicality, telling us 
that 


If a is an isolated point of the domain of f, then f is continuous at a. 


We now discuss why this additional fine print is necessary. 


Recall that informally, an isolated point of a set is a point that isn’t “close” to any other 


point in the se 


1, 


t 387 


It turns out that somewhat strangely, the formal definition of limits is such that the 
limit of a function is not defined at an isolated point.**? And so, if a is an isolated point 
of the domain of f, then lim f (a) is simply undefined. 

wa 


Hence, lim f (aie F (a). 


At the same time and also somewhat strangely, we’d like to say that a function is 
continuous at any isolated point (of its domain).°°? 


. Hence the need for the additional fine print, to ensure that a function is also continuous 


at its isolated points. 





387For the formal definition, see Definition 276 (Appendices). 
388See Definition 319 (Appendices)—in order forlim f (a)to be defined, a must be a limit point and, in 


particular, not an isolated point. 

Now, why might we leave lim f (2) undefined in the case where a is an isolated point? Here’s one reason: 
As discussed in Example 1085, “One key motivation for the concept of limits is to help us understand 
how a function behaves ‘near’ a point.” If a is an isolated point, then there is no behaviour of f “near” 
a for us to understand and it might thus make sense to simply leave lim f (x) undefined. 4 


389Here are two reasons why we might want to do this: 


First, we want to be able to say that the restriction of a continuous function to any subset of its domain 
is also continuous (see Theorem 52). Say f : D > R is continuous. Let E ¢ D be a set of isolated points. 
Now consider the function g: F > R defined by g(a) = f (x). Perhaps naturally, we want to be able to 
say that g is also continuous. 


























Second, under more general definitions of continuity (e.g. the topological one where the pre-image of 
open sets are also open), a function is continuous at its isolated points. So here in our more specialised 
setting, we want to also include in our definition of continuity any isolated points. a 
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Example 1116. Let f : {1,2,3} + R be the function defined by f (1) =0, f (2) =5, and 
f{3) =4. 


Figure to be 
inserted here. 


Perhaps surprisingly, the function f is continuous. 


To see why, observe that the point 1 is not “close” to either 2 or 3 (the other two points 
in the domain). Similarly, 2 isn’t “close” to 1 or 3; and 3 isn’t “close” to 1 or 2. Hence, 
each of the three points in f’s domain {1,2,3} is an isolated point. 


By our fine print then, f is automatically continuous at these three isolated points. And 
since f is continuous at every point in its domain, it is continuous. 


Once again, this example demonstrates that our informal definition of continuity (“we 
can draw its entire graph without lifting our pencil”) is actually not quite correct. The 
function f is continuous but to draw its graph, we must lift our pencil. 





Definition 198 (of continuity) is perfectly good, but a bit inelegant due to the fine print. 
Writers of more advanced texts usually prefer definitions of continuity using ¢-d phrasing 
or open sets. 
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88. The Derivative, Revisited 


Differentiation is the problem of finding the gradient of a curve at a point. 
Graphed below is some function f:R-R. 


Consider the point A = (a,f(a)). Let 1 be the tangent line to the graph of f at A. 
(Informally, / is the line that just touches the graph of f at A.) 











Now, how might we find the gradient of the line /? Unsure of how to proceed, we try a 
series of approximations: 


1. We first pick some point B = (b, f (b)) on f. 
Consider the line AB. We have 


AB’s gradient = one = f(b) = fla) 
Run b-a 


Clearly, our actual tangent line / is steeper than the line AB. Nonetheless, AB’s gradient 
serves as our first approximation of l’s gradient. 


Now, can we improve on this first approximation? Sure. Simply 
2. Pick some point C = (c, f (c)) that’s also on f but that’s is closer to A than B. 


Consider the line AC. It’s a little steeper than AB but still not as steep as /. Again, we 
have 
Rise _ f (c) — f (a) 


AC’s gradient = , 
Run c-a 





The gradient of AC’ now serves as our second and slightly improved approximation of /’s 
gradient. 
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We can keep repeating the above procedure to get ever-improved approximations or esti- 
mates of /’s gradient: 


¢ Pick a point D = (d, f (d)) that’s on f but which is closer to A than C. The gradient of 
the line AD serves as our third and slightly improved approximation of l’s gradient. 


¢ Pick a point FE = (e, f (e)) that’s on f but which is closer to A than D. The gradient of 
the line AF serves as our fourth and slightly improved approximation of /’s gradient. 


e Etc. 

The above discussion suggests that the gradient of / is this limit: 
san ft) =f (a) 
wa L-a 


We are thus motivated to write down this formal definition: 
Definition 203. Let f be a nice function with domain D and ae D. Consider this limit: 


anf O-F@. 


=o 2G 


If the above limit exists (i.e. is equal to a real number), then we say that f is differentiable 
at a, call this limit the derivative of f at a, and denote it by f’ (a). 


If not, then we say that f is not differentiable at a. 
f (x) - f(a) 
xL-a 


We call the expression the difference quotient (of f ata). 
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Example 1117. Define f:R—-R by f(z) =|z]. 


Figure to be 


inserted here. 





In secondary school, we learnt that the derivative or gradient of f at a point a is 

e lifa>O; 

e -l ifa<0; and 

e Undefined if a =0. 

We did not however learn how to formally prove the above, which is what we’ll now do. 


We start by considering just the point 2. By Definition 203, f’ (2)—the derivative of f 
at 2—is 


Using what we’ve learnt above, we can show that this limit equals 1: 


# (2) tpg £O-LO) yy -B 


r>2 





(For all x “near” 2, x > 0 and hence |z| = x) 


1 (Constant Rule for Limits). 


Since the derivative of f at 2 exists (it’s equal to 1), we say that f is differentiable at 2. 


We now consider any point a > 0. We can similarly show that f is differentiable at a, 
with, (a) =: 


f(a) =tim SOD F OO) _ yg lH 
wa LG 





= lim (For all « “near” a >0, x >0 and hence |z| = x 
aT — a 

= ite! 

gj (Constant Rule for Limits). 
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Exercise 333. Continue to define f: R—> R by f (2) =|z|. Prove that f is differentiable 
at (Answer on p. 1893.) 
(a) -3, with f’(-3) = -1; 

(b) Any a<0, with f’(a) =-1. 


(... Example continued from above.) 


We’ve shown that the function f : R > R defined by f(x) = |a| is differentiable at any 
a#0, with 


for a> 0, 
f' (a) = 
—1, fora <0. 

It turns out though that f is not differentiable at 0. 


Informally and intuitively, this is because “near” 0, there is no single number that the 
gradient of f stays “near” to. To the left, the gradient is —1; while to the right, it is 1. 


To formally show that f is not differentiable at 0, we can first show”? that the left- and 
right-hand limits at 0 exist but are not equal: 


f(x)- (0) __ 


f(e)- FO) _, 


GUM z-0 tO" r-0 





and 





Corollary 41 said that if the left- and right-hand limits are not equal, then the limit does 
not exist. And so, the following limit does not exist: 


= i (0) 


ha _ 





We shall simply say that f’(0) does not exist and that f is not differentiable at 0. 





(Example continues below ...) 





3°°The steps taken to compute these left- and right-hand limits are very similar to what we’ve just done: 


icra He) FO) | = lim [a= (Simply plug in for the function f) 





207 «0 x —-O 
-x-0 
= lim a ; (For all x “near” but less than 0, x <0 and hence || = -2) 
2>0- G7 
= lim —1=1 
207 
lim f(x) ~ (0) = lim lel 10] 0 (Simply plug in for the function f) 
r—0* rl) a0 g—(Q 
= lim “— (For all x “near” but more than 0, x > 0 and hence |z| = x) 
xw—7O0r is 6} — 


= lim 1£1. 
x—>O0t 
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Definition 204. Suppose the function f is differentiable at every point in a set S. Then 
we say that f is differentiable on S. 


And if f is differentiable at every point in its domain, then we say that f is a differentiable 
function. 


(... Example continued from above.) 


Above, we’ve shown that the function f : R > R defined by f (x) = |z| is differentiable 
everywhere except at 0. Equivalently, f is differentiable on R \ {0} or (-—00,0) U (00,0). 


Since f fails to be differentiable at even a single point in its domain, by Definition 204, 
f is not a differentiable function. 


Example 1118. Define g:R > R by g(z) = 2”. 


Figure to be 
inserted here. 


The derivative of g at 2 is 
—g(2 
2 2 
Does this derivative exist? If so, what does it equal? 


To find out, first simplify the difference quotient a little: 


g(a)-g(2)_ 22-2 _(w-2)(w+2)_ 


a 
T=2 £=2 G=zZ a 


2 2 


lim 
r2 


g(x) - 9 (2) 


at a eee 
=lim (2 +2) =lima + lim2 =4. 
ip x2 EP 


We’ve just shown that yes, the derivative of g at 2 exists. Moreover, it equals 4. 





Exercise 334 continues with this example. 





391 At =, we can use the Sum Rule for Limits because these two limits exist: 


it lim & = 2 (by the Power Rule for Limits) 
2. lim 2 = 2 (by the Constant Rule for Limits) 


If one of the above two limits does not exist, then it would be illegitimate and incorrect to apply the 
Sum Rule for Limits at =. 
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88.1. Differentiable <> Approximately Linear 


Informally, 


© 
A function f is differentiable at a pointa <=> ff is approximately linear at a. 
Or even more informally, RHS of the above statement can be rewritten as 


“Near” a (or, when we “zoom in” to a), f “looks” like a straight line. 


Example 1119. The graph of sin doesn’t “look” like a straight line anywhere. 


Figure to be 
inserted here. 


However, if we pick any point, say 0, and “zoom in”, then the graph does “look” increas- 
ingly like a straight line. We may say that sin is approximately linear at 0. 


It turns out that sin is indeed differentiable at 0. 


Example 1120. Consider the absolute value function. No matter how much we “zoom 
in” at the point 0, it never looks like a straight line. 


We cannot say that the absolute value function is approximately linear at 0. 


Figure to be 
inserted here. 


And as shown earlier, the absolute value function is indeed not differentiable at 0. 


© 
Proposition 32 (Appendices) presents a more restatement of <=>. Here we can provide a 





© 
little heuristic justification of why <=> might be true. 





392Still other writers object to the term locally linear for this reason: To say that f is locally X at a 
is to say that f actually satisfies property X, at least for points that are “near” a. However, if f is 
differentiable at a, then f may not actually be linear anywhere “near” a. Thus, according to these other 
writers, we should avoid the term phrase locally linear. 
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Start with the definition of the derivative: 





f(a) gilim/@-f@ 


Let’s hand wave and assume we can rewrite @1 as 


_£@)=F(a) 


r—-a 


f(a) 


Rearranging, f(x)» f (a) + f' (a) (a- a). 


Figure to be 


inserted here. 
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88.2. Continuity vs Differentiability 


Informally, a function is 


¢ Continuous if its graph contains no “holes” or “jumps” anywhere; and 


¢ Differentiable if it is continuous and moreover, contains no “kinks” or other “abrupt 
turns” in its graph. 


Informally, both continuity and differentiability tell us how “smooth” a function is. 


Example 1121. The functions f,g,h:R — R are graphed below: 








The function f is both continuous and differentiable. 


The function g is continuous but not differentiable. In particular, it is differentiable at 
every point except A. 


The function / is neither continuous nor differentiable. In particular, it is continuous and 
differentiable at every point except B. 





Differentiability implies continuity. Equivalently, differentiability is a stronger condition 
than continuity. Equivalently, 

















Proof. See p. 1673 (Appendices). 


Fun Fact 


Many 19th-century mathematicians believed that continuity implied differentiability. In- 
deed, they often attempted to “prove” this “result”.3" 


They were mistaken, as the function g in the above example clearly shows (g is continuous 
but not differentiable at the point A). 








393Hawkins (1968, p. 74; 1975, pp. 43-44): 
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88.3. The Derivative Is a Function 


Example 1122. Define f : R- R by f (2) = |z!. 


We showed earlier that f is differentiable everywhere except at 0, with f’(a) =1 for a>0 
and f(a) =—1 for a< 0. 


Figure to be 
inserted here. 


So far, we’ve been discussing only the object the derivative of f at some point a, 
which is a (real) number. For example, the derivative of f at -3 is -1 and the derivative 
of f at 2is 1—/’ (-3) =-1 and j’(2) =1. 


We’ll now define a new object called the derivative of f—this is the function f' : 
R \ {0} > R defined by 


ie for 7 > 0, 
i - 


—1, for ve< 0), 


Note the difference in the domains: Domain f = R, while Domain f = R \ {0}. 





Formally and generally, 


Definition 205. Let f be a nice function. Let S be the set of all points at which f is 
differentiable. The derivative of f is the function f’:.S > R defined by 


f@)=F(@) 


Fe) = lim 


ra 





At each point aé S, f’(a) gives us the derivative (or the gradient of the tangent line) of f 
at that point. 


And at each point a ¢ S, f is not differentiable and so f’(a) is simply left undefined (or 
does not exist). 





Between the appearance of Ampére’s paper [1806] and 1870, the proposition that any (contin- 
uous) function is differentiable in general was stated and proved in most of the leading texts 
on the calculus. 


The footnote at the end of the sentence lists several “leading texts” that “proved” this “result”: “Lacroix 
1810-19: I, 241n; Raabe 1839-47: I, 7ff; Duhamel 1847: I, 22; 1856: I, 94-97; Freycinet 1860: 39-42; 
Bertrand 1864-70: I, 2-4; Serret 1868: I, 16-21; Rubini 1868: I, 36-37. Two proofs were also attempted 
by Galois [1962: 383, 413ff].” 
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Example 1123. Define g:R > R by g(z) = 2”. 
We showed earlier that g is differentiable everywhere, with g’ (a) = 2a for each a€ R. 


So, the derivative of g is the function g’: R > R defined by g’ (x) = 22. 


Figure to be 
inserted here. 


(Note that in this example, Domain g = R = Domain g’.) 


In Ch. 17.3, we carefully distinguished between g and g(a): 


e The symbol g denotes a function. 
¢ In contrast, the symbol g(a) denotes a real number (namely, the value taken by the 
function g at the point a). 


Here likewise, we must carefully distinguish between “the derivative of g” and “the deriva- 

tive of g at a”: 

¢ “The derivative of g” is a function and is denoted g’. 

¢ In contrast, “the derivative of g at a” is a real number (namely, the value taken by the 
function g’ at the point a) and is denoted g’ (a). 

So, it makes sense to say “the derivative of g at 17 is 34”, because “the derivative of g at 

17” is a real number, namely 34. 


In contrast, the statement “the derivative of g is 34” makes no sense, because “the deriva- 
tive of g” is a function, not a real number. 
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Example 1124. Define h:R > R by h(x) =x? +2. Let a€R. 


The derivative of h at a is h’(a) =lim C2 a oy 
a 8 ¢-4 


Let’s first simplify the difference quotient: 


h(a) -h(a) (a2 +2) -(a? +a) _ erat Ena _ (x-a)(x+a) 


w—-a r—-a w—-a 





+l=ax+adtl. 


1 
h -—h a a 
a OnE) a een ina alr Aneel one 
w>O w>a wa wa 


Now 
? oO 4 i _— a 


We’ve just shown that for any a € R, the derivative of h at a exists and equals 2a + 1. 
Hence, the derivative of h is the function h’: R > R defined by h’ (x) = 2a +1. 


Figure to be 
inserted here. 





Let us stress, emphasise, and repeat: Given a function f, its derivative f’ is itself also a 
function. In particular, it is the function whose 


e Domain is the set of points at which f is differentiable; 


e Codomain is R; and 


e Mapping rule is ae lim f()-f a), 
ra L-a 








394 At = we can apply the Sum Rule for Limits because these three limits exist: 


1. lima =a (by the Power Rule for Limits); 
2. lima =a (by the Constant Rule for Limits); 
3. lim 1 = 1 (ditto). 
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Exercise 335. For each function, find its derivative (remember: this is a function). Then 
write down its derivative at 2 (remember: this is a number). (Answer on p. 1894.) 


(a) f :R-—R defined by f (x) = 7. 


(b) g:R-R defined by g (x) = 54 +7. 


(c) h:R-R defined by h (x) = 227 +5a +7. 


Figure to be 
inserted here. 





870, Contents www.EconsPhDTutor.com 


88.4. “Most” Elementary Functions Are Differentiable 


Recall that an elementary function is 


a polynomial function, a trigonometric function, 
an inverse trigonometric function, a natural logarithm function, 
an exponential function, a power function, 


any arithmetic combination of two elementary functions, 


or any composition of two elementary functions. 


In Ch. 87.4, we learnt that all elementary functions are continuous. Now, is it likewise 
true that 


“All elementary functions are differentiable”? 


Unfortunately, no. Two examples of elementary functions that aren’t differentiable: 


Example 1125. The absolute value function As explained on p. 403 is an elementary 
function (we explained why on p. 403). 


Figure to be 
inserted here. 





However, it is not differentiable at 0 and hence is not a differentiable function. 
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Example 1126. The arcsine function sin7! : [-1,1] > R is, by Definition 109, elementary. 


Figure to be 
inserted here. 


In Example 1148, we’ll learn that the derivative of sin! is the function (sin) :(-1,1) > 
R defined by 


1 
J1— 22 


Observe that sin is not differentiable at the points +1 and, hence, is not differentiable. 
(What about cos™'?)?9? 


(sin!)’ = 





Nonetheless and somewhat happily, we may make these imprecise claims:>”° 
“Most” elementary functions are differentiable. 
Moreover, their derivatives are themselves elementary. 
So, 


e« “Most” functions we’ll encounter in H2 Maths are elementary. 


e« “Most” elementary functions are differentiable. 


e Therefore, “most” functions we’ll encounter in H2 Maths are differentiable. 


Exercise 336. Show that the function g : R > R defined by g(x) = \/|z| is elementary 
(Answer on p. 1895.) 


but not differentiable. 








395Tt’s possible to show that cos”! is likewise an elementary function, but isn’t differentiable at the points 
+1 and, hence, is not differentiable. 


39For some justification of these imprecise claims, see Ch. 146.10 (Appendices). 
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89. More on Leibniz’s Notation 


d 
89.1. Using ie as the Differentiation Operator 
% 


See Ch. 43.2 for a discussion of Lagrange’s (f’) vs Newton’s (f) vs Leibniz’s (df/dz) 
notation. 


A function maps each object in its domain to (exactly) one object in its codomain. 


Most functions we’ve looked at are nice (Ch. 17.4)—that is, their domain and codomain 
are both sets of real numbers and the function maps a real number to a real number. 


But it is possible to imagine functions that are somewhat more abstract. 


In particular, we can imagine a function whose domain and codomain are both sets of 
functions. Such a function would map a function (in its domain) to a function (in its 
codomain). Such functions go by the special name of operators. This sounds complicated, 
but isn’t: 


Example 1127. Define the functions f,g:R > R by f (x) =x+1 and g(x) =32-1. 
Let S be the squaring operator.*®’ Then 


« S(f) =h, where h: R > R is the function defined by h(x) = (x+1)*. That is, the 
squaring operator S maps the function f to the function h. And we have S(f) (2) = 
(2) = 9) 

« S(g) =i, where i: R > R is the function defined by i(z) = (3x-1)*. That is, the 
squaring operator S maps the function g to the function i. And we have S'(g) (2) = 
i (2) = 25. 

Let T’ be the divide-by-two operator. ‘Then 


« T(f) = 7, where 7 : R > R is the function defined by j (x) = (a+1)/2. That is, 
the divide-by-two operator T’ maps the function f to the function 7. And we have 
TCI 2) 8) es 

« T(g) =k, where k : R > R is the function defined by k(x) = (82-1)/2. That is, 
the divide-by-two operator T’ maps the function g to the function k. And we have 


T (g) (2) = k (2) = 5/2. 


Exercise 337. Continuing with the above example, what are T(h), T (7), S(7), S(k), 
Py 2) TONG). 5) (2) andes (42) (Answer on p. 1896.) 





Let f be a function. Then using Leibniz’s notation, we can denote 


d 
¢ Its derivative by the symbol cf (this is a function); and 
ie 


df 


¢ Its derivative at some point a by the symbol an (a) (this is a number). 
A 





397Strictly and pedantically speaking, since S' is also a function, we should also specify its domain and 
codomain. They could, for example, both be the set of all nice functions. 
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d 
Let’s now go a little further with Leibniz’s notation and use the symbol An to denote the 
g 
d 
differentiation operator. That is, a: will itself denote the function (or operator) that 
a 
maps any function (f) to its derivative (f’ = f = —). 


Example 1128. Define f,g:.R—R by f (x) =27+7x+3 and g(x) = 2x +7. 
Observe that the derivative of f is g. We may write 


Equation 2 says that “the derivative of f is g”. 


Now, we shall also write 


d 
Equation 2 now says that “the differentiation operator ae maps f to g”. 
a8 


Of course, there isn’t any practical difference between equations + and 2. All we’ve done 
is to introduce a new symbol — that denotes the differentiation operator. As we’ll see, 


this new piece of notation will occasionally be convenient. 
1 ; iI 
Example 1129. Define h,i: R\ {0} > R by h(x) =— and i(x) = -—. 
i se 


The derivative of h is 7 a = 
dx 


d 
Equivalently, the differentiation operator an maps fh to 2: 
z 


d 
—h =1. 
dx 5 


Example 1130. The derivative of the sine function is the cosine function: 


dsin 
Cos, 
dx 


d 
Equivalently, the differentiation operator ar maps sin to cos: 
z 


da. 
— sin = cos. 


dx 
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Example 1131. The derivative of the cosine function is the negative sine function: 


dcos : 
=-sin. 


dx 


d 
Equivalently, the differentiation operator an maps cos to —sin: 
z 


Example 1132. Given the functions f and g, the Product Rule says that 


EVeD ati ee dg 
dx dz dx 


d 
Equivalently, the differentiation operator a maps the function f-g to the function 
z 
df dg. 
———se g + 
dx ‘da’ 
dg 


d of 4 7 Y 
se) a ee gry aa 





OO) 


Again, it is merely customary to use “x” as our symbol for the dummy variable. We can 
replace “x” with any other symbol, such as y, t, 6, 4, or *. So, the last five examples 
could’ve a rewritten as: 


Example 1133. dy =g or ey =n 
dy dy 
dh _ d 
E le 1134. — —h =1. 
xample 113 cra or ee a 


Example 1136. eee = 


Example 1137. 


ae Cec 
dx« 7 ae 
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89.2. Using = as Shorthand 
x 


So far, whenever we’ve written 


—o=a 
dx 


each of the objects in the two blank Os is a function. 


So far, we have not written statements such as this: 


d 91 
a = 27 
a z 


We will now go ahead and say that Aisa perfectly legal statement. In particular, * shall 
serve as convenient shorthand for this precise but long-winded statement:°° 


If a differentiable function maps each x to x”, then its derivative maps each x to 
22. 


Example 1138. The statement 

d 1 

7 ea On 41 

5, (2a" +2) 

is convenient shorthand for this precise but long-winded statement: 


If a differentiable function maps each x to 27° +x, then its derivative maps each 
eioer al. 


Note that while convenient, + Jeaves out some information—in particular, the domain 
and codomain of both the function and the derivative. 


; oe ie 
Sometimes we may not have this information and so = is all we can say. 


But where we do have this information, we might prefer to explicitly state it. For example, 


instead of just the statement =, we might prefer to write 


The function f : [-3,5] > R defined by f (x) = 2x°+z has derivative f’ :[-3,5] > 
R defined by f’ (x) = 62? +1. 








3%Definition 206 (Appendices) explains how such shorthand generally works. 
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Example 1139. The statement 


Gar eA al 
— sin x = cosa. 


dx 


is convenient shorthand for this precise but long-winded statement: 


If a differentiable function maps each x to sin x, then its derivative maps each x 
to cosa. 


Again, = leaves out information about the domain and codomain of both the function 
and the derivative. Where we do have this information, instead of just the statement é, 
we might prefer to write, for example, 


The function f : [-3,5] > R defined by f () = sin x has derivative f’: [-3,5] > R 
defined by f’ (a) = cosa. 





Formally, 


Definition 206. Let f(a) and g(a) be expressions containing the variable x. We shall 
write 


=f (0) =9(2) 


to mean the following: 


If a differentiable function maps each x to f (x), then its derivative maps each x 
tO G( a). 


Exercise 338. For each of the following statements, write down its corresponding precise 

but long-winded statement. (Answer on p. 1896.) 

d 3 2 

a) —2° = 32". 
(a) = 


2 


d 
(b) 5, sine = 27 cos x”. 
x 


d il 
(c) qq ne ve 


Exercise 339. What’s wrong with the following “proof” that 0 = 1? 


. Define f :R—>R by f (x) = 27 - 32. 
. The derivative of f is the function f’: IR > R defined by f’ (x) = 22 - 3. 
. Observe that f (2) = 2?-3-2=-2. 


d 
. So, f’ (2) = — (-2) =0. 
0, #2) == (-2) 
. But from Step 2, we also have f’(2) =2-2-3=1. 
. Hence, by Steps 4 and 5, we have 0 = 1. (Answer on p. 1896.) 
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89.3. of Is Not a Fraction; Nonetheless ... 
x 


d 
To repeat, the symbol f’, f, or a denotes a function—namely, the derivative of f: 
c 


Leibniz 
t 
t . 
i a df = Derivative of f 
| daz 


Newton 


Lagrange 


The symbol f’ (a), f (a), a , or = (a) denotes a real number—namely, the derivative 
a z 


of f at a, which is a limit (of ‘a fraction): 


Leibniz 
Lagrange ee, 
xn tey| Alen.) OI) 
f aS ; = 9, () = in 





xv=a 
Newton 


d 
Leibniz’s notation f looks very much like a fraction, with numerator df and denominator 
ie 


dz. However, in is not a fraction. 
x 


I repeat, — is not a fraction. 
x 


d d d 
Neither 7 nor ot (a) denotes a fraction. Instead and to repeat, ct denotes a function, 
x g x 


while as (a) denotes a real number. 
iz 


d 
We just made it clear that - is not a fraction. Nonetheless, as an informal and intuitive 
x 


d 
aid to our understanding, it can be helpful to think of a as a fraction. 
a 


To illustrate, consider the Chain Rule. When formally stated in Lagrange’s notation, it 
looks like this (see Theorem 32 below): 


(fog)'=(fiog)-g'. 


With é, it is difficult to understand or remember the Chain Rule. 
But when stated using Leibniz’s notation, the Chain Rule becomes much easier to under- 
stand and remember: 
dz2dz_ dy 
da dy da’ 
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With 2 we can easily see the intuition behind the Chain Rule: 


The change in z due to — The change in z due to ‘ The change in y due to 


a small unit change inz  asmall unit changeiny  asmall unit change in x 


With 2 we can also easily remember the Chain Rule, by simply thinking of three terms 
dz dz 


d 
—, —, and <Y as fractions and naively applying primary-school algebra: 
dx’ dy dx 


az de, 
dx dg dx 

Of course, formally, we aren’t actually doing anything like cancelling the dy’s. Nonetheless, 

the above is a nice way to remember the Chain Rule. 


The Inverse Function Theorem (which we'll learn about in Ch. 91.2) is another example 
of the advantages of Leibniz’s notation. 


df 


It turns out that Leibniz did think of an as a fraction! For him, 
a 


e dx denoted an “infinitesimal change in x”; 


e df denoted the corresponding “infinitesimal change in f”; and 
d 
° f really was a fraction with numerator df and denominator dz. 
ie 


It turns out though that this approach ran into some technical difficulties. In particular, 
while the concept of infinitesimals was intuitive and plausible, it was difficult to give it 
precise and rigorous meaning. 


In the 19th century, these difficulties were resolved (or perhaps simply sidestepped) by 
redefining the derivative as a limit (of a fraction).°°? This 19th-century solution remains 
the standard modern approach and is also the approach we’ve taken in this textbook.* 


Subchapter summary: 


e Under the modern approach, it is technically and formally wrong to think of of as a 
fraction with numerator df and denominator dz. 

e« Nonetheless, as an informal and intuitive aid to our understanding, it can be helpful 
to think of i as a fraction. For example, it helps us see the Chain Rule’s underlying 


intuition and also remember the Rule more easily. (The same will be true of the Inverse 
Function Theorem. ) 





399See Ch. 146.2 (Appendices). 
00T) the 1960s, Abraham Robinson (1966) introduced what he called Non-Standard Analysis—he gave 


d 
Leibniz’s infinitesimals precise and rigorous meaning, so that dy could at last be legitimately treated 


x 
as a fraction. However, non-standard analysis has not been widely taken up as it is not clearly superior 
to the standard approach. 
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d 
e And actually, Leibniz did think of a as a fraction—it was only in the 19th century that 


as 
such thinking was definitively replaced by the modern approach. 


Also, as explained in Ch. 89.1, another advantage of Leibniz’s notation is that we can use 


d 
the symbol os to clearly denote the differentiation operator. (It’d be harder to do the same 
a 


with Lagrange’s prime ‘ or Newton’s dot notation ‘.) 








401 Below are the other eight questions I’ve come across. They do not add much to the discussions already 
given in the two questions mentioned above. 


e« How misleading is it to regard @ as a fraction? 

e« Can I ever go wrong if I keep thinking of derivatives as ratios? 
e If vat doesn’t cancel, then what do you call it? 

e Why do most of the operations treating dy/dz as a ratio works? 
e What is wrong with treating ae as a fraction? [duplicate] 


e When can’t dy/dz be used as a ratio/fraction? 
e When can we not treat differentials as fractions? And when is it perfectly OK? 


e Why do we treat differentials as infinitesimals, even when it’s not rigorous 
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Fun Fact 


d 
Here’s what Alan Turing thought about Leibniz’s notation - in his recently discovered 
18 


wartime notebooks: 


6) Te hecbncy ushehen Fit pel them 


wnelirtad — ovh yy) yi hows ania) Ce fk 


It ental ipl Hab Mee wlebe hehe 


Ghec . 


hen Caceh clon sv 


d 
The Leibniz notation = I find extremely difficult to understand in spite of 
iD 
it having been the one I understood best once! It certainly implies that some 


relation between x and y has been laid down e.g. 
y=o°+3a 


— Alan Turing (c. 1944).4? 








4021Discussion of what Turing may have meant by this remark: 8. 
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90. Rules of Differentiation, Revisited 


We reproduce from Ch. 43.6 and 43.7 these Rules of Differentiation: 


Theorem 14. (Rules of Differentiation) Let c be a constant and x, y, and z be 
variables. Then 


Constant Rule Constant Factor Rule Power Rule 


d 
= (c y) ES —~ — a? 2 eg 


dx 


Sum and Difference Rules Product Rule 


ae 
ae 


d aay dz 
ae ee = 


oo ae “ 2,0. 
ae 5 (v2) £28 


Quotient Rule Sine Cosine 





d zo = ye : 
Ys “do 5 du = sine = cos’ 
drz z dx 


Natural Logarithm Exponential 
d i d 


—Inzx=- —e” =e” 


ay a dx 


Theorem 15 (informal). (Chain Rule) Suppose y and z are functions. Then 


dz dz dy 
dx dy dx’ 





Some easy exercises to get you reacquainted with the above Rules: 


Exercise 340. XXX (Answer on p. 882.) 
A340. 


We now have a better understanding of what derivatives are. In particular, we now under- 
stand that 


A derivative is itself a function. 


Also, as discussed in Examples 1138 and 1139, the Rules of Differentiation as stated in 
Theorem 14 leave out information about the domain and codomain of both the function 
and its derivative. 


Moreover, Theorem 15 is stated somewhat informally. 


So, in the following four subchapters (optional), we’ll restate and also prove these Rules of 
Differentiation. 
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90.1. Proving Some Basic Rules of Differentiation (optional) 


We start with the simplest, the Constant Rule, which says that the derivative of a 
constant function is a zero function: 


Fact 203. (Constant Rule for Differentiation) Let D,E CR, ce E, and f: D> E be 
a function. If f is defined by f (x) =c, then the derivative of f is the function f': D>R 
defined by 


Ge a 





Proof. Let ae D. At =, use the Constant Rule for Limits (Theorem 23): 


PEIN) ie 8 ating 
La r-ay-qQ «a 


f’ (a) = lim 


wa 


We’ve just shown that for every a € D, the derivative of f at a equals 0. Hence, the 
derivative of f is the function f’: D > R defined by f' (x) £0. 














By the way, a partial’? converse of the above is also true—provided the function’s 


domain is an interval, if that function’s derivative is a zero function, then the original 
function is a constant function: 


Proposition 8. Let D,E CR and f: D> E be a function. Suppose D is an interval. If 
the derivative of f is the function f': D> R defined by f’ (x) =0, then f is a constant 





function—i.e. f defined by f (x) =c (for some ce E). 


Proof. This result is perhaps intuitively “obvious”. However, its proof requires some slightly 
more advanced tools and so we’ll relegate it to the Appendices—see p. 1684. 














The Constant Factor Rule says that the derivative of a constant multiple of a 
function is that constant multiple of that function’s derivative: 


Fact 204. (Constant Factor Rule for Differentiation) Let D,E ¢ R, ce R, 
f:D-—E be a function. Suppose f is differentiable. Then the function cf is 
differentiable and its derivative (cf)': D > R is defined by 


(cf) (x) £ cf’ (2). 





Proof. Let ae D. At =, use the Product Rule for Limits (why is this legitimate?):4% 





“3This is a partial converse because we add a requirement (namely, that the function’s domain is an 
interval). 
44Because these two limits exist: 


1. lime = (by the Constant Rule for Limits); 
f (a) = f(a) 
Ea 


2. lim 


za 


= f'(a) (by the definition of the derivative). 
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OED as) 1) 


= ra [=o 


(cf)! (a) = lim 





© f'(a) 
We’ve just shown that for every a€ D, the derivative of cf at a equals cf’ (a). Hence, the 
derivative of cf is the function (cf))': D > R defined by (cf)! (x) £ cf! (z). 














The Sum and Difference Rules say that the derivative of the sum (or difference) 
of two functions is the sum (or difference) of their derivatives: 


Fact 205. (Sum and Difference Rules for Differentiation) Suppose f,g: D> R 
are differentiable functions. Then the function f +g is differentiable and its derivative 
(f+g)':D-—R is defined by 


(fg) (x)= f' (x) 49! (2). 





Proof. For any a¢€ D, we have” 


fag) (a) tim £29) #9) (0) _,,,, FO) +9@)-L@) + 9(@)] 


=tim [He else) 
£ Yim LOL 4 hy SO 9) = 97) 4 g(a) 


f(a) g(a) 


We’ve just shown that for every a € D, the derivative of f +g at a equals f’ (a) + g' (a). 
Hence, the derivative of f +g is the function (f +g)’: D > R defined by (f +9)’ (x) = 


f' (a) +9" (a). 














Fact 206. (Power Rule for Differentiation) Let DCR and f: D>R be a differen- 
tiable function. Suppose f is defined by f (x) =x° for some ce R. Then the derivative of 
f is the function f': D +R defined by 


f' (@) 2 ex, 








If either of the above limits did not exist, then the application of the Product Rule for Limits would 
not be legitimate. 
405 At =, we can use the Sum and Difference Rules for Limits because these two limits exist: 
tm f@-f@) 
L-a 


«wa 


= f'(a) (by the definition of the derivative) 


zr>a 


2. lim pa) g(a) = g' (a) (ditto) 
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Sadly, a complete proof of the Power Rule is beyond the scope of H2 Maths and this 
textbook. 


Nonetheless and very excitingly, we will now learn to prove the Power Rule in the special 
case where the exponent c is a non-negative integer. 


Of course, we’ve already proven the Power Rule in the simplest case where the exponent 
is 0—this is simply the Constant Rule. And so, let us start with the next simplest case— 
namely, the case where the exponent is 1—and work our way up: 

Example 1140. Define f: R—>R by f(x) =z. 

For any a € R, we have 


fy = lim ~—* =tim1 £1. 


ra Lr L7-ay-qQ «a 


We’ve just shown that for any a eé R, the derivative of f at a exists and equals 1. 
Hence, the derivative of f is the function f’:IR > R defined by f’ (2) = 1. 


Example 1141. Define g:R—> R by g(a) = 2”. 
Let ae R. Simplify the difference quotient: 


g(v)-g(a)_22-a? _(w-a)(w+a) _ 


r+aQ. 
opens 


g(a) = lim 


wa 


g(x) ~ g(a) 


. ae 40 . 
= lim (z +a) = lima + lima = 2a. 
wa wa wa 
—— ——’ 
a a 


We’ve just shown that for any ae R, the derivative of g at a exists and equals 2a. 





Hence, the derivative of g is the function g’: IR > R defined by g’ (x) = 2z. 





“6We actually already did this in Exercise 334(c). 
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Example 1142. Define h: R > R by h(x) = 2°. 


We are given this hint: x —a3 +(x -a) (2? +ax+a7). 


Let ae R. Simplify the difference quotient: 


h(z)-h 3 a3 - 2+ax+ a? 
(2) h(a) _o8-a8 1 (w-a) (2 tanta) 9 
r-a T-a L-a 


Bo! 


2 





h’ (a) = lim Ae) = lim (x 


ae apo ° 5 
+QaKn + i) = limz? + limaz + lima? = 3a’. 
za T-a ra 


wa wa wa 


We’ve just shown that for any a€R, the derivative of h at a exists and equals 3a’. 


Hence, the derivative of h is the function h’:R > R defined by h’ (x) = 3a”. 





We can go on finding the derivatives of higher integer powers using this “hint”: 
a” a7 =(¢~a) (4a), 


xv - a> =(a-a)(x2? +ax+a°), 


* 
a =a = (ea) (a +ar +072 +0"), 


Ilo 


n—-2 


: (v-a) (a1 +a" ata" 3a? +---+20"! +a") 


zc” —a” 


You'll use = and = in the next two Exercises: 


Exercise 341. Define i: R > R by i(x) = 2*. Find the derivative of i. (Answer on p. 
1897.) 


Exercise 342. Define 7 : R > R by j (x) = 2°, where c is a positive integer. Find the 
derivative of j. (Answer on p. 1897.) 








407 At =, we can use the Sum Rule for Limits because these three limits exist: 
1. lima? = a? (by the Power Rule for Limits) 
2. limax = a” (by the Power and Constant Factor Rule for Limits) 


x 


3. lima? = a? (by the Constant Rule for Limits) 
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90.2. Proving the Product Rule (optional) 


Theorem 30. (Product Rule) Let DCR. Suppose f,g: D > R are differentiable 
functions. Then f-g is differentiable and its derivative is 





Wek ors eG 


Proof. Let ae D. Take the difference quotient and manipulate it: 
(f- g(a) -F 9) _ fa)g(@)-f@g@ 


x-a x-a 
Plus Zero Trick 











_ jg) {=H , L)-F) 6) 
eas (fg) (a) = lim (eg) o) - u -g) (a) 
wiy[ se) LO=2O) , LED= 1g) 
lim 7 (elim gla) = 9a) = 94) + lim re)= 7a) (a) ~ F(a) lim g(a) 
f(a) g'(a) f'(a) g(a) 


= f(a)g'(a)+ f(a) g(a). 


We’ve just shown that for any a € D, the derivative of f-g at a exists and is equal to 
f (a) g' (a) + f(a) 9 (a). 

Hence, the derivative of f +g is the function (f-g)’ : D + R defined by (f-g)' (x) = 
f(x)’ (a) + f'(%) 9 (2). 

Or equivalently, the derivative of f-g is the function f-g'+ f'-g. 

















408 At “2°, we can use the Sum and Product Rules for Limits. because these four limits exist: 
1. lim f (@) = f (a) (by the differentiability and hence continuity of f) 


2. limg (a) = g(a) (by the Constant Rule for Limits) 
f@)-F(@) _ 
E-G 


3. lim 


«wa 


f' (a) (by the definition of the derivative) 


za 


4. lim pa) g(a) = g' (a) (ditto) 
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Fun Fact 


The Product Rule (for differentiation) is sometimes called the Leibniz Rule. 


But Leibniz initially got it wrong! He initially guessed (as one might) that analogous to 
the Sum and Difference Rules, the derivative of the product is equal to the product of 
the derivatives. ‘That is, 


Cro = hob 


He quickly realised though that this naive Product Rule was wrong and arrived at the 


correct Product Rule.*2 








409See Ch. 146.11 (Appendices) for more about this story. 
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90.3. Proving the Quotient Rule (optional) 


Theorem 31. (Quotient Rule) Let DCR. Suppose f,g: D > R are differentiable 
functions. Then the function f/g is differentiable and its derivative is 


Cy ian ee gt 
ge 


Note that the domain of f/g and its derivative is D\ {x: g(x) = 0}. 





Proof. Let ae D\ {a: g(a) = 0}. Take the difference quotient and manipulate it: 
(f/9) (@) - (Fig) (@) _ F(a) /9(@) - FO /9 @ 
tig =O, 
Plus Zero Trick 


_f(@)/o(@)-f (2) + f(@) - F(@) /9 (a) 




















1 1 f@a@)-f@9@)+f@s@-f@a@) 
g(a) g(x) ra 

1 LP P@)-F@) _ py 9(2)- 9a) 
ya 2 Oo 


So, 





410 At 2, we can use the Difference and Product Rules for Limits because these six limits exist: 


1 
1. lim ——~ (by the Constant Rule for Limits) 


rag(a) g ia) 
2. lim f (x) = f (a) (by the continuity of f) 
ae lim g (7) =¢ 7 (by the continuity of g) 

4. lim a (by the Reciprocal Rule for Limits) 
rag(t) g 10 
f(@)-F(@) _ 

E- a 


g(x) ~ g(a) 


5. lim 


«wa 


f' (a) (by the definition of the derivative) 


6. lim 


za 


= g' (a) (ditto) 
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(2) (o) =tin LD) 








ctimf 2} [orgy f@-L@ _ 5-4) 9(2)-9@) 
-tin{ a!) ee 1 | 
2 im img (x im 2) 79) im f (x im 242) = 9() 
. lin im [lm 9 ( )lin C—-a lim f ( )lim C-a | 
ie eis g(a) f'(a) f(a) g(a) 
wa) ie) 
_ g(a) f(a) - fla) g' (a) 
[9(a)] 


We’ve just shown that for any a¢ D\ {x: g(x) =0}, the derivative of fg at a exists and is 
g(a) f(a) - Flag’ (a) 

[9(a)]’ 
Hence, the derivative of f/g is the function (f/g)': D \ {x: g(x) = 0} > R defined by 


(£) @- g(x) f(z) - F(z) g' (2) 





equal to 














g [o(x)} 
Or equivalently, the derivative of f/g is the function 
IR Mle eh 
(9) 
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90.4. Proving the Chain Rule (optional) 
Formal statement of the Chain Rule: 
Theorem 32. (Chain Rule) Let f and g be nice functions with Range g © Domain f. 


Suppose f and g are differentiable. Then the function fog is differentiable and its 
derivative ts 





Gece 


“Proof.” Let E =Domaing and ae E. 


(fog) (@)< lim OOO) gy LO) 


Lyn | fO9 (2) - FG (a) 9 (2) -9 @) 
ma} g(x) - g(a) ra 
x pm Ag (2) - F (9 @)),. gi) 9(a) —9(0) 
oe GG), a 

f(g(a)) af(a) 
= f(g (a)) 9 (a). 


We’ve just shown that for any a € FE, the derivative of fog at a exists and is equal to 
f(g (a)) 9" (a). 
Hence, the derivative of f og is the function (f og)’: E > R defined by 


(fog) (x) = f'(g(2))9' (2). 


Or equivalently, the derivative of fog is the function 
(flog) -g. 


Unfortunately, the above “proof” is wrong! It contains two flaws: 














1. At é. there is the possibility that g(a) = g(a) for some values of x “near” a. There is 
thus the possibility that we’re committing the Cardinal Sin of Dividing by Zero (see Ch. 
Ute ee 


2. At =, we use lim 269 6%) ~ f(g (2) z 
wa g (a) -g (a) 
done to show that this is actually true. 


"(g(a))—it turns out that some work must be 


Nonetheless, these two flaws may be regarded as mere blemishes or technicalities that can 
be “easily” fixed (as we do (Appendices), Ch. 146.12). Though flawed, the above “proof” 
is “mostly” correct and “mostly” explains why the Chain Rule “works”. 





BS Bae ANGE i) sta) = g'(a), as discussed in Remark 127. The 


difference is that when considering “lim”, we are assured that x - a #0 but not that g(x) - g(a) #0. 


“11Tn contrast, it is correct to assert that lim 
wa 
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91. Some Techniques of Differentiation 
91.1. Implicit Differentiation 


Example 1143. Consider the curve described by 


v2 + /y=l for x ¢[-1,1], y>0. 


Figure to be 
inserted here. 


What’s the gradient of this curve at each point? 
Method 1. Take 4 and do the algebra to express y in terms of x: 


a+ Jysl Jy =1-2" y2 (1-22). 


d 
Now apply the oe operator to 2; 
19 


(1-2?) =2(1- 2”) (-22) = 42 (1-2). 


This last expression*!? gives us the gradient of the curve at each point 2. 


For example, at the point x = 0, the gradient of the curve is 


1 
While at x = 5? it is 








412.4 bit more formally and in Leibniz’s notation, we’d say that 2 allows us to define a function y : [-1, 1] > 
R by 


























The derivative of y is then the function y’ :[0,0o) > R defined by 


y' (x) =-4¢(1-2°). 
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(... Example continued from the previous page.) 


d 
Method 2. Take + and directly apply the ap operator: 
uD 


d d 
ie (2? + Vy) = ae, tes + 0 2 -dr/Y. 


This last expression gives us the gradient of the curve at each point x. 


For example, by z, at x =0, we have y = 1; so, the gradient of the curve is 


a =-4.0V1=0. 
az 
(z,y)=(0,1) 


1 9 
While at x = 5? we have y = 16? so, the gradient of the curve is 


Not surprisingly, these numbers agree with what we found in Method 1 (we’d be worried 
if they didn’t). 


By the way, if we’d like, we can plug 2 into 2 to get 


2 = —4r\/ (1-22)? = -4r (1 - aa 


Again, this agrees with what we found in Method 1. 


d 
In Method 1, we first expressed y explicitly in terms of x, then found a 
a 


In contrast, in Method 2, we skipped the step of expressing y explicitly in terms of 2. 
Instead, we directly applied the da operator to the original equation +. This method is 
x 


called implicit differentiation (because we skip the step of expressing y explicitly in 
terms of 2). 


Implicit differentiation will usually involve using the Chain Rule. Here for example, we 


used the Chain Rule at Buin ; 
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Example 1144. Consider the curve described by 


e+yrel for x €[-1,1], y>0. 


Figure to be 
inserted here. 


What’s the gradient of this curve at each point? 
Method 1. Take = and do the algebra to express y in terms of x: 


=> y=4V1-2?. 


We were given that y > 0. So, we can discard any negative values of x to get 


y2V1— 22. 


d 
Now apply the = operator to 2. 
a 


dy d Jia e i a 
— > {L= 25 =) a a nt 
de de 4 9/1 — 72 a”) ta 


d 
Note that whereas our curve is defined for all x € [-1, 1], - is not defined at x = +1. 
z 


Indeed, as we can see from the above graph, at x = +1, the gradient of the curve is vertical 
(and thus considered undefined). 


This last expression*!? gives us the gradient of the curve at each point 2. 


(Example continues on the next page ...) 








413 4 bit more formally and in Leibniz’s notation, we’d say that 2 allows us to define a function y:[-l,1]- 
R by 














y (x) 2 V1 = 2. 


The derivative of y is then the function y’ : (-1,1) > R defined by 
x 

















NE) OP 


Again, note the subtle difference in the domains of y and 1 (the latter excludes +1). 
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(... Example continued from the previous page.) 


For example, at the point x = 0, the gradient of the curve is 





il 
While at x = 5? it is 


d 
Method 2. Take = and directly apply the ar operator: 
R 


die ay CR dy _ 
pra ee Viera! => 4 =0 


By é at x =0, we have y = 1; so, the gradient of the curve is 


dy 0 
— =--=0. 
dx 

(x,y)=(0,1) 


1 3 
By z. aie = 5? we have y = ue so, the gradient of the curve is 


a 


i (zy)=(3.>8) 





If we'd like, we can plug 2 into 2 to get 
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In the above two examples, Method 2 (implicit differentiation) was not obviously superior 
to Method 1. It will be in the next example, where Method 1 can’t even be used: 


Example 1145. Consider the curve described by 


5 


yi +yte +220. 


Figure to be 
inserted here. 


Q: What’s the gradient of this curve at the point x = 0? 


Here, Method 1 does not work at all. Try as we might, we are unable to express y in 
terms of x. 


d 
So, we turn to Method 2 (implicit differentiation). Apply the in operator to 2: 
is 


d d 
a ly +y +a +2) = —0 ea 


By 4, at x =0, we have y’ + y =0 or y(y°+1) =('or y =0- So, plus(z,4)) =(0,0) into 2 
to get 

dy oe ee 

dal(zy)=(0,0) 7-06+1 — 


A: The gradient of the curve at x = 0 is -1.4!4 








“14Tn this example, we’ve used not just the Chain Rule, but also a slightly deeper result called the Implicit 
Function Theorem. The Implicit Function Theorem says that even though we may be unable to 
express y in terms of x, equation + does nonetheless implicitly define a function y in terms of x and, 
moreover, this function y is differentiable. 

For the Implicit Function Theorem to work, certain technical conditions need to be satisfied. But we 
needn’t worry about this—in H2 Maths, we can safely assume that these conditions are always satisfied. 
We won’t say much more about the Implicit Function Theorem, which isn’t covered in H2 Maths or 
most introductory calculus courses, as it requires a little knowledge of multi-variable calculus and partial 
derivatives. For more information, see Wikipedia or Krantz & Parks (1993, in particular Theorem 1.3.1). 
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Example 1146. Consider the curve described by 


x? +sin (xy) +y=0. 


Figure to be 
inserted here. 


Q: What’s the gradient of this curve at the point x = 0? 


Again, Method 1 does not work at all. Try as we might, we are unable to express y in 
terms of «x. 


d 
So, we turn to Method 2. Apply the ce operator to s: 
e 


d d 
=, [2 + sin (xy) +y| =- —0 nase 2x + cos (ry) (year 
y 


0. 
dx dz 


nares 


dx 
By 4, at x =0, we have y =0. So, plug (x,y) = (0,0) into 2 to get 


dy dy dy 
2-0+cos(0-0 (o+0. \+2 =() =— — =() 
o> dax!(2,y)=(0,0)/ dax!(«,y)=(0,0) da !(x,y)=(0,0) 


A: The gradient of the curve at x = 0 is 0. 
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Example 1147. Consider the curve described by 


ay +22 +y +0. 


Figure to be 
inserted here. 


What’s the gradient of this curve at the point x = 0? 


Again, Method 1 does not work at all. Try as we might, we are unable to express y in 
terms of x. 


d 
So, we turn to Method 2. Apply the oe operator to Z: 
x 


d d 
= (yaa fy) = “0 aca 3a7y? + 5asyt— + 2x + = =0 
£ £ 
dy 2 +3zy? 


dr "Baby! ah 


And so, at x = 0, we have dy =\(), 
da 


Conclude: The gradient of the curve at x = 0 is 0. 


Exercise 343. Consider the curve described by 


a sin =), for x € (—00, -1) U(1, 0), ye(-5,5). 


Find the gradient of the curve at each point x, in terms of 2. (Answer on p. 1899.) 
Exercise 344. Consider the curve described by 


ysing =e"*4 - 1, 


Find the gradient of the curve at the point x = 0. (Answer on p. 1899.) 





The following derivatives are in List MF26 (p. 3), so no need to mug:*° 





“We also mentioned this earlier in Part I, Remark 87 (p. 493). 
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where x € (-1, 1), 


where xz € (-1, 1), 


d 
(d) Fp COsee B= —cosece Cole. where zx is not a multiple of 7, 
a 


d ™ 
(e) qa Ces secxrtan gz, where x is not an odd multiple of 5 
o 





Proof. (a) is proven in Example 1148 (below). 
You're asked to prove (b) and (c) in Exercise 345. 
As for (d) and (e), in Exercise 186(f) and (g), we already showed that 


1 cosx d 9 sina 
—cosecr = -—> and = seed > — 
dx sin’ x dx cos* x 














Can you explain why = and 2 are consistent with (d) and (e)?41° 





d 
Example 1148. We’ll use implicit differentiation to show that ae sing = 
£ 


Te 1 
Let y=sin ze 5. |, so that x = sin y. 
d 1 
Apply the a operator to =: 
a 


lee CR d 
x= —siny — 1 = cosy— 


1 
pe — f + 0. 
di dz da y moog. 


By the identity sin? y + cos” y = 1, we have cos y S4V1— 22. 


i FH 
But since y € \->. | we know that cosy > 0. And so, we may discard negative values 


oe 4 
and rewrite = as cosy = V1 -2?. 


d 1 
Now plug 4 into 2 to get aS sin g¢=— = a which is defined for x € (-1,1).4"" 
I Bs 











416 COs © 1 cosz sin x 1 sing 
ee = = —cosecz cot 2; 25 =secr tang. 
ins sin?z _ sing sing cos COS & 
So, sin™’ is not a differentiable function: Its donna is oe area i but that of its derivative is (-1, 1). 
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91.2. The Inverse Function Theorem (IFT) 


Example 1149. Let x be the mass (g) of Milo powder in a cup and y be the volume 
(cm?) of Milo powder in that cup. 


Suppose that when we add 1g of Milo to the cup, the volume of Milo powder increases 


by 5 cm?. That is, the rate of change of volume (y) with respect to mass (2) is 


Intuition suggests that if we had wanted to increase the volume of Milo in the cup by 
1cm?, then we should instead have added 2g of Milo. That is, the rate of change of mass 
(x) with respect to volume (y) is 





In the above example, we used this piece of intuition: 


The change in y due to 1 1 
a small unit change in x The change in x due to 


a small unit change in y 
This is the Inverse Function Theorem (IFT):*'® 


Theorem 33 (informal). (The Inverse Function Theorem) 


d 
(provided 7 0) 





Earlier we saw that with the Chain Rule, Leibniz’s notation shines. Here with the IFT, 
Leibniz’s again notation shines. From 2 we can easily see the IF'T’s underlying intuition 
(as given by =).419 


d d 
Exercise 346. Suppose x7y+sinz = 0. Find ra Hence write down a (You may leave 
x 7] 





your answers expressed in terms of both x and y.) (Answer on p. 1900.) 





“18For a formal statement, see Theorem 54 (Appendices). 
“19Tn contrast, in Lagrange’s notation, we’d have to state the IFT as 


13 ll 
gy 


From 3 it is arguably harder to see the IFT’s underlying intuition (as given by +). 


y 
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Example 1150. Suppose x = siny, for y € |. |. 


\ oe ohn 
= = oly — On 
dy 


d 1 
By helen <= 
dz cosy 
d d i | 
But y= sin! x. So, ed ee = aa 
dx dz Cosy cos (sin ) 





d 1 
Exercise 347. The last example showed that — sin! x é — Fs lt) Example 
dx cos (sin! x) 


d i 
1148 also showed that ae sin! x 2 . Why aren’t + and 2 contradictory? (Answer 
ua 


Vlei 
on p. 1900.) 
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91.3. Term-by-Term Differentiation (optional) 


Let fo, fi, fo,... be nice, differentiable functions, whose derivatives are f}, fi, f5,---- 


Suppose we define a new function g by 


G= jor i: 


Then by the Sum Rule (for Differentiation), g is also differentiable and its derivative is 
simply 


g = fot fi 
Similarly, suppose we define a new function h by 


h= fot fit fo. 


Then again by the Sum Rule, we know that h is also differentiable and its derivative is 
simply 


hl = fit f+ Bh 
More generally, for any n € Z", suppose we define a new function 7 by 
n 
t=) fi=fot fit thn. 
i=0 


Then by the Sum Rule, 2 is also differentiable, with 


; d n n d nm 
= dad ag Ala fot fit + he 


da 
1 
In =, we 


d nm 
interchange differentiation oo and finite summation > 
s i=0 


We see that if a function 2 is the finite sum of n other differentiable functions, then 7 is 
also differentiable and its derivative is simply the sum of those n functions’ derivatives. 
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Now, suppose instead we define a new function 7 by 


j= fi=fotfithet.... 
i=0 
Q1. Is j also differentiable? 
Q2. Is it the case that 


1 Amp 2yd,_y 
ater) ia? eri er a a 


In particular, is 2 legitimate? That is, can we 


d co 
interchange differentiation ae and infinite summation \)? 
a i=0 


Is this interchange operation legitimate? 


It turns out that the answer to Q1 and Q2 is, 


Yes, but only if certain technical conditions are met. 


In particular, if certain technical conditions are met, then, as was done in =, we can 


d co 
interchange differentiation ay and infinite summation a 
we i=0 


Fortunately, these “certain technical conditions” are beyond the scope of H2 Maths and 
we shall not discuss them in this textbook. Based on past-year A-Level exams, it would 
seem that your A-Level examiners simply assume that the interchange operation is always 
legitimate—so, at least for H2 Maths, there is probably little danger if you also do likewise. 


In Ch. 101.5, we'll revisit the questions just asked, but in the special case where 7 is an 
“infinite polynomial function”. 


Exercise 348. XXX (Answer on p. 905.) 
A348. 
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92. Parametric Equations 


We revisit Example 666 (from Part I): 


(... Example continued from the previous page.) 


One nice interpretation is that our parametric equations describe the motion of some 
particle P in the plane as time ¢ progresses. P’s rightward and upward displace- 
ments (metres) away from the origin are given by x and y. 


As time progresses, P moves anti-clockwise along a unit circle centred on the origin O. 
At any instant of time t, P’s position is (cost, sint). 
¢ At (the instant of) time t = 0, P is at (x,y) = (cos0,sin0) = (1,0). Thus, P’s starting 
position is 1m to the right of O. 
At time t = 1, P is at (2,y) = (cos1,sin1) » (0.54,0.84). Thus, after 1s, P is 0.54m 
right and 0.84m above O. 
At time t = 5n/4, P is at (x,y) = (cos(5m/4),sin(5n/4)) = (-V2/2,-V2/2) » 
(-0.71,-0.71). Thus, after 57/48, P is V2/2m left of and V2/2m below O. 


Every 27ts, P travels one full circle and returns to its starting position (1,0). 
Under this interpretation, we can also calculate the particle’s velocity at each instant in 
time. We will decompose the particle’s velocity into the z- and y-components. 


Its velocity in the x-direction is the rate of change of displacement in the x-direction with 
respect to time t, i.e. the (first) derivative of x w.r.t. t: 


Ug = — sint 
bdes 
And its velocity in the y-direction is the rate of change of displacement in the y-direction 
with respect to time t, i.e. the (first) derivative of y w.r.t. t: 
dy 


—=cost. 
dt 


Vy = 


Altogether, (vz, vy) = (—sint, cost). And so, 
e At time ¢ = 0, the particle P has velocity (vz, vy) = (—sin0,cos0) = (0, 1)—it is moving 
upwards at lms! (and not moving rightwards at all). 


e At time t = 1, P has velocity (vz, vy) = (—sin1,cos1) » (—0.84,0.54)—it is moving 
leftwards at 0.84ms! and upwards at 0.54ms". 


¢ At time t = 57/4, P has velocity 


5 pe Od), 


*)-(4.-2) 


5 
(7,0) = (-sin il cos — 
4 4 


At time t = 57/4, P is moving rightwards at V2/2ms"! and downwards at V2/2ms"!. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


We can similarly calculate the particle’s acceleration at each instant in time. We will 
decompose the particle’s acceleration into the x- and y-components. 


Its acceleration in the x-direction is the rate of change of velocity in the x-direction with 
respect to time ¢ or, in other words, the second derivative of x w.r.t. t: 





And its acceleration in the y-direction is the rate of change of velocity in the y-direction 
with respect to time t or, in other words, the second derivative of y w.r.t. t: 





Altogether, (a,,a,) = (-—cost,-sint). And so, 


e At time ¢ =0, the particle P has acceleration (a,,a,) = (-cos0,-sin0) = (-1,0)—it is 
accelerating leftwards at 1ms~? (and not upwards at all). 
At time ¢ = 1, P has acceleration (az,a,) = (—cosl,-sin1) » (-0.54, -0.84)—it is 
accelerating leftwards at 0.54ms 7 and downwards at 0.84ms’. Note that at t = 1, 
we have v, = 0.54 > 0 but a, = -0.84 < 0—this means P is still moving upwards, but 
this upwards movement is slowing down. 


At time t = 57/4, P has acceleration 


(az, dy) = (—cos (57/4) ,- sin (571/4)) = (V2/2, v2/2) ~ (0.71,0.71). 


At time t = 57/4, P is moving rightwards at V2/2ms~! and also upwards at V2/2ms"!. 
Note that at t = 57/4, we have v, = —V/2/2 < 0 but a, = V2/2 > 0—this means P is still 
moving downwards, but this downwards movement is slowing down.*” 


Exercise 349. Particle Q travels on the same plane as P (from the above example). The 
position of Q is described by {(x,y): a =sint, y = cost,t > 0}. (Answer on p. 1828.) 
(a) What is Q’s starting position (i.e. at t = 0)? 

(b) Is Q travelling clockwise or anticlockwise? 

(c) Where will P and Q be at t = 6657? 

(d) At what times t are P and Q at the exact same position? 





(e) What are Q’s velocity and acceleration in the z- and y-directions at each instant t? 





“20\We will revisit this example when we study vectors in Part III. There, we will show that P’s direction of 
movement is always tangent to the circle and its direction of acceleration is always towards the centre. 
Moreover, the magnitudes of the particle’s (overall) velocity and acceleration are always constant. 
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We revisit Example 667 (from Part I): 
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We revisit 


Exercise 351. The set {(2,y): x = tant, y = sect,t > 0} describes particle B’s position in 
the plane. (Answer on p. 1830.) 


(a) Rewrite the equations x = tant and y = sect into a single equation that does not 
contain the parameter t. 

(b) What is particle B’s starting position (i.e. at t = 0)? 

(c) Compute da/dt. And hence, conclude that the particle B always moves 


(d) Describe qualitatively what happens during each of these time intervals: 


™ 3 on OT 


(i) te|0,<). (ii) re(2.—). (iii) re(F). 


(e) Marked below are the positions of the particle B at six different instants in time 
t=0, 1, 2, 3, 4, and 5. However, we do not know which position corresponds to which 
instant in time. Without using a calculator, match each of the six positions to 
the corresponding instant in time. (Hint: 0.57 x 1.57, m * 3.14, 1.5m » 4.71, and 
27 = 6.28.) 


Voy): = tant, y = sect, > 0} 





Arrows indicate 
instantaneous 
direction of travel. 
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92.1. Parametric Differentiation 


Nec 


Theorem 34 (informal). (Parametric Differentiation Rule 


dy dy , dx 
dx dt dt 





(provided = 0) 





“Proof.” Simply use the Chain Rule (CR) and the Inverse Function Theorem (IFT): 


dy CR dy dt IET dy : dx 
dx dtdx dt dt 




















Example 1151. Let x=? and y =t®. Find = 
x 


Method 1. First express y in terms of x: y = 21”. 


d 
Then compute SF = 1.2992. 
aa, 


d d 
Method 2. Compute = = 6t? and = = 5t*. By the Parametric Differentiation Rule, 


eAGt ) = letea le = 


d 
Example 1152. Let x =sint and y = 2t. Find = 
x 


Method 1. First express y in terms of x: y = 2sin' x. 
d 2 

Then compute ee 

dx 1-2? 


dy 


(for + # +1). 


d 
Fras 2 and = =cost. By the Parametric Differentiation Rule, 


Method 2. Compute 


dy2dy dr 2 
dx dt dt cost 





(for cost #0). 


Note that = and Z are consistent with each other because 


V1l-272 =V1-sin?t = Vcost = cost.*?? 





In the last two examples, Method 2 (parametric differentiation) was not obviously superior 
to Method 1. In the following example, it is, because Method 1 can’t even be used: 





422 And also, 7 #+1 <=> cost #0. 
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d 
Example 1153. Let x =?? +t and y =t®-t. Find a 
q 


Here it is difficult (or even impossible) to express y in terms of x. So, we can’t use Method 
1. Instead, we must use 


d d 
Method 2. Compute = = 6° - 1 and = = 5t*-1. By the Parametric Differentiation 
Rule, 
dgudg de 2G 1. 


4 
Ap ka ae (for 5¢*-1+#0). 


d 
So, for example, at t = 0, we have od iat 
da !t=0 





d 
Exercise 352. Let x = cost +?¢? and y=e'-¢?. Find a (Answer on p. 1900.) 
2 
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93. The Second and Higher Derivatives 


Example 1154. Define f :R—> R by f (x) = 2°. 
The (first) derivative of f is the function f’:R —> R defined by f' (x) = 32’. 
The (first) derivative of f at -1 is the number f’ (-1) =3(-1) =3. 


Figure to be 
inserted here. 


The second derivative of f is the function f” :IR > R defined by f” (x) = 6. 
The second derivative of f at —1 is the number f” (-1) =6(-1) =-6. 
Observe that f” is itself 

¢ the (first) derivative of f’; and 


e a function. 





The second derivative is simply the derivative of the (first) derivative. Importantly, just 
like the (first) derivative, the second derivative is itself a function. 


Formal definitions concerning the second derivative: 
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Definition 207. Let D be a set of real numbers and a ¢ D. Suppose f: D> Risa 
differentiable function whose derivative is f’: D > R. Consider this limit: 


anf O-f@), 


ta x-a 


If the above limit exists (i.e. is equal to a real number), then we say that f is twice 
differentiable at a, call this limit the second derivative of f at a, and denote it by f” (a), 
z ay dee 
1a), a (os OF aaa 


dx? dx} __ 
rv=a 


If not, then we say that f is not twice differentiable at a. 


ee) 





The expression is called the difference quotient (of f' at a). 


If f is twice differentiable at every point in a set S, then we say that f is twice differentiable 
Ono. 
Suppose T is the set of points on which f is twice differentiable. Then the second 


ae 
derivative of f is the function denoted f”, f, or —> 


da?’ with domain T’, codomain R, 
2 


and mapping rule 
! meee 
cent’ = F(a) 
ra x-a 


If T = D (i.e. f is twice differentiable on its domain), then we call f a twice-differentiable 
function. 





For comparison, we now reproduce Definitions 203, 204, and 205 (concerning the first 
derivative), which are very similar to Definition 207 (second derivative): 


Definition 203. Let f be a nice function with domain D and aé D. Consider this limit: 


san fot =F (a) 


ra Ip = 





If the above limit exists (i.e. is equal to a real number), then we say that f is differentiable 
at a, call this limit the derivative of f at a, and denote it by f’ (a). 


If not, then we say that f is not differentiable at a. 
f (x) - f(a) 
xL-a 





the difference quotient (of f at a). 


We call the expression 
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Definition 204. Suppose the function f is differentiable at every point in a set S. Then 
we say that f is differentiable on S. 


And if f is differentiable at every point in its domain, then we say that f is a differentiable 
function. 


Definition 205. Let f be a nice function. Let S be the set of all points at which f is 
differentiable. The derivative of f is the function f’:.S > R defined by 


F(@)- F(a) 





f' (x) = lim 


wa 





Again, we have the notation of Lagrange, Newton, and Leibniz: 


e The second derivative of f is a function and may be denoted 


: d2f 
" Md 
f or f or dx? 
Lagrange) Newton Leibniz 


e The second derivative of f at a point a is a number and may be denoted 


ss d?f d? f 
I oe ae, es 
hi (a) or f (a) or dx? (a or dr2 _ 
Lagrange Newton Leibniz 


d 
Under Leibniz’s notation, the differentiation operator is denoted qa while the repeated 
x 
2 


d 
application of this operator is denoted res Thus, the second derivative of f is denoted*”* 
0 
d2f df? 
— and not —-~. 
dx? dx? 





2 
“°3Given our notation for the second derivative, the expression qd is confusing and should be avoided. 
- 


However, if used, it would denote the derivative of the composite function f? = fo f with respect to the 
variable 2. 
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Example 1155. Define g:R > R by g(z) = x? + 2z. 


The (first) derivative of g is the function with domain R, codomain R, and mapping rule 
Tete Gy re A 


d 
as and write its mapping rule as 


We may denote this function by g’, g, or 7 
x 


g! (a) = (2) = 2 (aw) = 30? +2 


d 
Note: In the case of Leibniz’s notation, we may write — (a) + 3x? +2. But as explained 
ih 


d 
earlier, because the dummy variable x is already specified in oe the “(x)” is actually 
ag 


d 
superfluous. So, + could be written more simply as (and is entirely equivalent to) oe 


ay 
oa eo, 


The (first) derivative of g at -1 is the number 5—we may write 


g/(-1)=5 g(-1) =5 


Figure to be 
inserted here. 


The second derivative of g is the function with domain R, codomain R, and mapping rule 
Ti On, 
2 


te aes and write its mapping rule as 


g” («) = g(x) = <8 § (x) = 


We may denote this function by g”, g, or 


- oS - 


The second derivative of g at -1 is the number —6—we may write 


gi" (-1)=- 





A twice-differentiable function must also be differentiable. However, the converse need not 
be true. That is, a differentiable function need not be twice differentiable: 
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Example 1156. Define h: Ri > R by h(a) = x°/? 
The (first) derivative of h is the function with domain Rj, codomain R, and mapping 


3 
rule te 5a. 


; dh 
We may denote this function by h’, h, or an? and write its mapping rule as 
x 


hi (a) = h(a) =F (@) =F = Sal, 


The (first) derivative of h at 4 is the number 3—we may write 
dh 
(Ales h(4) =3 — (4) =3 
(4) (4) — (4) 


Since h is differentiable at every point in its domain Rj, it is a differentiable function. 


Figure to be 
inserted here. 


The second derivative of h is the function with domain IR*, codomain R, and mapping 


. 
rule te go. 


2 


We may denote this function by h”, h, or ie B 


hi" (x) = A(n)=< 5 (*) 


and write its mapping rule as 


oh. eye 
=a 2, 


The second derivative of h at 4 is the number 3/8—we may write 


eh 
dx? 


ni (4) = 5 h(a)== (4) = 


Since h is not twice differentiable at every point in its domain Rp (in particular, it is not 
twice differentiable at 0), it is not a twice-differentiable function. Hence, h is an example 
of a function that is differentiable but not twice differentiable. 
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Exercise 353. Find each function’s first and second derivatives. State if each function is 
differentiable and/or twice differentiable. Evaluate each function’s derivative and second 
derivative at 1. (Answer on p. 1901.) 


(a) f: R-R defined by f (x) =e”. 
(b) g:[1,00) > R defined by g(x) = Vz? -1. 
(c) h: R= R defined by h(x) = sin (x? +1)+2. (Tedious and requires care.) 





Exercise 354. Explain whether each statement is true or false. (As usual, one way to 
show that a statement is false is to provide a counterexample. ) (Answer on p. 1902.) 


(a) “f is twice differentiable — f is differentiable.” 


(b) “If f is differentiable and its derivative f’ is also differentiable, then f is twice dif- 
ferentiable.” 


(c) “If f is not differentiable at a, then f cannot be twice differentiable at a.” 


Exercise 355. Suppose f : D > R is a differentiable function. Then what can we say 
about the domains of f’ and f”? (Answer on p. 1902.) 
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93.1. Higher Derivatives 


The third derivative is the derivative of the second derivative, the fourth is that of the 
third, etc. And in general, the nth derivative is the derivative of the (n - 1)th derivative. 


Again, we have the notation of Lagrange, Newton, and Leibniz: 


e« The third derivative of f is a function and may be denoted 


- d3 f 
Wr 
2 or f or dx3 
Lagrange Newton Leibniz) 


(Again, we stress that every derivative is a function.) 


e« The third derivative of f at a point a is a number and may be denoted 





a d3 f d3 f 
m ad. ait 
rg FO ge Ga@or sal 
Lagrange Newton Leibniz 


With higher derivatives, Lagrange’s prime and Newton’s dot notation will start getting 
messy. And so, for n > 4, 


e The nth derivative of f is a function and may be denoted 





n d"f 
(n) 
if or f or dr” 
Lagrange (Newton Leibniz 


e The nth derivative of f at a point a is a number and may be denoted 








‘ de ay 

(n) 

FO) ge FO ge Ger Or a] 
Lagrange Newton Leibniz 
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Example 1157. The function f : R > R defined by f (x) = 2° is (at least) seven-times 
differentiable. Its first seven derivatives have domain R, codomain R, and these mapping 
rules: 


Pe-f@ -E@-G| = 
F(a) = Flo) = Fa @)= Sa] =202% 
f(a) = Fe) = 2 (=F) = 602", 
a = 1202, 


w=a 


5 d5f ay 
f(a) =f(e) = 75 @)= 75] = 120. 


w=a 


f (x) = f(x) = TS (a ie 


ore ae eer 
FO (x) = fe) = SE) = 


w=a 


Lye el) er 


da? 


w=a 


Oe oe = 


Clearly, for any integer n >6, f is n-times differentiable and its nth derivative is defined 
by 


DR ce oa 
(n) = = = = 0. 
FO (2) = #2) =F) = Sem] 
As we'll learn in the next subchapter, f is an example of an “infinitely differentiable” or 
smooth function. 
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Example 1158. The sine function sin is (at least) six-times differentiable—its first six 
derivatives have domain R, codomain R, and these mapping rules: 


sin’ = sin 


sin) = sin = 


5 
sin) = sin = 


6 


sin) = sin = —sin. 


As you can probably tell, sin is actually “infinitely differentiable” or smooth. We'll 
continue with this example in the next subchapter. 





Formal definitions concerning the nth derivative (once again, these seem long and intimi- 
dating, but are in fact just a mere formalisation of common sense): 
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Definition 208. Let D be a set of real numbers, a € D, and n > 3 be an integer. 
Suppose f : D > R is a (n-1)-times differentiable function whose (n - 1)th derivative is 
fos). p= 


Consider this limit: 


eae) 


w—-a 


lim 


If the above limit exists (i.e. is equal to a real number), then we say that f is n-times 
differentiable at a, call this limit the nth derivative of f at a, and denote it by f™ (a) 


(or fj" (a) tor a= 3), Fen (or f (a) for n = 3), — (a), or 


dey 
dz” 


v=, 
If not, then we say that f is not n-times differentiable at a. 
fim C9 ev dm, 
xL-a 
If f is n-times differentiable at every point in a set S, then we say that f is n-times 
differentiable on S. 


Suppose T is the set of points on which f is n-times differentiable. Then the nth derivative 
nm 


of f is the function denoted f) (or f” for n = 3), f (or f for n = 3), or — with 
gn 


The expression is called the difference quotient (of f ara i: 





domain 7’, codomain R, and mapping rule 


io ee aay) 


ta x-a 





If T = D (ie. f is n-times differentiable on its domain), then we call f a n-times- 
differentiable function. 


Exercise 356. Let g: R > R be defined by g(x) = 24-2? +27-x+1. Write down all 
of its derivatives. Evaluate each of these derivatives at 1. Write all of your answers in 
Lagrange’s, Newton’s, and Leibniz’s notation. (Answer on p. 1903.) 
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93.2. Smooth (or “Infinitely Differentiable”) Functions 


A smooth function is simply one that’s “infinitely differentiable”: 


Definition 209. Let f be a function. If for every n € Z*, f is n-times differentiable at 
a, then we say that f is smooth at a. 


We say that a function is smooth on a set S if it is smooth at every point in S. 
4 





If a function is smooth on its domain, then we call it a smooth function.” 


Most functions you'll encounter in H2 Maths are smooth. This includes, for example, all 
polynomial functions: 


Example 1159. Consider the function f :R > R defined by f (x) = x°. 


In Example 1157 (previous subchapter), we showed that for every positive integer n, f is 
n-times differentiable. Hence, f is a smooth (or “infinitely differentiable”) function. 


Example 1160. Consider the function g:R > R defined by g(x) = + - a2 +.2?-a +1. 


In Exercise 356 (previous subchapter), we showed that for every positive integer n, g is 
n-times differentiable. Hence, g is a smooth (or “infinitely differentiable”) function. 


Exercise 357. Define h: R > R by h(x) = 2°-24+2°-x?+2-1. Find all the derivatives 
of h. Is h smooth? (Answer on p. 1903.) 





As the above examples and exercise suggest, every polynomial function is smooth; moreover, 
if a polynomial function is of order k (i.e. the highest power is k), then its (k + 1)th and 
subsequent derivatives are zero functions.*”° 


The exponential function is also smooth: 
Example 1161. Consider the exponential function exp. Its (first) derivative is itself: 
exp’ = exp. 
So, the second derivative of exp is also itself: 
exp” = exp’ = exp. 


And for any positive integer n, the nth derivative of exp is again itself: 


exp = exp("") =... exp” = exp’ = exp. 


We’ve just shown that for every positive integer n, the nth derivative of exp is exp itself 
and that exp is n-times differentiable. Hence, g is smooth. 





The sine function is also smooth: 





424\fore advanced texts may also say that a smooth function is of class C®. 
“>This assertion is formally stated as Fact 295 (Appendices). 
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Example 1162. Consider the sine function sin. Here are its first four derivatives: 


sin’ = cos, 
sin” = —sin, 
sin’” = —cos, 


sin) = sin. 


The cycle then repeats. Specifically, for each k = 0,1,2,3,..., we have 


sin“ +4) = cos, 


sin?*4*) — — sin, 
sin”) = = cos, 


sin(4*) = sin. 


So, for example, the 8603rd derivative of sin is 


gin (8 603) — gin (8+4x2 150) — _ cog. 
We’ve just shown that sin is n-times differentiable for every positive integer n. Hence, 


exp is smooth. 





In Ch. 88.4, we made these imprecise claims:*”° 


“Most” elementary functions are differentiable. 
Moreover, their derivatives are themselves elementary. 


From the above imprecise claims, we have this imprecise corollary: 


“Most” elementary functions are smooth. 





“6For some justification of these imprecise claims, see Ch. 146.10 (Appendices). 
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93.3. Confusing Notation 


We reproduce from Ch. 22 (Composite Functions): 


Definition 77. Let f be a function. The symbol f? denotes the composite function fo f. 





For any integer n > 3, the symbol f” denotes the composite function fo f”"!. 


We just 4?” defined f to be the nth derivative of the function f. 
We now also define the symbol (f)”: 


Definition 210. Given the nice function f : D > Rand n€R, the function (f)": D> R 
is detimed by (7) (a) =|/(@)| . 





Hence, the three symbols f”, f™, and (f)” denote three different functions and we must 
take care to distinguish between them: 


Example 1163. Define f:R-—> R by f (x) = 2°. 
Consider the three functions f*, f, and Or 
1. The function f*: R > R is the composite function f o f 0 fo f and is defined by 
3 3\3 
F(a) = FFE OM) =F) = FF) =F ((Y))- (YY) 
= ((2*)’) = (08) = 2". 
2. The function f) :R > R is the fourth derivative of f and is defined by 
4 
f (a) = =a = x{x|x(xa*)]} = x {x[x(327)]} = x [x (6e)] = x6 = 0. 
3. The function (f i :R = R is the function f raised to the power of four and is defined 
by 


4 


f) @al @l =@) 207 


Clearly, the functions f*, f“, and Gs ye are distinct. If we evaluate each of them at 2, 
we get wildly different values: 


f4 (2) = 2% ~2.417x10%, =f (2)=0, — (f)* (2) = 2 = 8096. 





Separately, we must also take care to distinguish the following five symbols, which denote 
five different functions: 


uy aw 
dx)” 











an ; 
dx”’ dx ’ da”’ da” 





“7Definition 208. 
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Example 1164. Define f :R-—> R by f (x) = 2’. 


ai ef dy df 2dr 
day da da a2 





Below we consider the five functions ( 


d 
But first, let’s note that the derivative of f is the function ~ : IR > R defined by 
aD 


oe es ae 


Also, the composite function f? :R > R is defined by 


f? (x) = f (Ff (2)) = f (2) = (2?)° = 2°. 


2 
1. The function (4) : IR > R is defined by 
is 


(“) = (3x2) = 9x4, 


2 


2. The function = : IR > R is the second derivative of f and is defined by 
x 


=x (x27) = = (327) = 62. 


df2 
3. The function wie :R = R is the first derivative of the composite function f? = f o f 
a 
and is defined by 
dj?) Tdi, 


gl! = Oy 
AP re 9x”. 


d 
4. Informally, the function = gives us the rate of change of f with respect to x7. By 
z 
the Chain Rule, 


af ax? af 
dx? dx dx’ 
20 32 


dy sod} ade 37" 3 
Rearranging, a = oe > one = ae = Sa 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


aye 
5. Informally, the function on gives us the rate of change of the composite function f? 
Mg 


with respect to x”. By the Chain Rule, 
df? da? dj 





dr2 drs dx 
———) ———) 


2x 9x8 


ay dye dae 0a = 97 


Rearranging, dxv2 dr dr 2x 2 








2 9 9 2 
Clearly, the functions (<4) ; = - 7 and ae are distinct. 


If we evaluate each of them at 2, we get wildly different values: 


=6-2=12, 


I= 2 


= 9.28 = 2304, 


Exercise 359. In Lagrange’s notation, why do we denote the nth derivative of f with 
parentheses? That is, why do we denote the nth derivative of f by f rather than more 


simply f”? (Answer on p. 1904.) 


Exercise 360. Define h : R > R by h(x) = 27 +1. Find each of the following eight 
functions and evaluate each at 1: (Answer on p. 1904.) 


(a) (b) (c) (a) (s) 


d*h dh? lie 


os © (h) 





da? 
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94. The Increasing/Decreasing Test 


In Ch. 19, we asked, “When is a function increasing or decreasing?” We now revisit this 
question. 


We reproduce (from Ch. 19) this definition: 


Definition 73. Let f be a nice function with domain D. Suppose S ¢ D. Then f is 


(a) Increasing on S if for any a,be€ S with a< b, we have f (a) < f(b). 
(b) Strictly increasing PCN aae 
(c) Decreasing j (a) > 7 (0). 
(d) Strictly decreasing Cae alae 


If f is increasing on D, then f is an increasing function. 
If f is strictly increasing on D, then f is a strictly increasing function. 
If f is decreasing on D, then f is a decreasing function. 


If f is strictly decreasing on D, then f is a strictly decreasing function. 





The problem of finding a function’s derivative is the problem of finding the function’s gra- 
dient. And so, not surprisingly, the derivative is closely related to whether a function 
is increasing or decreasing. We have, in particular, the intuitively “obvious” Increas- 
ing/Decreasing Test (IDT): 


Fact 208. (Increasing/Decreasing Test, IDT) Let f be a function and D be an 
interval. Suppose f is differentiable on D. 


(a) f'(z)20 for allae D — f is increasing on D. 


(b) f’(#)>0 for all xe D — 
(c) 7 @) <0 forall ee D — Hels decreasing on D. 
(d) f’(#)<0 for allxeD — 


f is strictly increasing on D. 


f is strictly decreasing on D. 








Proof. See p. 1686 (Appendices). 
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Example 1165. Define f :R—> R by f (x) = 2”. 
The derivative of f is the function f’: IR > R defined by f’ (x) = 2z. 


Figure to be 
inserted here. 


(a) f’(x) 20 for all x [0, 00) f is increasing on [0, 00). 
(b) f’(x) >0 for all x € (0, «) f is strictly increasing on (0, co). 
(c) f’(x) <0 for all x € (-c0,0] f is decreasing on (-00, 0]. 


(d) f’(2) <0 for all x € (-00,0) f is strictly decreasing on (—0o,0). 


Example 1166. Define i: R > R by i (az) = 1. 
The derivative of 7 is the function 7’: IR > R defined by 7’ (x) = 0. 


Figure to be 
inserted here. 


(a) (x) 2>0foralxaeR <= iis increasing on R. 





(c) # (2) <OforallaeR iis decreasing on R. 
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Example 1167. Define j: RR > R by j (2) = 2° - 2x7 +2. 


Figure to be 
inserted here. 


The derivative of j is the function j’:IR > R defined by j’ (x) = 372 ~dr +1. We have 
j (a) =0 3a? - 4a +1 = (3a-1)(a-1)=0 


So, we can draw this sign diagram for 7’: 


Figure to be 
inserted here. 





1 
(a) j’ (x) = 0 for all xe [t 66:) j is increasing on (-ce, ;| i/o). 
7 is strictly increasing on (2°, 5) id) 00). 


E 
(b) 7’ (x) > 0 for all x € (- 
(c) 7’(x) <0 for all xe Fe 


1 il 
~ } isd —, 1). 
3 j is decreasing on E 


1 
(d) 7’ (x) <0 for all xe (=. 1 j is strictly decreasing on (= 1) 


The following example shows that the converse of IDT(b) is false—that is, we cannot 
replace the => with <>. 
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Example 1168. Define g:R—> R by g(z) = 2°. 
The derivative of g is the function g’ : R > R defined by g’ (x) = 32”. 


Figure to be 
inserted here. 


Observe that g is strictly increasing on R because if a < b, then g(a) = a® < b? = g(b). 


However, g’ is not strictly positive on R. In particular, g’ is not strictly positive at 0: 
7 (0)=3-0 =U 20; 


This shows that the converse of IDT(b) is false. (Similarly, the converse of IDT(d) is 
false, as Exercise 362 will show.) 


Exercise 361. For each function, find its derivative and draw the sign diagram for the 
derivative. Explain how your findings are consistent with the Increasing/Decreasing Test 
(IDT). (Answer on p. 1906.) 
(a) f: R-R defined by f (x) =e” -z. 

(b) g: [0,27] > R defined by g(x) =sinz. 


Exercise 362. Define i: R > R by i(x) = -2°. (Answer on p. 1907.) 


(a) Find the derivative of 7. 


(b) Identify the intervals on which 7 is increasing, decreasing, strictly increasing, and/or 
strictly decreasing. 


(c) Fill in the blanks: The function i shows that the of is false. 
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95. Determining the Nature of a Stationary Point 


We reproduce (from Ch. ??) these informal definitions of the eight types of extrema (i.e. 


maximum and minimum points):*”° 

(a) A global maximum is at least as high as any other point. 
(b) The strict global maximum is higher than _—_ any other point. 
(c) A local maximum is at least as high as any “nearby” point. 
(d) A strict local maximum is higher than —_ any “nearby” point. 
(e) A global minimum is at least as low as any other point. 

(f) The strict global minimum is lower than any other point. 
(g) A local minimum is at least as low as any “nearby” point. 
(h) A strict local minimum is lower than any “nearby” point. 


We reproduce (from Ch. 43.8) these formal definitions of stationary and turning points: 


Definition 115. A point x is a stationary point of a function f if f’ (x) = 0. 


Definition 58. A turning point is a point that’s both a stationary point and a strict 





local maximum or minimum. 


Quick examples to remind you of how these work: 





“8For the formal definitions of extrema, see Definition 277 (Appendices). 
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Example 1169. Define f :R—> R by f (x) = 2”. 
The derivative of f is the function f’: IR > R defined by f’ (x) = 27. We have 


7 (a) =2a => 2a =0 a a= 0) 


So, f has exactly one stationary point, 0. 


Figure to be 
inserted here. 


The point 0 is also a strict local minimum of f, because it is lower than any “nearby” 
point: f (0) < f(x) for all x “near” 0. 


(Indeed, 0 is actually also a strict global minimum of f, because it is lower than any 
point. (0) <7 @) tor alla 2.0.) 


Since 0 is both a stationary point and a strict extremum, by Definition 58, it is also a 
turning point of f. 


Does f have any global maxima or strict global maximum?*”? 








429No 
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Example 1170. Define g:R > R by g(x) = 2° - 227 + 2. 
The derivative of g is the function g’ : R > R defined by g’ (x) = 3x7 - 4 + 1. We have 


ik 
g' (a) =0 3a? - 4a +1 = (3a-1)(a-1)=0 a= ora=1. 


1 
So, g has two stationary points, namely 3 and 1. 


Figure to be 
inserted here. 


1 
From the graph, we can informally tell that — is a strict local maximum of g, while 1 is a 
strict local minimum of g. (In the following subchapters, we’ll learn about two tests that 
will formally confirm these informal observations. ) 
dl 
Each of = and 1 is a turning point of g, because each is both a stationary point and a 
strict extremum of g. 


Does g have any global maxima, global minima, strict global maximum, or strict global 
minimum?*°? 








430No 
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Example 1171. Define h: R > R by h(x) = 2°. 
The derivative of h is the function h’:R > R defined by h’ (x) = 3x”. We have 


(a) =10 => —= a=), 


So, h has exactly one stationary point, 0. 


Figure to be 
inserted here. 


Observe that h (0) = 0. However, h(x) <0 for all « <0, while h(x) > 0 for all x > 0. 
Thus, 0 is neither a minimum nor a maximum of h. Indeed, h has no extrema. 


The sole stationary point, 0, is not a strict local extremum. Hence, fA has no turning 
points. 


Example 1172. Define i: R > R by i (az) =1. 
The derivative of 7 is the function 7’: IR > R defined by 7’ (x) = 0. 


For every a € R, we have i’ (a) = 0. So, every point a€ R is a stationary point of 7. 


Figure to be 
inserted here. 


Note also that every point a € R is a local maximum, local minimum, global maximum, 
and global minimum of 7. (Why?)*?! 


However, no point is a strict local maximum or strict local minimum of 7. Hence, 7 has 
no turning points. 


























*81Pick any a€ R. We have i(x) <i(a) <i(z) for all z¢€R. Hence, a is a local maximum, local minimum, 
global maximum, and global minimum of 7. 
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95.1. The First Derivative Test for Extrema (FDTE) 


The First Derivative Test for Extrema (FDTE) is one way to check if a stationary 
point (or indeed any point) is an extremum: 


Proposition 9 (informal). (First Derivative Test for Extrema, FDTE) Let f be a 
continuous function and a be a point. 


(a) If f’>0 ona’s “immediate left” and f’ < 0 on a’s “immediate right”, then a is a local 
maximum of f. 


(b) If f’ > 0 on a’s “immediate left” and f’ < 0 on a’s “immediate right”, then a is a 


strict local maximum of f. 


(c) If f’< 0 on a’s “immediate left” and f’ > 0 on a’s “immediate right”, then a is a local 
maximum of f. 


(d) If f’ < 0 on a’s “immediate left” and f’ > 0 on a’s “immediate right”, then a is a 
strict local maximum of f. 





“Proof”. (a) If f’>0 on a’s left, then by the IDT, f is increasing on a’s left. 
Similarly, if f’ <0 on a’s right, then by the IDT, f is decreasing on a’s right. 


So, “obviously”, f attains a global maximum at a. 


Figure to be 
inserted here. 








The proofs of (b)—(d) are similar. 
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Example 1173. Define f :R—> R by f (x) = 2”. 


Figure to be 
inserted here. 


The derivative of f is the function f’: IR > R defined by f’ (x) = 22. 
The sole stationary point of f is 0. 
At all points to the left of 0, f’ <0. And at all points to the right of 0, f’> 0. 


Figure to be 
inserted here. 


So, by FDTE(d), 0 is a strict local minimum of f. (And since 0 is both a stationary 
point and a strict extremum, it is also a turning point of f.) 
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Example 1174. Define g:R—> R by g(x) = 2° - 22? + 2. 


Figure to be 
inserted here. 


The derivative of g is the function g’: R > R defined by g(x) = 327 — 4x +1. 


1 
The two stationary points of g are 3 and 1. 


1 1 
At all points to the left of 3 and right of 1, g’ > 0. And at all points between 3 and 1, 
Gg <0, 56, 


1 1 
« By FDTE(b), 3 is a strict local maximum of f. (And since 3 is both a stationary 


point and a strict extremum, it is also a turning point of f.) 


¢« By FDTE(d), 1 isa strict local minimum of f. (And since 1 is both a stationary point 
and a strict extremum, it is also a turning point of f.) 





In the next example, the FDTE can tell us nothing whatsoever. In such cases, we say that 
the FDTE is inconclusive. 
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Example 1175. Define h: R > R by h(x) = 2°. 


Figure to be 
inserted here. 


The derivative of h is the function h’: IR > R defined by h’ (x) = 32”. 
The sole stationary point of h is 0. 


Figure to be 
inserted here. 


At all points to the left of 0, h’ > 0. And at all points to the right of 0, we also have 
h'>0. 

So, the FDTE can tell us nothing whatsoever. In cases such as this, we say that the 
FDTE is inconclusive. 


Since the FDTE is inconclusive, we must determine the nature of the stationary point 0 
using other methods. 


One such method is to simply observe that at 0, we have h(0) =0. But at all points to 
the left of 0, we have h < 0; while at all points to the right of 0, we have h > 0. Hence, 0 
is neither a minimum nor a maximum of h. 


And since 0 is a stationary point but not a strict local extremum, it is not a turning point 
of h. (And h has no turning points.) 
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Example 1176. Define i: R > R by i(z) = 1. 


Figure to be 
inserted here. 


The derivative of 7 is the function 2’: IR > R defined by 7’ (x) = 0. 


For each real number a, we have 7’(a) = 0. So, every point a € R is a stationary point of 
is 


Figure to be 
inserted here. 


At each a € R, we have 7’ = 0 everywhere on its left and also everywhere on its right. 


So, at each a € R, i’ < 0 everywhere on its left and 2’ > 0 everywhere on its left. Hence, 
by FDTE(a), each a€ R is a local maximum of f. 


Also, at each a € R, 7’ > 0 everywhere on its left and 2’ < 0 everywhere on its left. Hence, 
by FDTE(c), each ae R is a local minimum of f. 


Altogether, every a € R is both a local maximum and a local minimum of f. 


However, each a € R is neither a strict local maximum nor a strict local minimum. This 
is because for any a € R, we can always find some “nearby” x for which f (x) = f (a). 


So, although every aé R is a stationary point of 2, 2 has no strict local extrema. Hence, 
2 has no turning points. 


Exercise 363. For each function, (i) find all stationary points; (ii) determine whether 
each of these stationary points is a local maximum, local minimum, strict local maximum, 
or strict local minimum; (iii) hence write down all turning points. (Answer on p. 1908.) 


(a) 7 RoR Wemed by f(a) —=—6 Sar = 
(b) g:[0,27] > R defined by g(x) =sinz. 
(c) h:R->R_ defined by h(x) = er. 
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95.2. A Situation Where a Stationary Point Is Also a Strict Global 
Extremum 


As we’ve seen above, it is not generally true that a stationary point is an extremum. 
But fortunately, we have the following result, which says this: 


If on a closed interval, a function has only one stationary point and the value of the function 
at that stationary point is greater (or smaller) than at the interval’s two endpoints, then 
that stationary point is a strict global maximum (or strict global minimum). 


A bit more precisely, 


Proposition 10. Let f : [a,b] > R be a differentiable function and c€ (a,b). Suppose c 
is the only stationary point of f in (a,b). 


(a) If f (c) ts greater than f (a) and f(b), then c is the strict global maximum of f. 
(b) If f (c) is smaller than f (a) and f (b), then c is the strict global minimum of f. 








Proof. See p. 1690 (Appendices). 











Here’s an informal argument for why Proposition 10(a) might be true: 


Since f (c) > f (a), f must be increasing on (a,c) (otherwise we’d have another stationary 
point somewhere in (a,c)). 


Similarly, since f (c) > f(b), f must be decreasing on (c,b) (otherwise we’d have another 
stationary point somewhere in (c,b)). 


Thus, c must also be the strict global maximum of f. 


Figure to be 


inserted here. 





Example 1177. XXX 


Example 1178. XXX 


Exercise 364. XXX (Answer on p. 941.) 
A364. 
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95.3. Fermat’s Theorem on Extrema 
Loosely, the First Derivative Test for Extrema (FDTE) says that 
If f’ changes signs at c, then c is an extremum of f. 


Fermat’s Theorem on Extrema is a partial converse to the FDTE: 


Theorem 35. (Fermat’s Theorem on Extrema) Let f be a function that is differen- 
tiable on (a,b). Suppose cé (a,b). 


If c is an extremum of f, then f’ (c) =0. 














Proof. See p. 1683 (Appendices). 





Example 1179. XXX 


Example 1180. XXX 


Here’s an example where Fermat’s Theorem seemingly fails: 


Example 1181. Define f : [2,3] > R by f (2) =z. 
The derivative of f is the function f’ : [2,3] > R defined by f’ (x) =1. 


Figure to be 
inserted here. 


Then 2 is an extremum of f. (Indeed, 2 is a strict local and global minimum of f.) 
However, 2 is not a stationary point of f because f’ (2) = 1 #0—this seems to contradict 
Fermat’s Theorem. 


Similarly, 3 is an extremum of f. (Indeed, 3 is a strict local and global maximum of f.) 
However, 3 is not a stationary point of f because f’ (3) = 1 + 0—this seems to contradict 
Fermat’s Theorem. 


Of course, there isn’t actually anything wrong with Fermat’s Theorem. In Fermat’s The- 
orem, we have the subtle but important condition that c € (a,b)—that is, the extremum 
c should be in the interior (and not be an endpoint) of the function’s domain. 


In our present example, 2 and 3 aren’t in the interior (and are instead endpoints) of f’s 
domain. So, Fermat’s Theorem may not apply. 
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In the main text, we don’t define what exactly the term interior means.**? But if we did, 
then we could’ve written Fermat’s Theorem on Extrema more simply: 


Theorem 36. (Also Fermat’s Theorem on Extrema) Let f be a function that is 


differentiable. If c is an interior extremum of f, then f’(c) =0. 





By the way, this is why Fermat’s Theorem on Extrema is also called the Interior Ex- 
tremum Theorem. 


Example 1182. XXX 
Example 1183. XXX 


Exercise 365. XXX (Answer on p. 943.) 
A365. 





‘82For this, see Definition XXX (Appendices). 
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95.4. The Second Derivative Test for Extrema (SDTE) 


Given a twice-differentiable function, the Second Derivative Test for Extrema (SDTE) 
can help us determine whether a stationary point is a strict local maximum or minimum. 


Below, we'll formally state the SDTE as Proposition 11. But first, let’s illustrate the SDTE 
with two quick examples. It turns out that the SDTE is an immediate consequence of the 
Increasing/Decreasing Test (IDT) and the First Derivative Test for Extrema (FDTE). 


Example 1184. Define f:R— R by f (z) = 2”. 
The derivative of f is the function f’: IR > R defined by f’ (x) = 2z. 
The sole stationary point of f is 0. 


Figure to be 
inserted here. 


The second derivative of f is the function f” : R > R defined by f” (x) = 2. 


So, f” is strictly positive everywhere and in particular at 0. By the IDT(b) then, f’ is 
strictly increasing at 0. 


Since f’ (0) = 0, it must be that f’ < 0 at all points to the left of 0 and f’ > 0 at all points 
to the right of 0. By the FDTE(d) then, 0 must be a strict local minimum of f. 


This illustrates SDTE(b) (to be formally stated below), which says that if f’(c) =0 and 
f" (c) > 0, then c is a strict local minimum of f. 
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Example 1185. Define g:R > R by g(x) = 2° - 227 + x. 
The derivative of g is the function g’ : R > R defined by g’ (x) = 3x? - 4241. Since 





g(a) =0 == Ge a 1, the two stationary points of g are 5 nacelle 
The second derivative of g is the function g” : R > R defined by g” (x) = 6x - 4. 
1 1 1 1 
At ae have g”’ (=) =6- 5 ~4=-2<0. So,* g” is strictly negative “around” = Hence, 


1 
by the IDT(d), g’ must be strictly decreasing “around” 3 


i] 1 1 
Since g’ (=) = 0, it must be g’ > 0 to the left of 5 and g’ < 0 to the right of - By the 


FDTE(b) then, 3 must be a strict local maximum of g. 


This illustrates SDTE(a) (to be formally stated below), which says that if g’(c) =0 and 
g” (c) < 0, then c is a strict local maximum of g. 


Figure to be 
inserted here. 


Similarly, at 1, we have g”(1) =6-1-4=2>0. So, g” is strictly positive “around” 1. 
Hence, by the IDT(b), g’ must be strictly increasing “around” 1. 


Since g’ (1) = 0, it must be g’ < 0 to the left of 1 and g’ > 0 to the right of 1. By the 
FDTE(d) then, 1 must be a strict local minimum of g. 


This illustrates SDTE(b) (to be formally stated below), which says that if g’(c) = 0 and 
g" (c) > 0, then c is a strict local maximum of g. 


Proposition 11. (Second Derivative Test for Extrema, SDTE) Let f be a function 
that is twice differentiable at c. Suppose f'(c) =0 (i.e. c is a stationary point of f ). 


(a) If f" (c) <0, then c is a strict local maximum of f. 
(b) If f"(c) >0, then c is a strict local minimum of f. 
(c) If f" (c) =0, then the SDTE is inconclusive—more specifically, c could be any of the 
following: 
e a strict local maximum; 
e a strict local minimum; 
e an inflexion point; or 
e none of the above. 








‘83 By the continuity of g”. 
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Proof. See p. 1690 (Appendices). 





Exercise 366. For each function, (i) find all stationary points; (ii) determine whether 
each of these stationary points is a strict local maximum or strict local minimum; (iii) 
hence write down all turning points. (Answer on p. 1909.) 


(a) f:R-R defined by f (x) =2°+ aT — 8 


(b) g: (-5, =| > R defined by g(x) = tana. 





(c) h:[0,27] > R defined by h(x) =sinz +cosz. 
We now illustrate SDTE(c) with four examples: 


Example 1186. Define k:R > R by k(x) =-2"*. 
The first derivative of k is the function k’: R > R defined by k’ (x) = —4x?. Observe: 


kay = 0 — —Aa? = 0 — a= 0) 


So, the sole stationary point of k is 0. 
The second derivative of k is the function k” :IR > R defined by k” (x) = -122?. 


Figure to be 
inserted here. 


At 0, we have k”(0) = -12-0? = 0. So, by SDTE(c), the SDTE is inconclusive—the 
stationary point 0 could be 


e a strict local maximum; 
e a strict local minimum; 
e an inflexion point; or 

e none of the above. 


Here it so happens that the stationary point 0 is a strict local maximum. (For all x #0, 
k(x) <k(0) =0.) 


By the way, the above example also shows that the converse of SDTE(a) is false—O is a 
strict local maximum of k, but k’" (0) ¢ 0. 
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Example 1187. Define j: R > R by j (x) = 2°. 
The first derivative of j is the function 7’: R > R defined by j’ (x) = 42°. Observe: 


7 (oO — 4a? = 0 — a0. 


So, the sole stationary point of 7 is 0. 
The second derivative of j is the function 7”: IR > R defined by j” (x) = 122. 


Figure to be 
inserted here. 


At 0, we have j’(0) = 12-0? = 0. So, by SDTE(c), the SDTE is inconclusive—the 
stationary point 0 could be 


e a strict local maximum; 

e a strict local minimum; 

e an inflexion point; or 

e none of the above. 

Here it so happens that 0 is a strict local minimum. (For all x #0, k (7) > k (0) =0.) 


By the way, the above example shows that the converse of SDTE(b) is false—0 is a strict 
local minimum of j, but 7” (0) #0. 
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Example 1188. Define h: R > R by h(x) = 2”. 


The derivative of h is the function h’: R + R defined by h’ (x) = 3x. Since h’ (a) = 0 
<> a=Q0, the sole stationary point of h is 0. 


The second derivative of h is the function h’ : IR > R defined by h” (x) = 62. 


Figure to be 
inserted here. 


At 0, we have h” (0) = 6-0 = 0. So, by SDTE(c), the SDTE is inconclusive—the stationary 
point 0 could be 


e a strict local maximum; 
e a strict local minimum; 
e an inflexion point; or 


e none of the above. 


Here it so happens that 0 is an inflexion point—we’ll learn why this is so in Ch. 97 
(below). 
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Example 1189. Define i: R > R by i (az) =1. 


The derivative of i is the function i’: IR > R defined by 7’ (x) = 0. Since 7’ (a) = 0 for all 
aeéR, every point a € R is a stationary point of 2. 


The second derivative of 7 is the function i”: R > R defined by i” (x) = 0. 


Figure to be 
inserted here. 


At each a € R, we have i”(a) = 0. So, by SDTE(c), the SDTE is inconclusive—the 
stationary point a could be 


e a strict local maximum; 

e a strict local minimum; 

e an inflexion point; or 

e none of the above. 

Here it so happens that a is “none of the above”. We can already prove that a is neither 


a strict local maximum nor a strict local minimum." In Ch. 97 (below), we’ll also learn 
to prove that a is not an inflexion point. 




















“47 et ae R. We have i(a) = 0. There exists some “nearby” point c such that i(c) = 0, so that i(c) < 
i(a) <i(c). So, a is neither a strict local minimum nor a strict local minimum. (This proof is slightly 
informal in that the term “nearby” is slightly imprecise.) 
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Example 1190. Let f : (0,2) > R be the function defined by 


f (a) =2-sin >. 


Q. Find any maximum or minimum points of f. 
Compute the first derivative: 


; M2 = TE 
eve cos = 
7 @) 7 C8 


Find any stationary points cé (0,2) of f: 


7 (e)=0 1-5 cos =0 


2 2 
So, the sole stationary point of f is c= = cos + me 0.561. 


By Fermat’s Theorem on Extrema, any extremum of f must also be a stationary point. 
Hence, c is the only possible extremum of f. Note though that at this point, we have not 


yet shown that c is actually an extremum of f. 


To do so, we can use the Second Derivative Test for Extrema (SDTE). Compute the 


second derivative: 


f" (a) =1+ el sin > 


Evaluate the second derivative at c: 
pe m\? | 7 
i (eyal+ (= sin > * 2.90 > 0. 


A. Hence, by the SDTE, c is a strict local minimum of f. 
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96. Concavity 


Definition 211 (informal). Let 1<r.*? We say that a function is'*® 


(a) Concave on (l,r) if its gradient is decreasing on (I,r). 
(b) Strictly concave on (l,r) if its gradient is strictly decreasing on (l,r). 
(c) Convex on (1,r) if its gradient is increasing on (1,7). 
(d) Strictly convex on (l,r) if its gradient is strictly increasing on (l,r). 


Example 1191. Define f :R—> R by f (x) = -2’. 


Figure to be 
inserted here. 


The gradient of f is decreasing on R = (-00, 00). So, f is concave on R = (-00, 00). 
Indeed, the gradient of f is strictly decreasing on R = (—co, 0). So, f is strictly concave 
on R = (-00, 00). 


Example 1192. Consider the exponential function exp. 


Figure to be 
inserted here. 


The gradient of exp is increasing on R = (-00, 00). So, exp is convex on R = (-00, 00). 


Indeed, the gradient of exp is strictly increasing on R = (-—0o0,00). So, exp is strictly 
convex on R = (00, 0). 








*°These definitions are “unofficial” because of these two flaws: They (i) define concavity only on open 
intervals, even though concavity may be more generally defined on any interval with more than one 
number; and (ii) require that the function be differentiable on (1,7). Definition 329 (Appendices) fixes 
these flaws (and gives this textbook’s official definitions of concavity). 

*6We allow for the possibilities that 1 =—oo and/or r = oo. 
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Example 1193. Define g:R—> R by g(z) = 2°. 


Figure to be 
inserted here. 


The gradient of g is decreasing on (—0c0,0). So, g is concave on (-00,0). 


Indeed, the gradient of g is strictly decreasing on (-c,0). So, g is strictly concave on 
(—00, 0) : 


The gradient of g is increasing on (0,00). So, g is convex on (0, 00). 


Indeed, the gradient of g is strictly increasing on (0, co). So, g is strictly convex on (0, co). 





Mnemonic: Concave functions look like a cave, while e” is conve’. 


Figure to be 
inserted here. 
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Example 1194. Define h: [0,27] by h(x) =sinz. 


Figure to be 
inserted here. 


The gradient of h is decreasing on (0,70). So, h is concave on (0,7). 


Indeed, the gradient of h is strictly decreasing on (0,7). So, h is strictly concave on 
(0) 70). 


The gradient of h is increasing on (71,271). So, h is convex on (7, 271). 


Indeed, the gradient of h is strictly increasing on (71,270). So, h is strictly convex on 
(71, 270). 


Example 1195. Define i: R > R by (az) =1. 


Figure to be 
inserted here. 


The gradient of i is decreasing but not strictly increasing on R = (—0«, 00). 
The gradient of 7 is increasing but not strictly decreasing on R = (—0«, 00). 


So, 7 is both concave and convex on R = (—0«, 0), but neither strictly concave nor strictly 
convex on R = (00, 0). 





Example 1196. XXX 
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A367(a) 


Figure to be 
inserted here. 





(b) 
(c) 


(d) The gradient of i is decreasing and indeed strictly decreasing on R. Hence, i is concave 
and also strictly concave on R. 


Figure to be 


inserted here. 





(e) The gradient of 7 is increasing and indeed strictly increasing on R. Hence, 7 is convex 
and also strictly convex on R. 
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96.1. The First Derivative Test for Concavity (FDTC) 


Take the unofficial definitions of concavity (p. 951) and replace each instance of the word 
“eradient” with “f’”. This gives us the First Derivative Test for Concavity (FDTC): 


Proposition 12. (First Derivative Test for Concavity, FDTC) Let D be an interval 
with endpoints | andr. Suppose f: D> R is differentiable. Then 


(a) f' is decreasing on (l,r) <=> f is concave on D. 


(b) ff’ is strictly decreasing on (l,r) <=> f is strictly concave on D. 
(c) f' is increasing on (l,r) <=> f is convex on D. 
— 


(d) f' is strictly increasing on (l,r) f is strictly convex on D. 








Proof. See p. 1693 (Appendices). 











We illustrate the FDTC using the same examples as before: 


Example 1197. Define f :R—> R by f (x) =-2’. 
The first derivative of f is the function f’:IR > R defined by f’ (x) = -2z. 


Figure to be 
inserted here. 


Since f’ is decreasing on R, by FDTC(a), f is concave on R. 
Indeed, f’ is strictly decreasing on R. So, by FDTC(b), f is strictly concave on R. 


Example 1198. Consider the exponential function exp. 


Its first derivative is itself: exp’ = exp. 


Figure to be 
inserted here. 


Since exp’ = exp is increasing on R, by FDTC(c), exp is convex on R. 


Indeed, exp’ = exp is strictly increasing on R. So, by FDTC(d), exp is strictly convex on 
R. 
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Example 1199. Define g:R—> R by g(z) = 2°. 
The first derivative of g is the function g’ (x): IR > R defined by g' (x) = 32”. 


Figure to be 
inserted here. 


Since g’ is decreasing on (-00,0), by FDTC(a), g is concave on (-00,0). 


Indeed, g’ is strictly decreasing on (-00,0). So, by FDTC(b), g is strictly concave on 
(=ce, 0), 


Since g’ is increasing on (0,00), by FDTC(c), g is convex on (0, 0). 
Indeed, g’ is strictly increasing on (0,00). So, by FDTC(d), g is strictly convex on (0, 00). 





Example 1200. Consider the sine function sin. 
Example 1201. XXX 


Example 1202. XXX 


Exercise 368. XXX (Answer on p. 956.) 
A368. 
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96.2. The Second Derivative Test for Concavity (SDTC) 


For convenience, we reproduce the First Derivative Test for Concavity (FDTC) (Ch. 96.1) 
and the Increasing/Decreasing Test (IDT) (Ch. 94): 


Proposition 12. (First Derivative Test for Concavity, FDTC) Let D be an interval 
with endpoints | and r. Suppose f: D> R is differentiable. Then 

(a) f' is decreasing on (l,r) <>  f is concave on D. 

(b) f’ is strictly decreasing on (l,r) <=> f is strictly concave on D. 

(c) f' is increasing on (l,r) <=> f is convex on D. 


(d) f' is strictly increasing on (l,r) <=> ff is strictly convex on D. 


Fact 208. (Increasing/Decreasing Test, IDT) Let f be a function and D be an 
interval. Suppose f is differentiable on D. 


(a) f'(x)2>0forallaeD f is increasing on D. 
(b): f' (2) 20 tor all we D f is strictly increasing on D. 
(ce) § (a) = 0tor alle) f is decreasing on D. 
(da) @) <0 tor all’ D f is strictly decreasing on D. 





By applying the IDT to the FDTC, we get the Second Derivative Test for Concavity 
(SDTC): 


Corollary 44. (Second Derivative Test for Concavity, SDTC) Let D be an interval 
with endpoints | andr (l<r). Suppose f: D>R is differentiable. Then 


(a) f"<Oon(l,r) <= f is concave on D. 


(b) f" <0 on (I,r) f is strictly concave on D. 


—=> 
(c) f">Oon(lr) =— > f is convex on D. 
(a) "son. Ce) 


f is strictly convex on D. 





We illustrate the SDTC using the same examples as before: 
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Example 1203. Define f :R—> R by f (x) =—-2’. 
The second derivative of f is the function f” : R > R defined by f” (x) = -2. 


Figure to be 
inserted here. 


Since f” < 0 on R, by SDTC(a), f is concave on R. 
Indeed, f” <0 on R. So, by SDTC(b), f is also strictly concave on R. 


Example 1204. Consider the exponential function exp. 


Its second derivative is itself: exp” = exp. 


Figure to be 
inserted here. 


Since exp” = exp > 0 on R, by SDTC(c), exp is convex on R. 
Indeed, exp” = exp > 0 on R. So, by SDTC(d), exp is also strictly convex on R. 


Example 1205. Define g:R > R by g(a) = 2’. 
The second derivative of g is the function g” (a) : IR > R defined by g” (x) = 62. 


Figure to be 
inserted here. 


Since g” < 0 on (-00,0), by SDTC(a), g is concave on (-00,0). 

Indeed, g” <0 on (—o0,0). So, by SDTC(b), g is strictly concave on (-00,0). 
Since g” > 0 on (0,00), by SDTC(c), g is convex on (0, 00). 

Indeed, g” > 0 on (0,00), so that by SDTC(d), g is strictly convex on (0, 00). 





958, Contents www.EconsPhDTutor.com 


Example 1206. Consider the sine function sin. 
Example 1207. XXX 


Example 1208. XXX 


By the way, note the one-way ==> ’s in the SDTC(b) and (d).*°’ That is, the converses 
of (b) and (d) are false: 


Example 1209. Define i: R ~ R defined by i (x) = -2*. 
The first derivative of i is the function 7’: R > R defined by i’ (x) = -42°°. 
The second derivative of i is the function i” : IR > R defined by i” (x) = —122”. 


Figure to be 
inserted here. 


We already showed in Exercise 367(d) that 7 is strictly concave on R. 


However, contrary to the converse of SDTC(b), it is not true that i” <0 on R. In 
particular, at 0, we have i” (0) = -12-0?=0 £0. 


This shows that the converse of SDTC(b) is false—the function 7 is strictly concave on 
R, but 7” £0 on R. 








437These one-way ==> ’s are simply inherited from the Increasing/Decreasing Test (IDT). 
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Example 1210. Define j : R > R defined by j (x) = 2’. 
The first derivative of j is the function 7’: IR > R defined by j’ (x) = 42°. 
The second derivative of j is the function j” : R > R defined by j” (x) = 122”. 


Figure to be 
inserted here. 


We already showed in Exercise 367(d) that 7 is strictly convex on R. 


However, contrary to the converse of SDTC(b), it is not true that 7” > 0 on R. In 
particular, at 0, we have j” (0) = 12-07 =0 #0. 


This shows that the converse of SDTC(b) is false—the function 7 is strictly convex on 
R, but 7” #0 on R. 





Exercise 369. XXX (Answer on p. 960.) 
A369. 
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96.3. The Graphical Test for Concavity (GTC) 


Example 1211. Define f :R—> R by f (2) = -2’. 


Figure to be 
inserted here. 


Below, we’ll formally state the Graphical Test for Concavity (GTC). Here the GTC 
says that 


(a) Since f is concave, if we pick any two points, say A and B, then f is on or above the 
line segment AB. 


(b) Since f is strictly concave, f is (strictly) above the line segment AB (excluding the 
two endpoints A and B). 


Example 1212. Consider the exponential function exp. 


Figure to be 
inserted here. 


The GTC says that 
(a) Since exp is convex, if we pick any two points, say A and B, then exp is on or below 
the line segment AB. 


(b) Since exp is strictly convex, exp is (strictly) above the line segment AB (excluding 
the two endpoints A and B). 
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Fact 209. (Graphical Test for Concavity, GTC) Let D be an interval and f be a 
function. Then 


f is on or above the line segment connecting 
(a) f is concave on D 


any two points in D. 


f is above the line segment connecting any 


(b) f is strictly concave on D 


two points in D (excluding the two endpoints). 


(c) ones Fea) f is on or below the line segment connecting 


any two points in D. 


@)) 49 siete comer f is below the line segment connecting any 





two points in D (excluding the two endpoints). 





Proof. See p. 1695 (Appendices). 











Example 1213. Define g:R > R by g(z) = 2°. 


Figure to be 
inserted here. 


The GTC says that 


(a) Since g is concave on (-00,0), if we pick any two points, say A and B in (-o,0), 
then g is on or above the line segment AB. 


(b) Since g is strictly concave on (-00,0), g is (strictly) above AB (other than the 
endpoints A and B). 

(c) Since g is convex on (0,00), if we pick any two points, say C and D in (0,00), then 
g is on or below the line segment C'D. 


(d) Since g is strictly concave on (0,00), g is (strictly) below CD (except the endpoints 
C and D). 
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Example 1214. Define h: [0,27] by h(x) =sinz. 


Figure to be 
inserted here. 


The GTC says that 
(a) Since h is concave on (0,71), if we pick any two points, say A and B in (0,70), then 
h is on or above the line segment AB. 


(b) Since h is strictly concave on (0,70), h is (strictly) above AB (other than the endpoints 
A and B). 


(c) Since h is convex on (71, 27t), if we pick any two points, say C' and D in (7, 27t), then 
h is on or below the line segment C'D. 


(d) Since h is strictly convex on (71,27), h is (strictly) below on C'D (other than the 
endpoints C and D). 


Example 1215. Define i: R> R by i(z) =1. 


Figure to be 
inserted here. 


We showed earlier that 7 is both concave and convex on R, but neither strictly concave 
nor strictly convex on R. 


The GTC says that 


(a) Since 7 is concave on R, if we pick any two points, say A and B in R, then 7 is on or 
above the line segment AB. 


(b) Since i is not strictly concave on R, we can find two points C' and D such that 7 is 
not (strictly) above the line segment CD. 


(c) Since 7 is convex on R, if we pick any two points, say FE and F in R, then i is on or 
below the line segment F’F’. 


(d) Since i is not strictly convex on R, we can find two points G and H such that 7 is 
not (strictly) below the line segment GH. 
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Exercise 370. XXX (Answer on p. 964.) 
A370. 
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96.4. A Linear Function Is One That’s Both Concave and Convex 


Informally, a linear function is one whose graph is a straight line—or one whose gradient 
is both decreasing and increasing. Hence, a linear function is a function that’s both concave 
and convex. 


Example 1216. Graphed below is the function h: R > R defined by h(x) =<. 


Figure to be 
inserted here. 


“Obviously”, fh is linear on R. 
Less obviously, h is also concave on R. Because XXX 


Similarly, h is also convex on R. Because XXX 





Exercise 371. XXX (Answer on p. 965.) 
A371. 
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97. Inflexion Points 


Informally, an inflexion point is a point at which a function’s concavity (or curvature) 
changes. A little more precisely, an inflexion point is a point at which the function “changes 
from” 


(a) Strictly concave to strictly convex; OR 


(b) strictly convex to strictly concave. 


Even more precisely (but still not quite formally), 


Definition 212 (informal). We call c an inflexion point of a function if the function 
) 438 


satisfies either statement (a) or (b 


(a) Strictly concave on c’s “immediate left” & strictly convex on c’s “immediate right”. 
(b) Strictly convex on c’s “immediate left” & strictly concave on c’s “immediate right”. 


Example 1217. Consider the function f : IR — R defined by f (x) = 2°. 


Figure to be 
inserted here. 


At 0, f “changes from” strictly concave to strictly convex. Hence, 0 is an inflexion point 


Ont; 


Example 1218. Consider the function g : R > R defined by g (2) = -2°. 


Figure to be 
inserted here. 


At 0, g “changes from” strictly convex to strictly concave. Hence, 0 is an inflexion point 
of g. 








“88The definition given here is informal in that the phrases “immediate left” and “immediate right” are 
imprecise. For a formal definition, see Definition 330 (Appendices). 
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Example 1219. XXX 


Example 1220. XXX 


Exercise 372. XXX (Answer on p. 967.) 
A372. 





439 According to Google Ngram, even when we restrict attention to British English, inflection is the more 
common spelling. It seems that inflerion is an archaic spelling (similar to connezion). 
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97.1. The First Derivative Test for Inflexion Points (FDTT) 


Proposition 13 (informal). (First Derivative Test for Inflexion Points, FDTT) 


Let f be a function and c be a point. If c is an inflexion point of f, then c ts a strict local 


extremum of f’.4 





Proof. Since c is an inflexion point, by definition, f is either (a) strictly concave on c’s 
“immediate left” and strictly convex on c’s “immediate right”; OR (b) strictly convex on 
c’s “immediate right” and strictly concave on c’s “immediate left”. 


Suppose (a). Then by the First Derivative Test for Concavity (FDTC), f’ is strictly 
decreasing on c’s “immediate left” and strictly increasing on c’s “immediate left”. And so, 
by the First Derivative Test for Extrema (FDTE), c is a strict local minimum of f". 











Suppose (b). Then we can similarly show that c is a strict local maximum of f’. 





Example 1221. Consider the function f : R > R defined by f (x) = («- 1)’. 


« Its first derivative is the function f’:R > R defined by f(x) =3(x-1)°. 


e It has an inflexion point at 1. 


Figure to be 
inserted here. 


As promised by the FDTI, the point 1 is a strict local minimum of f’4" 








“0For a formal statement of the FDTI, see Proposition 35 (Appendices). 
“41Tndeed, 1 is a strict global minimum of f'—for all x #1, f’ (xr) >0= f’ (1). 
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Example 1222. Consider the function g:R > R defined by g(x) = -(z-1)°. 


« Its first derivative is the function g’:R > R defined by g’ (x) = -3(x- 1)’. 


e It has an inflexion point at 1. 


Figure to be 
inserted here. 





As promised by the FDTI, the point 1 is a strict local maximum of f’.4” 
Example 1223. XXX 


Example 1224. XXX 


Exercise 373. XXX (Answer on p. 969.) 
A373. 








“Indeed, 1 is a strict global maximum of g'—for all x #1, g’ (x) < 0=4' (1). 
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97.2. The Second Derivative Test for Inflexion Points (SDTI) 


Proposition 14 (informal). (Second Derivative Test for Inflexion Points, SDTT) 


Let f be a function and c be a point. Suppose f is twice-differentiable “around” c. If c is 
an inflexion point of f, then f"(c) =0.4% 





Proof. By the First Derivative Test for Inflexion Points, c is a strict local extremum of f’. 
So, by Fermat’s Theorem on Extrema, f” (c) = 0. 














Example 1225. Consider the function f :R > R defined by f (x) = (x- 1)”. 


« Its first derivative is the function f’:R > R defined by f’ (x) =3(x-1)’. 
¢ Its second derivative is the function f” : R > R defined by f” (x) =6(a#-1). 


e It has an inflexion point at 1. 


Figure to be 
inserted here. 


As promised by the SDTI, f” (1) =6-(1-1) =0. 


Example 1226. XXX 
Example 1227. XXX 


Exercise 374. XXX (Answer on p. 970.) 
Exercise 375. Write down the statement that is the converse of the SDTI—is this 





statement true? Explain the relationship between the SDTI and SDTE(c). (Answer on 
p. 970.) 


A37A. 
A375. The statement that is the converse of the SDTI is this: 





“Tf f" (c) =0, then c is an inflexion point of f.” 


The above statement is false. As we learnt in SDTE(c), if f”(c) = 0, then c could be a 
strict local maximum, a strict local minimum, an inflexion point, or “none of the above”. 


The SDTI is a (partial) converse to SDTE(c). 





“For a formal statement of the SDTI, see Proposition 36(Appendices). 
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97.3. The Tangent Line Test (TLT) 


In Ch. 96 (Concavity), we had three tests to help us investigate concavity—the FDTC, the 
SDTC, and the GTC. 


In the present chapter, we have, analogously, three tests to help us investigate inflexion 
points—the FDTI (Ch. 97.1), the SDTI (Ch. 97.2), and now the Tangent Line Test 
(TLT):™* 


Proposition 15 (informal). (Tangent Line Test, TLT) Let f be a function and c be 
a point. If c is an inflexion point of f, then the tangent line of f at c satisfies either (a) 


or (b): 
(a) Strictly above f on c’s “immediate left” & strictly below f on c’s “immediate right”. 
(b) Strictly below f on c’s “immediate left” & strictly above f on c’s “immediate right”. 


Example 1228. The function f : IR — R defined by f (x) = x has the inflexion point 0. 
Let | be the tangent line of f at 0. 


Figure to be 
inserted here. 


We can easily verify that the inflexion point 0 passes the TLT(a)—_ is 


e Strictly above f on c’s “immediate left”; and 


e Strictly below f on c’s “immediate right”. 





Example 1229. XXX 
Example 1230. XXX 


Example 1231. XXX 


Exercise 376. XXX (Answer on p. 971.) 
A376. 





““4For a formal statement of the TLT, see Proposition 37 (Appendices). 


971, Contents www.EconsPhDTutor.com 





972, Contents www.EconsPhDTutor.com 


97.4. Non-Stationary Points of Inflexion (optional) 


In H2 Maths, you'll only ever be asked about stationary points of inflexion. That is, 
every inflexion point you’ll ever be asked about will also be a stationary point. 


Nonetheless, you should be aware that non-stationary points of inflexion exist. That 
is, an inflexion point can be a non-stationary point. Indeed, such points occur in commonly 
encountered functions: 


Example 1232. Define f:R—>R by f (x) = 2° +2. 
The first derivative of f is the function f’:R— R defined by f’ (x) = 327 +1. 


Figure to be 
inserted here. 


Consider the point 0. 
Observe that 0 is not a stationary point because f’ (0) =3-07+1=1+#0. 


However, at 0, f is strictly concave to the “immediate left” and strictly convex to the 
“immediate right”. In other words, at 0, f changes from being strictly concave to strictly 
convex. Hence, by our 212 informal definition of inflexion points, 0 is an inflexion point 


io aie 


Exercise 377. Verify that the inflexion point in the last example satisfies (a) the FDTI; 
(b) the SDTI; and (c) the TLT. (Answer on p. 973.) 


A377. 





Example 1233. Define h: R > R by h(x) = x? - 227 + 2. 
XXX 
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98. A Summary of Chapters 94, 95, 96, and 97 


In the last four chapters, we covered nine tests. Quick recap: 


1. A function can be increasing or decreasing on an interval. 
The IDT can help us determine which is the case. 

2. A stationary point can be an extremum, inflexion point, or neither of the above. 
The FDTE can help us determine whether a stationary point is a local maximum, local 
minimum, strict local maximum, or strict local minimum. 
The SDTE can help us determine whether a stationary point is a strict local maximum 
or strict local minimum. 

3. A function can be concave or convex on an interval. 
The FDTC, SDTC, and GTC can help us determine which is the case. 

4. An inflexion point is a point at which a function’s concavity (or curvature) changes. 


The FDTI, SDTI, and TLT can help us determine whether a point is an inflexion 
point. 


Exercise 378. For each function, identify any intervals on which the function is (i) 
increasing, decreasing, strictly increasing, or strictly decreasing; and (ii) concave, convex, 
strictly concave, or strictly convex. Then also identify (iii) any stationary points; (iv) 
the nature of each stationary point; and (v) any turning points. (Answer on p. 974.) 


(a) f:R-—R defined by f (x) =XXX 
(b) g:R-R defined by g (x) =XXX 
(c) h:R->R defined by h (a) = XXX 





A378(a) XXX 
(b) XXX 
(c) XXX 
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99. More Techniques of Differentiation 


Your H2 Maths syllabus (p. 8) includes 


¢ “relating the graph of y = f’(x) to the graph of y = f (x)”; 
e “finding equations of tangents and normals to curves, including cases where the curve is 
defined implicitly or parametrically”; and 


« “connected rates of change problems”. 


So, these are the three topics we’ll cover in this chapter. 
99.1. Relating the Graph of f’ to That of f 


Example 1234. XXX 
Example 1235. XXX 


Example 1236. XXX 


Exercise 379. XXX (Answer on p. 975.) 
A379. 
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99.2. Equations of Tangents and Normals 
We reproduce from Ch. 7 the following fact and definition: 
Fact 28. The line that contains the point (x1,y,) and has gradient m is 
y-y1 =m (z-2)). 


Definition ??. Two lines are perpendicular if 


(a) Their gradients are negative reciprocals of each other; or 


(b) One line is vertical while the other is horizontal. 


If two lines / and m are perpendicular, then we will also write / 1 m. 


Example 1237. XXX 


Figure to be 
inserted here. 





We now also write down the formal definitions of tangent and normal lines: 


Definition 213. Let f be a nice function that is differentiable at the point a. 


The tangent line (or simply tangent) of f ata is the line that has gradient f’(a) and the 
point (a, f (a)). 

The normal line (or simply normal) of f at a is the line that has gradient -1/f’(a) and 
the point (a, f (a)) (provided f' (a) #0). 


Corollary 45. Suppose f is a nice function that is differentiable at the point a. Then 
the tangent of f at a is described by 


y= f(a) + fi (a)(w-a). 


And if f' (a) #0, then the normal of f at a is described by 


1 
ey ao 

















Proof. Apply Fact 28 to Definition 213. 
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Example 1238. The curve C has parametric equations*” 


g=t+t, y=t®-t, for te R. 


Let 1 be the normal line of C' at t= 0. 


Figure to be 
inserted here. 


Q. Other than at t = 0, at which point(s) do / and C intersect? 
First, at t = 0, we have (x,y) = (0,0). 
Next, by implicit differentiation, 


a (oe fe 
7 j= 


dt dt 


t=0 t=0 


dx 
Hence, / has gradient —-1 and / is described by y = 0- (-1) (a - 0)—or more simply, y = x. 
The point(s) where / and C intersect is/are given by this system of equations: 


L 


Lo. y 218 —¢, and x2t+t. 


Plug 2 and 2 into = to eet Pt £6 _ +. Rearranging, He ae - 2) =). 50,7 =U or 
t # 1.45 (calculator). 
A. Other than at t= 0, / and C intersect at ¢ » 1.45 or at (x,y) » (7.88, 7.88). 


Exercise 380. The curve C’ has parametric equations 
g=t+t, y=t*-t, for te R. 


Let 1; and Jy be the tangent lines to C' at t = 0 and ¢ = 1, respectively. Find the point 
where J; and ly intersect. (Answer on p. 1915.) 








“1Here we again appeal to the Implicit Function Theorem (see n. 414 on p. 897)—we assume it is possible 
to express y as a function of x (even though we don’t know how exactly to). 


446By the Fundamental Theorem of Algebra, the sixth-degree polynomial equation = must have six roots. 
In this case, only two are real (and these are the two that we’ve found), while the other four are complex. 
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99.3. Connected Rates of Change Problems 


Example 1239. XXX 
Example 1240. XXX 
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Example 1241. Sand lands on the floor at a steady rate of lm?®s~!. (The floor is initially 
spotless. ) 


Magically, the sand always forms a perfect cone with volume V, height h, and base radius 
r. Moreover, the height and base diameter of the cone are always equal. 


Figure to be 
inserted here. 


Q. At the instant t = 2s after sand begins landing on the floor, what is the rate at which 
r is increasing? 


| 
You will doubtless**’ recall the formula for the volume of a cone: V = gh. 


2 
Since the height and base diameter are always equal, h = 2r. Plug this in: V 2 ae 


Differentiate + with respect to t: 


(What Rule did we just use?)** 


dV 
We are given that sand lands at a steady rate of 1m?® aa 3] (for all t > 0). 


4 


At t = 2, the volume of sand that has landed is 2m°—V 2. 


U=Z 


Plug * into + and rearrange—at t = 2, the base radius is r 2 ( 
t=2 


Plug 3 and 2 into 2: 


dV 3\2/3 dr 
2 a he =2 =| = 
di iar ONG ee 


t=2 = = i 


dr 
29 3 32/3 
is dt 


dr 6 1 
ne ; ~ Oaqql/3 . 32/3 


te 


Rearranging, x 0.164. 





A. The base radius r is increasing at a rate of 0.164ms"t. 





447 Just. kidding. You probably don’t. 
But unfortunately, you are supposed to remember everything you learnt in O-Level Maths. And one of 
those things is this formula! (See e.g. 2019 O-Level Maths syllabus, p. 8.) And even more unfortunately, 
this formula is not on List MF26. So yea, you may want to remember this formula (which we'll learn 
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Exercise 381. Continue with the last example. Let A be the base area of the cone. At 
t = 2, what is the rate at which A is increasing? (Answer on p. 1915.) 


Exercise 382. A cone has lateral J, base radius r, and height h. (Answer on p. 1915.) 


Let V be the cone’s volume. Let S be the cone’s external surface area excluding the base. 
From O-Level Maths, we know that**” 


S nr. 


Suppose we want to construct a cone with volume 1m* and whose external surface area 
(excluding the base) is minimised. What should this cone’s height h be? (You can follow 
the steps below.) 


(a) Express r in terms of h. 


(b) Use Pythagoras’ Theorem to express / in terms of r and h. Hence express / solely in 
terms of h. 


(c) Now express S solely in terms of h. 


1/3 
(d) Show that the only stationary point of S (with respect to h) is at h = (=) ; 


(e) Explain why this stationary point is a strict global minimum and hence, conclude 
6 1/3 
that A= (=| 
Tl 








to derive in Ch. 109.6). 


Note though that in N2016/II/1 (Exercise 685), they were nice enough to provide this cone volume 
formula. Maybe they'll be nice with you too. Or maybe not. 
“48Chain Rule. 


“49 Avain, both these two formulae are from O-Level Maths. Hence, you’re supposed to know them. 
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100. More Fun With Your TI184 


As part of your training as an obedient monkey, your H2 Maths syllabus (p. 8) includes: 
¢« “locating maximum and minimum points using a graphing calculator”; and 


e “finding the approximate value of a derivative at a given point using a graphing calcu- 
lator”. 


So, these fascinating topics are what we’ll cover in this chapter. 
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100.1. Locating Maximum and Minimum Points Using Your TI84 


Example 1242. Let f: (0,2) > R be the function defined by 


i (a) = 2— sin, 


Q. Find any maximum or minimum points of f. 
In Example 1190, we already solved this question without using our TI84. 


But now as an exercise, let’s also solve it using our T184: 


Step 1. Step 2. Step 3. 


. Press to turn on your calculator. 

. Press to bring up the Y = editor. 

. Press RR @ (0) « (5). To enter “7”, press the blue button and then 7 
(which corresponds to the @ button). Now press and altogether you will 
have entered “x — sin (0.57t2)”. 

. Now press and the calculator will graph y = x - sin (0.5712). 
Note that in our question, the domain of f is actually (0,2), but we didn’t bother 
telling the calculator this. So the calculator just went ahead and graphed the equation 
y = x-sin(0.57tz) for all possible real values of x and y. 
No big deal, all we need to do is to zoom in to the region where 0 < x < 2. 

. Press the button to bring up a menu of ZOOM options. 

. Press (2) to select the Zoom In option. Using the || and || arrow keys, move the cursor 


to where X = 1.0638298, Y = 0. Now press (QE and the TI will zoom in a little, 
centred on the point X = 1.0638298, Y = 0. 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


It looks like starting at x = 0, the function is decreasing, hits a minimum point, then 
keeps increasing. Our goal now is to find this minimum point. 


Step 7. Step 8. Step 9. Step 10. 


Step 11. Step 12. Step 13. 


. Press the blue W button and then CALC (which corresponds to the (QN@3 
button). This brings up the CALCULATE menu. 

. Press (3) to select the “minimum” option. This brings you back to the graph, with a 
cursor flashing. Also, the TI84 prompts you with the question: “Left Bound?” 
T1I84’s MINIMUM function works by you first choosing a “Left Bound” and a “Right 
Bound” for x. TI84 will then look for the minimum point within your chosen 
bounds. 

. Using the |(| and })] arrow keys, move the blinking cursor until it is where you want 
your first “Left Bound” to be. For me, I have placed it a little to the left of where I 
believe the minimum point to be. 

. Press WININSIN) and you will have just entered your first “Left Bound”. 

T1I84 now prompts you with the question: “Right Bound?” 

. Now repeat: Using the |(] and ))| arrow keys, move the blinking cursor until it is where 
you want your first “Right Bound” to be. For me, I have placed it a little to the right 
of where I believe the minimum point to be. 

. Again press U3)MIN38) and you will have just entered your first “Right Bound”. 


TI84 now asks you: “Guess?” This is just asking if you want to proceed and get T184 
to work out where the minimum point is. So go ahead and 


. Press WQUH5N). T184 now informs you that there is a “Minimum” at “X = .56066485”, 
“¥ S— 2051387 





Exercise 383. XXX (Answer on p. 983.) | 
A383. 
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100.2. Find the Derivative at a Point Using Your TI84 


This example will also illustrate how to graph parametric equations on the T184. 
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Example 1243. The curve C has parametric equations 


a a es for teR. 


d 
Q. What is oe 


fe 
dx 





t=1 
We can solve this question easily without using our T184: 


PXde de 7 
el 


But as an exercise, let’s try solving this question using our TI84: 


dy 
dx 
=a 


6t? - 1 
bo ed 
t=0 


5 





A. 











Step 1. Step 2. Step 3. Step 4. 
Step 5. Step 6. Step 7. Step 8. 


1. Press to turn on your calculator. 


2. Press QWI@Q)B)3 to bring up a menu of settings that you can play with. In this example, 
all we want is to plot a curve based on parametric equations. So, 


3. Using the arrow keys, move the blinking cursor to the word PARI (short for parametric) 
and press [2NCEDIEY. 


4. Now as usual, we’ll input the equations of our curve. To do so, press to bring up 
the “Y =” editor. Notice that this screen looks a little different from usual, because 
we are now under the parametric setting. 


5. Press R@gatean RGM and altogether you will have entered “T° +7” in 
the first line. 


6. Now press (21095) to go to the second line. 


7. Press 6) §)) @ GSERA and altogether you will have entered “T° - T” in 
the second line. 


8. Now press (@iNes9 and the calculator will graph the given pair of parametric equa- 
tions. 





Notice that strangely enough, the graph seems to be empty for the region where 
x < 0. But clearly there are values for which x < 0—for example, if ¢ = -2, then 
(x,y) = (-34,18). So why isn’t the TI84 graphing this? 
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(... Example continued from the previous page.) 


The reason is that by default, the TI84 graphs only the region for where 0 < t < 27 (at 
least this is so for my particular calculator and operating system). We can easily adjust 
this: 

9. Press the WZ BONY button to bring up a menu of WINDOW options. 


10. Using the arrow keys, the number pad, and the U2NIU3I8) key as is appropriate, change 
Tmin and Tmax to your desired values (I use Tmin = -10 and Tmax = 10). 


. Then press (Qe) again and the calculator will graph the given pair of parametric 
equations, now for the region Tmin < t < Tmax, where Tmin and Tmax are whatever 
you chose in the previous step. 


Step 9. Step 10. Step 11. Step 12. 


Step 13. Step 14. Step 15. 


Actually, the last few steps were really not necessary, if all we wanted was to find 


dy 

—]  ,as we do now: 
dz ~ 

. Press the blue button and then CALC (which corresponds to the (QN@3 
button). This brings up the CALCULATE menu, which once again looks a little 


different under the current parametric setting. 

. Press (2) to select the “dy/dx” option. This brings you back to the graph. 
Nothing seems to be happening. So, 

. Press (1) and now the bottom left of the screen changes to display “T= 1”. 


d 
. Hit (YOR. What you’ve just done is to ask the TI84 to compute ~ . It duly 
z 


t=1 
tells you that “dy/dzx = .83333528”.4°° 








“0Vour TI84 will sometimes be a tiny bit off as it uses numerical (rather than exact) methods. Here for 


d 5 
example, we know that the exact correct answer is 7 aoe 0.8333.... So, the T184’s “.83333528” is 
z 
t=1 


a tiny bit off. 
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Exercise 384. XXX (Answer on p. 987.) 
A384. 
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101. Power Series 


101.1. A Power Series Is Simply an “Infinite Polynomial” 
We reproduce from Ch. 5.8 our definition of polynomials: 


Definition 33. Let co,c),...,¢p, be constants, with c, + 0. The expression 


Cal” HG Sehr a ee hey 


is called an nth-degree polynomial (in one variable x). We also call 


¢ Each cr! the ith-degree term (or more simply the ith term); 
¢ Each c; the ith coefficient on x' (or the ith-degree coefficient, or the ith coefficient); 


¢ The Oth coefficient co the constant term (or, more simply, the constant). 
A (nth-degree) polynomial equation (in one variable x) is any equation 


Gr 2G See 


+o Ci eg =), 
or any equation that can be rewritten in the above form. 


Example 1244. The expression 7x — 3 is a 1st-degree (or linear) polynomial. 


The expression 327 + 42 — 5 is a 2nd-degree (or quadratic) polynomial. 


The expression —5x° + 2x +9 is a 3rd-degree (or cubic) polynomial. 


The expression 18 + 5x — x? + x* is a 4th-degree (or quartic) polynomial. 





You can easily imagine what an “infinite-degree polynomial” is. Except we don’t call it 
that. Instead, we call it a power series: 
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Example 1245. The following expression is a power series: 


S\ (Ltn) a” =14 2x +32? +403 + 527+ 62? +... 
n=0 


The Oth coefficient is 1. The constant term, constant, or Oth term is 1. 
The Ist coefficient is 2. The 1st term is 22. 

The 2nd coefficient is 3x7. The 2nd term is 32. 

For each n € Z5, the nth coefficient is 1 +n; and the nth term is (1+n) a”. 
Note that a power series is, by definition, an infinite series.*”! 


By the way, we’ll often find it convenient to give our power series a name—that is, denote 
it by a symbol. 


Here for example, we might call the above power series P. This is convenient, because 
then we can write such things as 


Pa ere an A oe eee 


1 1 i! 1 
=1+2- rece a re 7 


Example 1246. The following expression is a power series: 
 ( eS a a ee 


The Oth coefficient is 1. The constant term, constant, or Oth term is 1. 
The Ist coefficient is -1. The lst term is —2. 

The 2nd coefficient is 1. The 2nd term is 2’. 

For each n € Zj, the nth coefficient is (-1)"; and the nth term is (-1)" 2”. 


If we call this power series Q, then we can write such things as 


OG) = I= e272 7 = 


11 11 1. 
Ba ey ae oe 





Formal definitions: 





“°1Tf you don’t remember what an infinite series is, you should review Ch. 47.1. 
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Definition 214. Let co,ci,c2,... be constants. We call*” the following expression a 
power series:*9 


co 


S\Cn&" = Co + CL + C20" +... 


We also call 


e Each c,x” the nth-degree term (or more simply the nth term); 


e Each c, the coefficient on x” (or the nth-degree coefficient, or the nth coefficient); 


¢ co the constant term (or, more simply, the constant). 





Exercise 385. XXX (Answer on p. 990.) 
A385. 





*°To be clear, this is the definition for a power series (a) in one variable; and (b) centred on 0. 
(a) Example of a power series in two variables: 2 + 3x2 + 4y + 5vy + 6x2y+7ry?+.... 
(b) Example of a power series centred on 3: 1+2(x-3)+3(x-3)?+4(x-3)°+.... (Of course, a 


power series centred on 3 can always be rewritten as a power series centred on 0.) 


453 As usual, it is customary to use the symbol x to denote our variable. But as usual, this is merely a 
dummy variable that can be replaced by any other symbol, such as y, z, ©, or x. 
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101.2. A Power Series Can Converge or Diverge 


In Part II (Sequences and Series), we briefly and informally discussed the concept of con- 
vergence. 


Two quick examples to jog your memory and illustrate convergence: 


iene 
Example 1247. The series = One + 39 +... converges to 1. Or equivalently, 


HEY an, Mell 
—~+—+—+4+ 
16 32 


By the w the e essi eee geet - i infinite seri but i if W 
y ay XpTress1on ... 1S an ininite series, Dut 1S NOt a power 
£ 2 4 8 16 32. ABA P 


series. This is because a power series involves a variable. 


Remark 144. As repeatedly emphasised in Part II (Sequences and Series), we must be 
very careful when dealing with infinite series. 


In particular, 
the equals sign = above is not the usual equals sign. 


Instead, it means converges to, which has a precise and technical meaning.*” 


Fortunately, for H2 Maths, you are not required to know anything about this “precise 
and technical meaning”. You need merely intuitively understand that = means that the 
1 i 1 1 


series = + — + ao “gets ever closer to” 1. And more generally, whenever we 


say that “a series converges to some number L”, we simply mean that “the series gets 
ever closer to L”. 


This is just so you know. For H2 Maths, there will be little danger in simply and blithely 
assuming that = is the same as the usual equals sign (even if this is, strictly speaking, 
incorrect). 


Example 1248. The expression 1+ 7+27+2°+a24+2°+... is a power series. 


As we learnt in Ch. 50.2 (Infinite Geometric Sequences and Series), for any x € (-1,1), 
this power series converges to 1/(1-). That is, 


oe 
ltaetaertortettot+...2 


(The above Remark also applies here.) 


And so for example, for x = 1/2, we have 


BackO Rn. meena 


#3 And this variable is usually and customarily denoted by the symbol x (but could be denoted by any 
other symbol). 
#4See Ch. 143 (Appendices). 
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Three quick examples to illustrate divergence: 


Example 1249. Informally, the series 1+1+1+1+... “blows up to oo”. 


So, it does not converge. Equivalently, it diverges. 


Example 1250. Informally, the series 1+2+4+8+16+... “blows up to o”. 


So, it does not converge. Equivalently, it diverges. 


Example 1251. Informally, the series -1-2-3-4-5-... “blows up to —co”. 





So, it does not converge. Equivalently, it diverges. 


Exercise 386. XXX (Answer on p. 992.) 
A386. 
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101.3. A Power Series and Its Interval of Convergence 


Definition 215. Given a power series, the set of values on which it converges is called 
its interval of convergence. 


Example 1252. Consider the power series 1+ 2+ 27+23+.... 


We know that for every x € (-1,1), this power series converges (to a real number).*°° 


Conversely, for every x ¢ (-1,1) or x € (—00, -1] U[1, 09), it diverges (i.e. fails to converge 
to a real number). 


Hence, this power series has interval of convergence (-1, 1). 


Example 1253. Consider the power series x — = 


It can be shown that for every x € (-1, 1], this power series converges (to a real number) .*°” 


Conversely, for every x ¢ (-1,1] or x € (-c0, -1]U(1, 0), it diverges (i.e. fails to converge 
to a real number). 
Hence, this power series has interval of convergence (-1, 1]. 


Note. In H2 Maths, we will not learn why this power series converges on (-1,1] but 
diverges elsewhere. Instead, this is simply something we mindless monkeys are to “know” 
without any actual understanding. 


2 is 
Example 1254. Consider the power series 1 + x + ai + a +o... 

It can be shown that for every x ¢ R, this power series converges (to a real number).*° 
Hence, this power series has interval of convergence R. 


(Note from previous example applies here.) 


Example 1255. Consider the power series 1+ 7+ 2!77+ 3!a°4+.... 


“Clearly”, at x = 0, we have 1-0 + 2!-0? +3!-0%+..., which converges to 1. 


It can be shown that, conversely, for every x #0, 1+2+2!a7+3!2°+... diverges (ie. fails 
to converge to a real number). 


Hence, this power series has interval of convergence {0}. That is, it converges at 0 and 
no other point. 





(Note from previous example applies here.) 


The following result is beyond the scope of H2 Maths but is simple and good to know. It 
says that the set of values on which a power series converges is always an interval (which 
is why we call it the interval of convergence, rather than say the set of convergence). 
Moreover, this interval always has midpoint 0: 





*6Indeed, it converges to 1/(1- 2). 
“Tt converges to In(1+.2), as we'll learn in the next chapter. 
481t converges to exp x, as we’ll learn in the next chapter. 
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Fact 210. The set of values on which a power series’? converges is (-R,R), [-R, R], 





(R,R], or [R, R), for some R>0 (we also allow for the possibility that R = co). 





Proof. Omitted.* 


We revisit our last four examples of power series. We can easily verify each obeys Fact 210: 











Example 1256. 1+2+27+2°+... has interval of convergence (-1,1) or (—R, R) with 


Heel 

qe ee 
Example 1257. x - cre has interval of convergence (-1,1] or (-R, R] with 
aa 


3 


2 
Example 1258. 1+x+ = a = +... has interval of convergence R or (-R, R) with R = oo. 


Example 1259. 1+ 7+ 2!x?+3!x2°+... has interval of convergence {0} or [—R, R] with 
R= 0. 





Exercise 387. XXX (Answer on p. 994.) 
A387. 





459 \ power series centred on 0—see footnote to Definition 214. 
460See e.g. Tao (Analysis IT, 2016, Theorem 4.1.6, pp. 76-77). 
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101.4. (Some) Functions Can be Represented by a Power Series 


1 


Example 1260. Let f : (-1,1) > R be the function defined by f (x) = a 
-2x 


Separately, let P be the power series defined by P(x) =1+%+27+2°+.... 


Observe that for x = 0, we have 


L020 404) .2 —. 
et = 


(0) 


So, we say that f can be represented by P at 0 (or that P is a power series repre- 
sentation of f at 0). 


1 
Observe that ee ee = r every x € (-1,1) = Domain f. 
— — 7 


P(a) ———— 


f(x) 
That is, f can be represented by P at every x € Domain f (or P is a power series 
representation of f at every x € Domain f). 


So, we say that f can be represented by P (or that P is a power series representa- 
tion of f). 


Definition 216. Let P be a power series and f be a function. 


If P(a) = f (a) (for some a € Domain f), then we say that f can be represented by P at 
a (or that P is a power series representation of f at a). 


If f can be represented by P at every x € Domain f, then we say that f can be represented 
by P (or that P is a power series representation of f). 





Informally, given a function that can be represented by a power series, we can think of it 
as an “infinite polynomial function”. 


Example 1261. XXX 


Example 1262. XXX 


Exercise 388. XXX (Answer on p. 995.) 
A388. 
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101.5. Term-by-Term Differentiation of a Power Series 


By the Sum Rule for Differentiation, we can differentiate a polynomial function “term-by- 
term” to get its derivative: 


Example 1263. Define the function g:R— R by 


g(x)=27%+a4+1. 
By the Sum Rule (for Differentiation), we can simply differentiate the expression x7+2+1 
“term-by-term” to find that the derivative of g is the function g’: R > R defined by 


gf (2) =a? + Sat C1 = Ie th. 





Example 1264. Define the function h: R —> R by 
h(a) = 172° + 14a" + 112? + 827 + 5x +2. 


By the Sum Rule, we can simply differentiate the expression 177° + 1424+ 11x°+8x7+5x+2 
“term-by-term” to find that the derivative of h is the function h’: R > R defined by 


d d d d d d 
Ce ql + qt + qi” 4 a8" tape ae = 852° S 56a + 33a + l6z 5. 





It turns out that happily, we can also differentiate “infinite polynomial functions”—or 
power series—term-by-term: 


996, Contents www.EconsPhDTutor.com 


Example 1265. Define the function f : (-1,1) > R by 


f(x)= = 


By proceeding as usual, we find that its derivative is the function f’: (—1,1) > R defined 
by 


'(g)= : 
Matos 


We now introduce another method for finding the derivative of f. 


We know that f can be represented by the power series 1+ 2+27+2°+.... That is, 


| 
F(t)=7 alter g oa + lfor all ge (—1, 11) = Domain |. 
7 


It turns out that by Theorem 37 (below), we can differentiate this power series term by 
term to find that f’s derivative is the function f’: (-1,1) > R defined by 


f'(v) 2142x0432? 4... 


By the way, as a bonus, together, + and = show that 


i 
ess cola fom alle (= 1,407 
-2 


Example 1266. XXX 
Example 1267. XXX 


Here’s the formal result that justifies term-by-term differentiation: 





Theorem 37. (Term-by-Term Differentiation) Let R>0 and co,c1,C2,::-€ R. Sup- 
pose the function f :(-R,R) > R is defined by 


f (2) = co +x + ox" + 32° - -> eae 


Then f is differentiable and its derivative is the function f':(—R,R) > R defined by 


co 
! = Ds ene n-1 
nm 
i (@) = ca or sea t2= > nes 








Proof. Omitted.*°! 














461See e.g. Abbott (2015, Theorem 6.5.7). 
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Given Theorem 37, we can easily prove that every function that can be represented by a 
power series is smooth (or infinitely differentiable): 


Corollary 46. Let R>0O and f:(-R,R) > R be a function. If f can be represented by 
a power series, then f is smooth. 


Proof. Let n be a positive integer. 


By Theorem 37, f is differentiable and its derivative f’ : (—R,R) > R can be represented 
by a power series. 


By Theorem 37, f’ is differentiable and its derivative f” : (—R,R) > R can be represented 
by a power series. 


By Theorem 37, f‘"-)) is differentiable. 


We’ve just shown that for any positive integer n, f is n-times differentiable.“°? Hence, by 
Definition 209, f is smooth. 














Exercise 389. XXX (Answer on p. 998.) 
A389. 





4©2Here we use the Principle of Mathematical Induction (see e.g. Wikipedia), a simple and delightful 
topic that was sadly taken out from the H2 Maths syllabus in 2017. 
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101.6. Chapter Summary 


1. A power series is simply an (infinite) series and an “infinite polynomial”. 


Example 1268. The following expression is a power series: 


2 3 


erea=l+aotart+eet... 


n=0 





2. Like any series, a power series can converge or diverge. 


Example 1269. The power series 1+ 2+ 27+2°+... converges at x = 0, but diverges at 


C=. 





3. Every power series has an interval of convergence centred on 0. 


Example 1270. The power series 1+2+27+2°+... has interval of convergence (-1,1). 


4. Some functions can be represented by a power series. 


Example 1271. Let f:(-1,1) > R be the function defined by 


f(a) =——. 


For all x € (-1,1) = Domain f, we have 


1 
f(@)=7—alteta sats... 


So, we say that f can be represented by the power series 1+x+a7+2°+.... 





5. Term-by-term differentiation: We’ve previously been differentiating polynomial func- 
tions term by term. It turns out that if a function can be represented by a power series, 
then we can also do the same. 


Example 1272. Continuing with the last example, f’s derivative is the function f’ : 
(-1,1) > R defined by 


fi (v) 2142x0430? +4... 
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102. The Maclaurin Series 


We'll begin by learning to mechanically compute a special sort of power series that we call 
the Maclaurin series (expansion) (of a function). We’ll initially provide no motivation 
for what the Maclaurin series is, where it comes from, and why it matters. We'll treat 
you like a mindless monkey. (It is only in the next subchapter that we’ll learn why the 
Maclaurin series is important.) 


Example 1273. Consider the function f : (-1,1) > R defined by f (x) = ae 
= 4 
To compute the Maclaurin series (expansion) of f, we use these four steps: 


1. Compute all of f’s derivatives: 


f(a) = 


Oe = HT = 3 
aeae (a) = 7 i" (0) = 


Qps 2 ia CO ee 
$0 (0) = 7 pe) =a 


2. Evaluate each of f, f’, f”, etc. at 0: 


=1=0!, LOE a a 1=U, iO =— 


= FO (0) = —™ 


(3) = aa 
(0 e= 72 0)! =) om Go 


(Example continues on the next page ...) 





Here we interrupt the example to introduce a definition that will prove convenient: 


Definition 217. Let f be a function. We define the zeroth derivative of f as simply the 
function f itself and write 





(a 
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(... Example continued from the previous page.) 


3. For each n € Zj, the nth Maclaurin coefficient of f is denoted m,, and is defined 
by 


_ fF (0) 


n . 


n! 


And so, the Oth, 1st, 2nd, 3rd, and nth Maclaurin coefficients of f are 


fi vo - ely Se 
9= OT ob se ar, “ame oy” 


Oe £2). _ 2 


; a 7 7 | ae 
nN: nN: 





Sa ei 


Here it so happens that every Maclaurin coefficient for f is simply 1. As we’ll see, this 
will not generally be the case. 


. Given the function f, we now define its Maclaurin series (expansion) M to be the 
power series whose coefficients are simply the above Maclaurin coefficients: 


2 2 


ae ie ee 


co 
M (2) = ¥) mp2” = mo + mz + Moe 
n=0 





In the above example, we were given a function f. We then used f to write down a power 
series that we denoted M and called the Maclaurin series of f. At no point did we claim 
that there was any relationship whatsoever between f and its Maclaurin series M. That 
such a relationship does exist is something we’ll learn only in the next subchapter. 
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Example 1274. Consider the sine function sin. 


To compute the Maclaurin series for sin, we use the same four steps as before: 


1. Compute all of sin’s derivatives. 


Unsure of how to proceed, we try writing down the first few derivatives: 


ein = cos, sin” = — sin, sin) = - cos, sin) = sin, sin) = cos. 


We observe a cycle after every four derivatives. And so, we have 


fOr = OAS 
COs, for 7 = 15,9) e 
—sin, for 72 = 2.6, 10.24 


— COS, for = 3. 7, LI 
2. Evaluate each of sin, sin’, sin”, etc. at 0: 


tor 7 =U Se: 

for 72 1.5.9) :. 

for n= 2,6,10). 4 
—1, for we 3, (oll. 


3. For each n€ Zj, the nth Maclaurin coefficient of sin is 


O/n! ="0, for 7 = 0,46, .42, 

e eee tat, her iil, eee 
“ont Q/n!=0,  forn=2,6,10,..., 
-1/n!, fora 3.4% lle 


4. Thus, M, the Maclaurin series of sin, is defined by 


A) Pe aaa 1 1 
M (a) = > mine =O4 72+ 02° — Sa" + 0x! - Fa” 


n=0 : ij 





Formal definitions: 





463 Actually, we already did this in Example 1162. 
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Definition 218. Let n ¢ Zj. Suppose the function f is n-times differentiable at 0. Then 
the nth Maclaurin coefficient of f is denoted m, and is the real number defined by 


f°) (0) 


Te 


If f is also smooth (infinitely differentiable) at 0, then the Maclaurin series (expansion) 
of f is the power series denoted M and defined by 





2) n! 


For ease of reference, let’s write down the Four-Step Maclaurin Recipe that we used in 
the above examples: 





<= 02 f(n) (( 2 £20) 5, £O) 
My = er re ee = 7(O)+y" One . on eee) IO + 


Four-Step Maclaurin Recipe (for Finding Maclaurin Series) 


Let f be a function that is smooth at 0. 


1. Find each of f’s derivatives. 
2. Evaluate each of f, f’, f”, f’”, etc. at 0. 
3. For each n =0,1,2,..., the nth Maclaurin coefficient of f is 


_fO) _ 


SF nl 


4. The Maclaurin series of f is 


00 oo f£(n) (3) (n) 
M (a) = Soma = San =f FOr (0)« a oe et iE (0) e+ ’ pe sae 


3! n\ 





To repeat, so far, all we’ve learnt to do is this: 
Take a function (that is smooth at 0) and compute its Maclaurin series. 


We have not yet shown or even asserted that there is any relationship whatsoever between 
f and its Maclaurin series WM. This will be done only in the next subchapter. 


More examples: 
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Example 1275. Consider the exponential function exp. 


. We know that each derivative of exp is simply exp itself. That is, for every n € Zj, we 
have exp” x = exp. 
. For every n€ Zé, we have exp) 0 = exp0 = 1. 
. So, for every n € Zj, the nth Maclaurin coefficient of exp is 
exp(™ (0) 1 
eee Oe 


n! nl 


. Thus, WM, the Maclaurin series of exp, is defined by 


Sep), 1 1 1 af eae 
ee 


Mi(z)= ma =) 


—+ + sss ]l+e+—4+—1+... 
m0 n! oO! 1! | 2S: 


Exercise 390. Find the Maclaurin series for each function. (Answer on p. 1919.) 


(a) f:(-1,1) > R defined by f (x) = (1+2)", where n is any real number. 
(b) cos 
(c) g:(-1,1] >R defined by g(x) =In(1+72). 


Remark 148. Happily, your H2 Maths syllabus (p. 9) explicitly states, “Exclude derivation 
of the general term of the series”. I take this to mean that they’ll never ask you to derive 
the general nth term of a Maclaurin series. 


Instead, what your A-Level exams usually do is to just ask for the “first few terms”. And 
so, that’s what we’ll also usually do in this textbook. 


(So far, they’ve kept their promise of not asking for the general nth term of a Maclaurin 
series. But of course, it is always possible that they’ll break this promise and use this as 
their “creative” curveball question. ) 


Remark 149. Your H2 Maths syllabus and exams make no mention of the Taylor series. 
But just so you know, the Taylor series is simply a more general version of the Maclaurin 
series—namely, the Maclaurin series for a function f is simply the Taylor series for f 
centred at 0. 
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102.1. (Some) Functions Can Be Rep’d by Their Maclaurin Series 


Given a function that’s smooth at zero, we learnt to compute its Maclaurin series. We did 
not claim that there was any relationship whatsoever between a function and its Maclaurin 
series. We now learn that there is such a relationship (and this is why the Maclaurin series 
is important): 





Proof. Let f : D> Rbe our function and co, C1, C2, ... be constants such that 
f(v) ee +e t cor? +c30° + caxtt..., for every x ¢ D. 
By the Term-by-Term Differentiation Theorem, f is smooth and 


f' (a) =e + 2cge + 3c327 + 4egz?..., 

f" (2) = (2-1) e+ (3-2) cgx + (4-3) caz?..., 

fl! (x) = (8-2-1) eg 4 (4-3-2) cae + (5-4-3) esr? +..., 
f (2) = (4-3-2-1) 04+ (5-4-3-2) e530 + (6-5-4-+3) cpa? +..., 











f™ (2) =nlen + eo 





+1)! n+2)! n+3)! 
I ) Cy410 + ( y ) Cra gd? Pa 31 ) Cia Aces 
Now evaluate each of f, f’, f”,..., and f™ at the point 0: 
f (0) = f (0) =a, f (0) = Alea, 
‘a (0) = Cl, fo (0) = Sles, 
‘ia (0) = 2C9, : 
FR) (0) =3!cs, fo) (O) es 
(nr) (0 
Rearranging, we find that c, = I ) for each n€ Zo. 
n! 


Hence, in é each c, is simply f’s nth Maclaurin coefficient. Thus, f can be represented by 
its Maclaurin series. 
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i) 
Example 1279. Consider the function f : (-1,1) > R defined by f (x) = ae 
-x 


In H2 Maths and this textbook, we will not learn when or why a function can be rep- 
resented by a power series. Instead, we’ll simply and blithely assume that this is true 
whenever the need arises. 


So, here for example, let’s simply assume f can be represented by a power series (without 
knowing why). Then by Theorem 38, f can also be represented by its Maclaurin series. 


And as we already showed in the previous subchapter, this Maclaurin series is 


1 ae ee 


Conclude: By Theorem 38, f can be represented by the above Maclaurin series. 


That is, ae a Lee for every x € (-1,1). 


1 
Example 1280. Consider the function g: (-1,1) > R defined by g(x) = Goa) 

-2 
Again, let’s simply assume that g can be represented by a power series. In which case, 
by Theorem 38, g can also be represented by its Maclaurin series. 


So, let’s find g’s Maclaurin series (as usual, we use the Four-Step Maclaurin Recipe): 


1. The first three derivatives of g are 


5:4°-3 
g" (2) = 


g' (a) = G—2)* 


cole! 
(1-2)*’ 


2. Evaluate each of g, 9’, g”, g’” at 0: 
g(O)=1, = g(0)=3, sg" (0) =12, —g" (0) = 60. 
3. The first four Maclaurin coefficients of g are 


_ g(O) _ _ . 7 Z Oe 
eee a m aoe 


4. The Maclaurin series of g is 


12 34 6" £10n° 4 ce. 


Conclude: By Theorem 38, g can be represented by the above Maclaurin series. 


That is, Gia) = =1+30+627+100r7+..., for all ze R. 


sar. 
=a 
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Example 1281. Let n¢R and define the function h: (-1,1) > R by h(x) =(1+2)". 


Again, let’s simply assume that h can be represented by a power series. In which case, 
by Theorem 38, h can also be represented by its Maclaurin series. 


In Exercise 390(a), we already showed that this Maclaurin series is 


n(n=1) 9, n(m-1W(n-2) 3. 


l+nz + 7 3] 


Conclude: By Theorem 38, h can be represented by the above Maclaurin series. 


2 1-2 
That is, A(x) =(1+2)"=1l+nr+ a + mina Dn ?),9 for. all x € (1,1). 


Example 1282. Consider the exponential function exp. 


Again, let’s simply assume that exp can be represented by a power series. In which case, 
by Theorem 38, exp can also be represented by its Maclaurin series. 


In Example 1275, we already showed that this Maclaurin series is 


i x 
+£2+—+—+4.... 
=) 5p 7 3 


Conclude: By Theorem 38, exp can be represented by the above Maclaurin series. 


fe A 
That is, a for all xe R. 


If we have in particular x = 1, then 


ee 
ee 


Also, by Fact 63, e! = exp1. And so, we also have 


tee See ee | 
expl=e ol 


We’ve just proven Theorem 5 (given long ago in Part I of this textbook). 
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Example 1283. Consider the sine function sin. 


Again, let’s simply assume that sin can be represented by a power series. In which case, 
by Theorem 38, sin can also be represented by its Maclaurin series. 


In Example 1274, we already showed that this Maclaurin series is 


That is, for allaeR. 


Example 1284. Consider the cosine function cos. 


Again, let’s simply assume that cos can be represented by a power series. In which case, 
by Theorem 38, cos can also be represented by its Maclaurin series. 


In Exercise 390(b), we already showed that this Maclaurin series is 


Pere a Be 


Conclude: By Theorem 38, cos can be represented by the above Maclaurin series. 


That is, for allaeR. 


Example 1285. Consider the function 7: (-1,1] > R be the function defined by 7 (x) = 
In(1 +2). 

Again, let’s simply assume that 7 can be represented by a power series. In which case, by 
Theorem 38, 7 can also be represented by its Maclaurin series. 


In Exercise 390(c), we already showed that this Maclaurin series is 


That is, i(e)=In(L+2)=2->+ 








464There is actually a flaw in the argument here. Theorem 38 actually specifies that the function’s domain 
must be an open interval of the form (—R,R). In contrast, here our function’s domain is the right- 
closed interval (-1,1]. So, here Theorem 38 actually only enables us to conclude that i (a) = In(1+ 2) = 
oe 
+ +... for all x € (-1,1). 
2 3 4 
de a 
Theorem 38 does not tell us whether we also have i(1) = In(1+1) =1- Ue ae ase That is, 


whether 2 can also be represented by its Maclaurin series at its domain’s endpoint. 
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A391(a) XXX 





It turns out that this is a somewhat delicate technical issue. But for simplicity and for H2 Maths, we'll 
simply hand-wave and say that Theorem 38 also works at the endpoints. That is, we’ll always just 
blithely do the same as in the above example. 
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The Five Standard Series and Their Intervals of 
Convergence 


102.2. 


The following six equations are in List MF26 (p. 2), so no need to mug. 


The first of these six equations is simply the Maclaurin series of a general function f. 


The general Maclaurin series > 


+x)" =l+nx+t+ 


2 n 
f(x) =1(0) + xf'(O) + F£'O)+...+ FO +... 
! nN: 





n(n-1) pet n(n—-1)...n—r+l]) i 


(|x|<1) 


r! 














2: 3 r 
ee ee (all x) 
2! 3! r! 
The five “standard” ee a 7 a Cie (all x) 
Maclaurin series 31S! (2r +1)! 
2 4 ay 2r 
cosx=1 zt ea (all x) 
2! (2r)! 
2 3 = r+] or 
Inl+x)=x . a ee (-1<x<1) 
r 


The next five equations are the Maclaurin series for five specific functions. We already 
derived all five of these Maclaurin series on the previous pages. 


Your H2 Maths syllabus (p. 9) and exams*® call these five Maclaurin series the “standard” 
+ 466 
series. 


Observe that for each of these five “standard” series, List MF 26 also lists on the right (in 
parentheses) the corresponding interval of convergence. 


For each of the second, third, and fourth “standard” series (exp, sin, and cos), on the right, 
we have “(all x)”. This means that each of these three power series has R as its interval of 
convergence. That is, each of these three statements is true for all x € R: 


ae 
expx=1+z2+—+... 


2! 
aS a? 
TW ler ers 
sing =2- a +e 
a se 
Cs = 11> er 
2! A! 


In contrast, for the first “standard” series, on the right, we have “(|z| < 1)”. This means 
that this power series has (—1,1) as its interval of convergence. That is, 


n(n=1) 9, n(n-1(n-2) 3 


mI 3 + for,every x € (-1,1). 


(l+2)"2l+nc+ 





46°See e.g. Exercise 677 (N2017/I/1). 
46Rew other writers do the same. There’s nothing especially “standard” about these five series (other 
than that they frequently appear in your syllabus and exams). 
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But for any x ¢ (-1,1), = is false.467 


-] 
To repeat, in H2 Maths and this textbook, we shall not explain why 1+na + mn" D2 + 


. converges only on (-1,1). Instead, this is simply something that you the obedient 
Singaporean monkey are supposed to “know”. 





Similarly, for the fifth “standard” series, on the right, we have “(-1 <a <1)”. That is, this 
power series has interval of convergence (-—1,1]. That is, 


D of yt oh 4 
n(ite)=e- = Pe ts for every x € (-1,1]. 


However, for any x ¢ (-1, 1],2 is false.4 


rg at 
To repeat, in H2 Maths and this textbook, we shall not explain why x - os + a - 5 
6 
= +... converges only on (-1,1]. Instead, this is simply something that you the obedient 


Singaporean monkey are supposed to “know”. 





4°7Suppose for example that x = 2 and n= 1.5. Then the LHS of + ig 
(1+2)'° = 315 & 5.20. 
But the RHS of = is the following, which “clearly” diverges: 


.0(1.5-1 1.5(1.5-1)(1.5-2 
Sg 8S 





1+1.5-2+ 


So, + is false in this case (x =2, n=1.5). More generally, + is false for any x ¢ (-1,1). 
468Tt does remain though on List MF 26. 
469Suppose for example that x = 2. Then the LHS of ? is 


In (1+ 2) =In3 * 1.10. 


But the RHS of 2 is the following, which “clearly” diverges: 
2 2. 2 2 


i 
a° 9 a's 





So, 2 is false in this case (x = 2). More generally, 2 is false for any x ¢ (-1,1). 
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102.3. Sine and Cosine, Formally Defined 


In Ch. ??, we learnt about the right-triangle and unit-circle definitions of the sine and 
cosine functions. However, these definitions are considered informal. It turns out that to 
formally define the sine and cosine functions, there are several approaches. We'll use the 
most common approach, which is to use their power series:4” 


Definition 219. The sine function sin: R > R is defined by 


+1 


= air 


2 





With “some” work,*”! we can show that Definitions 219 and 220 correspond to our earlier 
unit-circle definitions. We can also show that under these definitions, all previous results 
involving sine and cosine still hold. 


Given Definitions 219 and 220, and the Term-by-Term Differentiation Theorem, we can 
easily find the derivatives of sine and cosine: 


Fact 211. sin’ = cos. 


3 ab 
Proof. By Definition 219, sinx = x - I + Tes —... for every x€R. 


By the Term-by-Term Differentiation Theorem, sin is differentiable and sin’ is defined by 


2 4 


sn v= 3, 7 ap =o A 











For every x € R, sin’ x = cosx (Definition 220). Hence, sin’ = cos. 





Fact 212. cos’ = -sin. 


x? tt 


— + a, le 
2! A! 
47Three other common approaches are to use the exponential function, and differential equations, or first 


define the arcsine function using an integral. 
“71See Ch. 146.19 (Appendices). 


Proof. By Definition 220, cosx = 1 - . for every x ER. 
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By the Term-by-Term Differentiation Theorem, cos is differentiable and cos’ is defined by 


1 2a 4ar* 7 e 
COS v= Or AL = = 31 4a 











For every x € R, cos’x = —-sin x (Definition 219). Hence, cos’ = - sin. 





Facts 211 and 212 have just shown that sin and cos are differentiable. By Theorem 29 
(Differentiability Implies Continuity) then, they are also continuous. We have thus proven 
the following result that was stated earlier in Ch. 87.4 and now reproduced: 


Fact 201. The sine and cosine functions (sin and cos) are continuous. 


We can now also provide a general proof for the First Pythagorean Identity (reproduced 
from p. s42): 


Fact 71. (First or The Pythagorean Identity) Suppose 0¢R. Then 


sin? 6 + cos? 6 = 1. 





Proof. Define f : RR by f (A) =sin? A+ cos? A. 


Then f is differentiable and its derivative f’: R > R is defined by f’(A) = 2sin Acos A - 
2cos Asin A = 0. 


By Proposition 8, f is a constant function. That is, there exists some c € R such that for 
all Ae R, 


f (A) =sin? A+ cos? A=c. 


Plug in A =0 to find that c= f (0) = sin?0 + cos?0 =0+1=1. Hence, for all Ac R, 





sin? A +cos? A= 1. 











Exercise 393. XXX (Answer on p. 1013.) 
A393. 
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102.4. Maclaurin Polynomials as Approximations 


Given a function f, its nth Maclaurin polynomial is simply its Maclaurin series up to 
and including the x” term. A bit more precisely, 


Definition 221. Let n«¢Z). Suppose the function f is n-times differentiable at 0. Then 
the nth Maclaurin polynomial of f is defined to be the following polynomial: 


PO) 2, FO 84 LO yn 





Mola y= Ym = f(0)+f' (0)a+ 


Not surprisingly, if a function can be represented by its Maclaurin series, then it can also 
be approximated by its Maclaurin polynomials. This is yet another useful application of 
the Maclaurin series. 

Example 1286. Consider the function f : (-—1,1] > R defined by f (x) = In(1+2). 


Previously, we already showed that the Maclaurin series of f is 





Fe 
M =2"- 
(io) Sa ao 
So, by Definition 221, the Oth, Ist, 2nd, 3rd, 4th, and 5th Maclaurin polynomials 
of f are 


eee 

Mo (x) = 9, M3 (2) =2-> +5, 
ee ha 

Mita) =a, Ma(x)=2-— +2 - 
2 3 

Ma (x) = 2-5, == 


Figure to be 
inserted here. 


In Example 1285, we argued that g can be represented by its Maclaurin series. So, not 
surprisingly, the first six Maclaurin polynomials of g serve as (decent) approximations of 


g. 
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Example 1287. Consider the sine function sin. 


5 
ames 
Previously, we already showed that the Maclaurin series of sin is M (x) = x- 31 tp a 


So, the Oth, Ist, 2nd, 3rd, 4th, and 5th Maclaurin polynomials of sin are 


Mo (ee rs 0, 
VEE) =o. 


Mae) =, 


Figure to be 
inserted here. 


In Example 1283, we showed that sin can be represented by its Maclaurin series. So, not 
surprisingly, the first six Maclaurin polynomials of sin serve as (decent) approximations 
of sin. 

ao ae 
By the way, observe that if x » 0, then we also have 31 x 0 and 5 «(Q. Indeed, each term 
in the Maclaurin series grows smaller. , 


And so, if x is small, then even low-degree Maclaurin polynomials should serve as “good” 
approximations of sin. 


Indeed, if x is very small (i.e. very close to zero), then we may simply assert that sin x Na. 


We call # the small-angle approximation for sine. 





In Exercises 394(c) and 402, we’ll similarly show that the small-angle approximations 
for cosine and tangent are 


2 


£ 
Copan Tees and tan rx 2. 


One might think that that a higher-degree Maclaurin polynomial is always a better approx- 
imation (than a lower-degree one). But unfortunately, this is not generally true, especially 
if x is far from zero: 
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Example 1288. Let x = 10, so that sinz = sin 10 x —0.544. 


Evaluating the 5th Maclaurin polynomial of sin at 10, we have 


103 9) 
Ms (10) = 10-=- +4 


Clearly, Ms (10) » 677 is a terrible approximation for sin 10 * —0.544. 


One might expect that the 9th Maclaurin polynomial does better. But this is not the 
case 


3 5 7 9 
Be ee ee aa 
BL col tote 8! 


Mg (10) = 10- 


While M; (10) was a terrible approximation for sin 10, it turns out that Mo (10) is even 
worse! 


Figure to be 
inserted here. 


It’s true that the Maclaurin series (which is the “ooth Maclaurin polynomial”) will even- 
tually get it exactly right: 


Gee Abe 
M (10) = 10 - — + — -.---=sin 10 = -0.544. 
3! 5! 
However, as the present example shows, as approximations, the Maclaurin polynomials 
can get worse before they get better. 


Exercise 394. For each function, write down its first four Maclaurin polynomials. Then 
sketch on a single figure the graphs of the function and the first four Maclaurin polyno- 
mials. (Answer on p. 
1921.) 


(a) exp 
(b) f:(-1,1) +R defined by f (x) = (1+ 2)”, where n is any real number. 
(c) cos 


The small-angle approximation for cosine is given by the 2nd Maclaurin 
polynomial—write it down. 
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102.5. Creating New Series Using Substitution 


Example 1289. From the second “standard” Maclaurin series, we know that 


9 
oo og a for all ze R. 


We can take the above statement and substitute “x” with any other expression, such as 


A ss and a6 lee “simi or ela 


2 3 
me (2r)" | (2x) 
ew =14+20+ mI + 31 +..., 


(2241)? (x24+1)° 
= too wee 


oe) Bee] 
; sin*x sin’x ; 
=l1+sinz+ T a 31 +... for all sinzéeR orzxveR. 


for all 2re€R orzeR. 
for all z?7+1¢€RorzveR. 


=1+(a2?+1)+ 


sin x 





3 Bie S45)" 3 
eS a a forall eR orgveR. 





In the above example, it so happens that the exponential function can be represented by 
its power series everywhere. That is, the interval of convergence is R. 


And so, we could substitute “x” with any other expression and obtain a new statement 
that continued to be true for all x ¢ R. That is, the new interval of convergence was still R. 


This is not the case in the next example: 
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Example 1290. From the fifth “standard” Maclaurin series, we know that 


3 


2 
no for all x € (-1,1]. 


Note that the interval of convergence is (-1, 1]. 


As in the previous example, we can substitute “x” with “2x”, but now we must be careful 
about finding the new interval of convergence: 


3 


Z ieee 
In(1+22)22-~ 4+ +4..., for all 2x € (-1,1] or xe (-5.5| 
er) 22 
11 1 
Note that the new interval of convergence is (-5. AR Although = holds for all x € (-1, 1], 
ate 
2 holds only for all x € (-5, >| 
22 
Similarly, we can substitute “x” with “a?4+1”, “sinx”, or ies ae But in each case, we 
must be careful about finding the new interval of convergence, which can get tricky: 


_@ + Wry 
2 3 


sin?x sin? x 


+ +..., 
2 3 


In(1+a?+1) 22741 forall le( 1c ae (2 


In (1+ sin x) < sinx - for all sin x € (-1, 1]. 


5 


in(1+ =) 2 De ey 


i 3 
re for all a (-1,1] or ve (-%, 


Here’s what the interval of convergence in each of these three statements says: 
« 2 holds for x =0 (and no other values)! 


« + holds for all real numbers x, except those for which sinx = —1.*7 
« 2 holds for all xe (-¥5, v5]. 








STU 
ong ==] => g= Es + 2k7 for some integer k. 
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Example 1291. Consider the function f : (0,2) > R defined by 
1 
f@)=— 
z 
It’s not obvious whether f can be represented by any power series. 


But take the first “standard” Maclaurin series: 


=l-gta*-az'+..., for all x € (-1,1). 
La 


Substitute “x” with “x -1”: 


1 1 
Tee ee ee for all x —1¢ (-1,1) or x (0,2). 





And so, for all x € (0,2) = Domain f, we have 


f(a)===1-(e-1)+(@-1)"-(@- 1+... 


Thus, we’ve just shown that f can be represented by the power series 1-(x — 1)+(a - ae 
Gai 





Example 1292. XXX 


Example 1293. XXX 


Example 1294. XXX 
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Exercise 395. For each given Maclaurin series, create four new series by substituting 
“x” with (i) “2x”; (ii) “x? +1”; (iii) “sinz”; and (iv) “x?/5”. In each case, take care to 
specify the new interval of convergence. (Answer on p. 1923.) 


5 45 
(a) sing =2-—+—-... forallxeR. 








b) (l+z Peden aD ie Nee. for all x € (-1,1). 
2) 3! 


joe. an 
(Cc) cosa = + pe ter ell acl, 


Exercise 396. For each function, find a power series representation. (Answer on p. 
1923.) 


(a) The function f : (0,2] + R defined by f (x) =Inz. 
(b) x 
(c) x 


Exercise 397. Let g be a nice function defined by g (x) = Suppose the domain 


wien 
of g contains zero and is the largest possible such that g can be represented by a power 


series. Find this power series and the domain of g. (Answer on p. 1924.) 
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102.6. Creating New Series Using Multiplication 


Given two (finite) polynomial functions, we know how to find their product function. Two 
quick examples: 


Example 1295. Let f,g:.R > R be the functions defined by 
J(@)=245 and g(x) = 2a? - 3. 


Then the function f-g:R-— R is defined by 


Ga) @)=7 @)e@) =(@ 45) (r= 3) or = Me" 37 15. 


Example 1296. Let h:[1,5] > R and 7: [3,7] > R be the functions defined by 
h(x) =22?-x2+1 and i(x) = x? + Qe. 
Then the product function h-i:[3,5]— R is defined by 


(ie)\@)=h@i@= (22 241) le tor\a le ade a 





By the way, note the domain of the product function h-i.47 


Very happily, given two functions that can be represented by power series, we can do 
something similar: 4” 


Theorem 39 (informal). Suppose the functions f and g can be represented by the power 


series A and B. Then the function f-g can also be represented by the power series that 
is the product of A and B® 








73 As explained in Ch. 20, Domain h-i = Domain hn Domain? = [1,5] 9 [3,7] = [3,5]. 

“4This informally stated result omits an important technical condition. For a formal (and correct) state- 
ment of this result, see Theorem 57 (Appendices). 

“Formally, we call this power series the Cauchy product of A and B. 
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Example 1297. Let f:(-1,1) > R be the function defined by 
i! 
f (2) = at 


Let g = exp-f. That is, let g: (-1,1) > R be the function defined by 


exp & 


g(x) = (exp-f) (2) = 


es 
From the first two “standard” Maclaurin series, we know that for every x € (-1,1), 


a ae i et 
expxr=lt+vt+—+—+... and ie) = =l+xr+a°+@°+... 
PA es 


So, by Theorem 39, g can be represented by a power series. Moreover, this power series 
is simply the product of the above two power series. To find it, let’s write 


1 il 
(141045074 Ha84...|(Le les la? 1 +...)- ag toe + cyt? + cg0° +... 


Comparing coefficients, we have 


oe © ea» 
@=ixls+ixt=2, 


i] 
i 


5 
a? 
1x 


i 8 
cjg=-x1l+-xl1lt+1xi1l+ =. 
6 2 3 


Thus, we conclude that the function g = exp-f can be represented by this power series: 
5 8 
= ee % 
2 3 


eX 


5 8 
That is, Oh Ga — =1+2x+ st a gt +for.all x € (-1,1) = Domain g. 
-2x 


By the way, we could’ve arrived at this same conclusion using the Four-Step Maclaurin 
Recipe and ‘Theorem 38, as was done in earlier subchapters, and which we’ll do on the 
next page: 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


By Theorem 39, g can be represented by a power series. And so, by Theorem 38, g can 
be represented by its Maclaurin series, which we now find: 


1. The first three derivatives of g are defined by 
expe | expe 


Ft eae=a(a)(1+ 5) 


s(x) “pale aa) 


DG ee ye ake g(x) 
a (1 =) (=e =) cea 


g (x)= 








2. Evaluate each of g, 9’, g”, g’” at 0: 


9(0) -P ena 


g' (0) =9(0) (1+) = 1-41) =2, 





g” (0) = of (O)(1+ hp) + Peta 22 24728 





5 po 2 + 2s = 16, 
(1-0) (1-0) 


3. The first four Maclaurin coefficients of g are 


g(0) 1 
oe 
SS ae ea 


/ 
Ss a) 
oe 


1) 


5 8 
4. The Maclaurin series of g is 1 + 2” + st ee 


So, g can be represented by the above Maclaurin series. 
; exp x a ee ; 
That is, Ge) = i =1+2xr+ sae +for.all x € (-1,1) = Domain g. 
—2£ 


Happily, this is the same conclusion as before (we’d be worried otherwise). 
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5 


Example 1298. Define the functions f :(-1,1) > R and g: (-S. 


oe a 
h(x) = (f-9) (@) = f (#) 9 () = 


Let h= f-g. That is, define hs (- v2 E)- > R by 





(2 ear 


In Exercises 392 and 397, we already showed that the functions f and g can be represented 
by these power series (respectively): 


1 a ee and 127? 4a S67 
And so, by Theorem 39, the function h can also be represented by a power series. More- 


over, this power series is simply the product of the above two power series. To find it, 
let’s write 


(tf2c por fae) (lage a4 = 62°42, } =Co t+ Ce + Cot? +0372 +... 
Comparing coefficients, we have 


C= bea, 
ee ee) 
Cg =1x(-2)+3x1=1, 
c3 =2x (-2)+4x1=0. 


Thus, we conclude that h can be represented by 
2 ee x 


i 


1 
That i ee 
at is, h(a) (Gaal +e 


= 1427 + forall? € e(=¥ v2 22) = Domain, 


By the way, just like in the last example, we could have arrived at this same conclusion 
by using the Four-Step Maclaurin Recipe and Theorem 38. You are asked to do so in 
Exercise 400. 
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Example 1299. Consider the function f : IR — R defined by 
f(z) = sina cosa, 


To show that f can be represented by a power series, observe that f = sin-cos. We 
already knew that sin and cos can be represented by power series: 


3 5 2 4 
; ) ae t= 
sinz=x-—+—-... and Coc = ee rior lle ek. 
Be! 2) Al 


And so, by Theorem 39, f can also be represented by a power series. Moreover, this 
power series is simply the product of the above two power series. To find it, write 


1. )=c0 + ea + on? + oy0 +... 


Comparing coefficients, we have 


ej= 0x l= 0, 
c, =0x04+1x1=1, 


e=0 (2) +1x0+0%1=0, 


il 
c= 0x 041% 5) 


Thus, we conclude that f can be represented by 


D 
Adee 78 t 


2 
0 Ox? - — 
oe gt 3 


2 
That is, f (2) =sinwcosr=x- 3a" +..., for all  € R = Domain f. 


By the way, just like in the previous examples, we could have arrived at this same con- 
clusion by using the Four-Step Maclaurin Recipe and Theorem 38. You are asked to do 
so in Exercise XXX. 
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Exercise 398. Define f :R— R by f (x) =sinzexpz. 


(a) Explain why f can be represented by a power series. 


(b) Use Theorem 57 to find a power series representation of f, up to and including the 
3 
> Germ, 


(c) Do the same as in (b), but now use the Four-Step Maclaurin Recipe and Theorem 
38. 


(d) Do your answers in (b) and (c) agree? (Answer on p. 1924.) 
Exercise 399. Define g: (-1,1) > R by g(x) =coszln(1+72). 


(a) Explain why g can be represented by a power series. 


(b) Use Theorem 57 to find a power series representation of g, up to and including the 
3 
7. Geni. 


(c) Do the same as in (b), but now use the Four-Step Maclaurin Recipe and Theorem 


(d) Do your answers in (b) and (c) agree? (Answer on p. 1924.) 
/2 


Exercise 400. As in Example 1298, define h: (- 


1 it 
(3 





h(a) = 


Use the Four-Step Maclaurin Recipe and Theorem 38 to find a power series representation 
of h, up to and including the x® term. How does your finding compare to the conclusion 
in Example 1298? (Answer on p. 1925.) 
Exercise 401. As in Example 1299, define f : RR — R by 


i (2) = sina cosa. 


Use the Four-Step Maclaurin Recipe and Theorem 38 to find a power series representation 
of f, up to and including the x® term. How does your finding compare to the conclusion 
in Example 1299? (Answer on p. 1926.) 





1026, Contents www.EconsPhDTutor.com 


102.7. Repeated Differentiation to Find a Maclaurin Series 


Your H2 Maths syllabus explicitly includes “derivation of the first few terms of the Maclau- 
rin series by repeated differentiation”. This isn’t really anything new or separate. Example: 


Example 1303. Consider the secant function sec. We can find its Maclaurin series using, 
as usual, the Four-Step Maclaurin Recipe: 


1. The first four derivatives of sec are defined thus: ® 


sec’ x =secxr tang, 


sec’ x = sec x tan? x + sec® x, 


sec!” x =secx tan? x + 2tanxsec’ x + 3sec® x tan x = secrtan® x + 5sec? x tanz, 


sec x =secx tant x + 3sec? xtan? x + 15sec? x tan? 7 + 5sec? x 
=secxtan* x + 18sec? x tan? x + 5sec? x. 


2. Observe that secO = 1 and tan0 = 0. So, 


secQ = 1, sec” 0 
sec’ 0 = 0, sec“ 0 
sec’ 0) = I, 


3. The first five Maclaurin coefficient of sec are 1, 0, 1/2, 0, and 5/4! = 5/24. 


4. The Maclaurin series of sec is*”® 


Exercise 402. Find the Maclaurin series of the tangent function tan, up to and including 
the «° term. The small-angle approximation for tan is given by the 1st Maclaurin 
polynomial—write it down. (Answer on p. 1926.) 








“"6Tt turns out that the interval of convergence here is (-5 ‘ . ) That is, 
1 5 ™ 7 
seor =1+ 5a° + 57a +... for all x e(-2,7), 


(Don’t worry, this isn’t something you need to know for H2 Maths.) 
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102.8. Repeated Implicit Differentiation to Find a Maclaurin Series 


Your H2 Maths syllabus includes “derivation of the first few terms of the Maclaurin series 
by repeated implicit differentiation”. We’ll try the example given in your syllabus: 


Example 1304. Consider the curve described by y? + y? + y 4? 29. 


The above equation implicitly defines y as a function of x. That is, even though we may 
not know how to, it is possible to express y as a function of x.477 


Let’s try to find the first few terms of the Maclaurin series of this function y. 


Plig = 0 inte 2; 
[y (0) +[y(O)P +y(0)=0?-2-0=0 or —_[y(0)] {[y (0)]° + (0) + 1} 0. 


Observe that 22+ z+1=0 has no (real) solutions. So, from 2, we have y (0) 2 0. 


d 
Apply the Ag operator to =: 
ae 


By? -y' + 2y-y' + y! = Qn —2 y' (By? + 2y +1) = 2x - 2. 
Plug x = 0 and y(0) 20 into 4: 
y’ (0) (3-07 +2-0+1)=2-0-2=-2 y! (0) 2-2. 
d 4 
Apply the — operator to =: 
ade 
y" (3y? + 2y +1) +y' (6y-y! + 2y’) =2 y"’ (By? + 2y + 1) + (y')? (By + 2) 22. 
Plug x = 0, y (0) 20, and y’ (0) 2 -2 into 2: 
" 2 2 = " ua 
y" (0) (3-0? +2-0+1) + (-2) (6-042) =2 ay (0) = 6. 
Altogether, we have y (0) 20, y’(0) 2 -2, and y” (0) 2-6. 


Hence, the first three Maclaurin coefficients of y are 0, -2, and —3. And the Maclaurin 
series of y is -2x -3a7+....4% 








“’This is the Implicit Function Theorem (see n. 414 on p. 897). 

“8This is a pointless and tedious exercise (and hence the sort of thing beloved by the Singapore education 
system). 
There is little point in simply computing the Maclaurin series for its own sake. What we might be 
interested in is the set of values for which y = —2x% — 327+... holds. Unfortunately, finding this set is 
well beyond the scope of H2 Maths (in fact, ’'m not even sure it can be done analytically). 
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Example 1305. Suppose the function f satisfies this equation: 
a[f(z)/ +eze™, 


Using repeated implicit differentiation, we can find the first few terms of the Maclaurin 
series of f. 


d 
Apply the ce operator once: 
be 


LF (a) + 2ef (2) fe) Hef f'(). 
Apply the = operator again: 
Qf (x) f(a) +2F (w) fi (a) + 20 {Lf (w) P+ f(x) £" (a) } 2 eM [F(a P + ef f" (2). 
Plug x = 0 into =: 
O[f (0) +e=0+e=e= ef f (0) 41. 
Next, plug x = 0 and f (0) 41 into 2: 
1+0=1=e'f"(0) f' (0) 2 =. 


Finally, plug x = 0, f (0) <1, and f’ (0) 2 1/e into 2: 


1 1 4 ie 1 3 
2-1--+2-1--+0=-= (=| a (== e) (0 "(O)=—5. 
a eae Nhe A) ce (U)D tears Os 


1 3 
Altogether, we have f (0) =1, f’(0) 2 -, and f”(0) = ai 
e C 


Hence, the first three Maclaurin coefficients are 1, 1/e, and 3/ (2e”). And the Maclaurin 


series of f is*” 








479This is again a pointless exercise. 
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Exercise 403. XXX 


A403. 
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(Answer on p. 1030.) 
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103. Antidifferentiation 


Antidifferentiation is simply the inverse of differentiation. 


Example 1306. Define the functions f,g:R— R by 


a and Or) =2e, 


The derivative of f is g. 
So, f is an antiderivative of g. 


And g is an antidifferentiable function (because it has an antiderivative). 


Definition 222. Let f and g be functions. If the derivative of f is g, then f is an 
antiderivative of g.*°° 


A function that has an antiderivative is called an antidifferentiable function. 





Example 1307. XXX 


Example 1308. XXX 





48°Qne perhaps subtle implication here is that f and g have the same domain. 
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103.1. Antiderivatives Are Not Unique ... 


Example 1309. Consider the functions g,h,i:IR > R defined by 


Nn an a= Sant (7 aa =O: 


Consider also the function f : R > R defined by f (x) = 2z. 
Observe that f is the derivative of g, h, and 2. 
Equivalently, the function f has antiderivatives g, h, and 7. 


This shows that antiderivatives are not unique. 





Suppose g is an antiderivative of f. If h differs from g by a constant, then h is also an 
antiderivative of f. More precisely, 


Fact 213. Let f be a nice function. Suppose g is an antiderivative of f. Ifh=g+C for 





some CER, then h is also an antiderivative of f. 


Proof. Let D be the domain of f (and hence also of g and h). 


forall ED. hi (x) = (0) + C= 9! (x) +029 (0) =F (2). 














So, the derivative of h is f. Equivalently, h is an antiderivative of f. 


103.2. ... But An Antiderivative Is Unique Up to a COI 


It turns out that the converse*®! of Fact 213 is also true. Though antiderivatives are not 
unique, they are unique up to a constant. 


That is, suppose g is an antiderivative of f. If h is also an antiderivative of f, then h differs 
from g by a constant. A little more precisely, 


Fact 214. Let f be a nice function whose domain is an interval. Suppose g is an an- 





tiderivative of f. If h is also an antiderivative of f, then h=g+C' for some CeR. 


Proof. Let i=g—h. Since g' =h' =f, i’ =g'-h'=0. 


Since the derivative of 2 is a zero function, by Proposition 8, 2 must be a constant function. 
That is, there exists C’ ¢ R such that 7 is defined by i (2) = C—or equivalently, h = g+C. 














Facts 213 and 214, combined: 





481 Actually, a partial converse, because we have now quietly added the requirement that f’s domain is an 
interval. 
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Corollary 47. Let f be a nice function whose domain is an interval. Suppose g is an 
antiderivative of f. Then 


h is an antiderivative of f 


h=g+C for someC eR. 


Example 1310. Define f,g:.R—R by f (x) = 22 and g(x) = 2”. 
Observe that g is an antiderivative of f. 


Suppose another function h is also an antiderivative of f. Then by Corollary 47, there 
exists C€ R such that 


Moa ga@jaO =a HC for all xe R. 
Example 1311. Let f,g,h:R-— R be functions. 
Suppose g is an antiderivative of f and is defined by 


g(x) =sin Cael 


We haven’t been told anything about f. But if told that h is also an antiderivative of f, 
then we know there must exist Ce R such that h may be defined by 


BG) = 6 (yO =sin (ce?) +C, 


Exercise 404. Define f:R— R by f(x) =2-3. (Answer on p. 1928.) 


(a) Find three antiderivatives of f. 


(b) Suppose you’re told that some function A: R > R is also an antiderivative of f. 
Then how must the function A be related to each of the three functions you found 
in (a)? 

Exercise 405. Define f,g,h: R > R by f(x) = 4sin4z, g(x) = -cos4z, and h(x) = 


8sin? x cos’ x. 


(a) Show that g and h are both antiderivatives of f. 


(b) The functions g and h are both antiderivatives of f, but look very different. Does 
this observation contradict Fact 214? (Answer on p. 1928.) 
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103.3. Shorthand: The Antidifferentiation Symbol [ 


Example 1312. Consider the function f : R > R defined by f (x) = 22. 
Here are three of f’s antiderivatives: 

¢ g:R-R defined by g(z) = 2”. 

¢ h:R-R defined by h(x) = 27+ 2. 

e i:R->R defined by i(z) = x? - 1. 


More generally, the antiderivatives of f are exactly those functions 7 : R > R defined by 
j (x) =27+C (for C eR). 


And even more generally, 


Suppose a function is antidifferentiable and defined on an interval. If this func- 


tion is defined by x & 2x, then its antiderivatives are exactly those functions 
defined by x x*+C (for C eR). 


The above italicised statement is precise, formal, and correct—but also sibei long-winded. 
So, let’s declare it to be exactly equivalent to this shorthand statement: 


jf ede=2? +. 





For the next three examples, assume D is an interval. 
Example 1313. The statement 
ij. 8a7 dz = 2° +C 


says that if the function f : D > R defined by f (x) = 32? is antidifferentiable, then its 
antiderivatives are exactly those functions g: D > R defined by g(x) = 2° +C (for C € R). 


Example 1314. The statement 

f cosa de =sinz+C 
says that if the function f : D > R defined by f(x) = cos is antidifferentiable, then 
its antiderivatives are exactly those functions g : D > R defined by g(x) = sina +C (for 


Ce R). 


Example 1315. The statement 


f 42° - 2x da = 24 -20+C 


says that if the function f : D + R defined by f (x) = 4x? — 2x is antidifferentiable, then 
its antiderivatives are exactly those functions g : D > R defined by g(x) = x*-2x+C 
(for Ce R). 
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In general, 


Definition 223. Let f(a) and g(x) be expressions containing the variable x. We shall 
write 


fF@ ce =o ey aiC 


to mean the following: 


Suppose a function is antidifferentiable and defined on an interval. If this func- 


tion is defined by x f(x), then its antiderivatives are exactly those functions 
defined by r++ g(x)+C (for C eR). 





In =, we call 
e The symbol [ (an elongated S) the antidifferentiation symbol. 


(Spoiler: Later, we’ll also call [ the integration symbol.) 
¢ f(x) the integrand. 


¢ x the (dummy) variable of antidifferentiation. 
(Spoiler: Later, we’ll also call z the (dummy) variable of integration.) 


¢ dz the differential of the variable x. (You can think of dz as a sort of punctuation 
mark.) 


Remark 155. The symbol [ f (x) dz is often called the indefinite integral of f. 


In this textbook, we'll avoid using the term indefinite integral, because it is highly 
confusing for two main reasons: 


1. Confusingly, different writers give this term many different definitions.**? 


2. It makes the relationship between differentiation, antidifferentiation, and integration 
that much more confusing. In particular and importantly, it leaves students confused 
as to why the Fundamental Theorems of Calculus have any substance and don’t just 
follow from definition. 


Unfortunately, the term indefinite integral remains in common use by other writers. 
Yow’re likely to encounter it elsewhere, which is why we even bothered briefly mentioning 
it here in this remark. 


(Where the need arises in this textbook, the symbol [ f (a) dz shall simply be called 


the antiderivative. ) 








482Here are just six: 
1. Larson and Edwards (2010): “The term indefinite integral is a synonym for antiderivative.” 


2. Hass, Heil, and Weir ( Thomas’ Calculus, 2018, p. 236): “The collection of all antiderivatives of f is 
called the indefinite integral of f with respect to x, and is denoted by 


7 Tie) da” 
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3. Shurman (2012, p. 293): “Let I be a nonempty interval in R, and consider a function 


f:IlrR 


b 
such that f exists for alla,be I. An indefinite integral of f is a second function 


F:I-+R 


such that 


[ f=FO-£0) for alla,be TI.” 


4. Priestley (1997): “Let g ¢ C[a,b]. Define the indefinite integral of g to be G, where 


G(z)= [ow 


5. Hughes et al. (1998): “All antiderivatives of f (x) are of the form F'(a#)+C. We introduce a notation 
for the general antiderivative that looks like the definite integral without the limits and is called the 
indefinite integral: 


f[F@ dz = F(x) +C” 


6. Herman & Strang (2017, p. 487): “Given a function f, the indefinite integral of f, denoted 


f fo) a, 


is the most general antiderivative of f.” 


483Remark 144. 
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We’ve been using the symbol xz as our (dummy) variable of antidifferentiation. As usual, 
we can replace x with any other symbol: 


Example 1316. The statement i 2e¢ dx = x2+C is equivalent to any of these statements: 


f 2udy=y+C, f 22dx=2+C, [ 2000 -@+¢, feo do=2+C. 





Exercise 406. XXX (Answer on p. 1037.) 
A406. 
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103.4. Rules of Antidifferentiation 
On the next page, we’ll formally state some Rules of Antidifferentiation. But first, 


we'll illustrate them with some quick examples. You should find these Rules familiar from 
secondary school: 


Example 1317. (Constant Rule) [dae =52+C. 


Example 1318. (Power Rule) es d= a. +C. 


1 LA 
Example 1319. (Power Rule) ee Gae= 30 +C=--—+C. 





323 


Example 1320. (Reciprocal Rule) [ae =In|z|+C. 
Example 1321. (Exponential function) | expzrdz =expx+C. 
Example 1322. (Sine) i sing de = —cosa + C. 

Example 1323. (Cosine) i: cos¢dz =sinz+C. 


Example 1324. (Sum Rule) i sing +expxdx =-—cosxr+expxr+C. 


1 
Example 1325. (Difference Rule) f- —cosxdz =In|z|-sinx+C. 
ao 


5 1 
Example 1326. (Constant Factor Rule) f 504 dz= ou Cc. 
“a 





1 
Example 1327. (LPC Rule) i cos (2% + 3) dx = 5 sin (27 +3)+C. 


Example 1328. (LPC Rule) i exp (27 + 3) dz = 5 CxP (27+3)+C. 


i 
Example 1329. (LPC Rule) i (27 +3)" dz = ia (Ee) NC, 
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d 
Fact 215. Let f (x) and g(x) be expressions containing the variable x. If Aro y= (ay. 
a 





then fe dx =g(x)+C. 











Proof. See p. 1704 (Appendices). 





These Rules of Antidifferentiation are, of course, simply the inverse of our earlier Rules of 
Differentiation. 


Theorem 40. (Rules of Antidifferentiation) Suppose k ¢R. Then 





(a) fe dz =kr+C (Constant Rule) 
(b) ie dge = jor k = —lwonde = 0 k <0) (Power Rule) 
(c) [- dz = In|z|+C, forx+#0 (Reciprocal Rule) 
(d) [ 10s ep (Exponential Rule) 
(e) [ SUL 0) = (COIS 1 1G (Sine) 
(f) i Cosa = sina © (Cosine) 
Now suppose also that f and g are functions with antiderivatives F and G. Then 
(g) i (ft+g) (x) dx =F (x) +G(xz)+C (Sum and Difference Rules) 
(h) far (i) di= hl @)4-C (Constant Factor Rule) 
(i) [i (ax + b) dx = “F (ax +b) +C, for a,beR with a #0 (LPC Rule) 
Proof. By Fact 215, to prove that [i (x) dx = F' (x) + C, show that —F (Pye 7 Ge): 


Here we'll do only (c), which is a little trickier: 
(c) By the Natural Logarithm Rule for Differentiation, 


ae es for x > 0, 
d dx x 
ae |r| = 
L d -1 1 
qzit(-*)=— =, for x <0. 











Exercise 407 asks you to prove the remaining Rules of Antidifferentiation. 
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Remark 159. For lack of a better name, I shall call the last Rule the Linear Polynomial 
Composition (LPC) Rule. 


When written out formally, it looks complicated. But as illustrated by the last three 
examples, it’s jolly simple and you'll already have seen plenty of it in secondary school. 


Remark 160. To repeat, each of the Rules of Antidifferentiation in Theorem 40 is merely 
shorthand for a precise but long-winded statement. 


For example, Theorem 40(a) states, “ [ k da =kx+C”. This is merely shorthand for the 


following precise but long-winded statement: 


Suppose a function is antidifferentiable and defined on an interval. If this func- 
tion is defined by x # k, then its antiderivatives are exactly those functions 
defined byx+kx+C (for C eR). 


Remark 161. In Theorem 40(a), in the case where k = 1, we have 


fp dr@xx+C. 


By convention, we may write [ 1dz more simply as [ dx. 


So, @* may also be rewritten more simply as 


Exercise 407. Prove Theorem 40(a), (b), and (d)—(i). State any Rules of Differentiation 
that you use. (Answer on p. 1928.) 


Exercise 408. Mimicking Remark 160, write out the precise but long-winded versions 
for each of Theorem 40(b)—(i). (Answer on p. 1928.) 


Exercise 409. Let a,b,c,d¢R. Find the following. (Answer on p. 1929.) 


(a) [w+ baw 
(b) fax? + be + eda 
(c) if e0? + ba? + cx + dda 


(d) i (ax +b)° dx (for c#-1) 








484Unless of course we already assume that x > 0. 
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103.5. Every Continuous Function Has an Antiderivative 


Happily, every continuous function is antidifferentiable: 


Theorem 41. Let a <b and f : [a,b] > R be a function. If f is continuous, then f is 


antidifferentiable. 














Proof. We'll prove this later (Exercise 418). 





Example 1330. XXX 


Example 1331. XXX 
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103.6. Be Careful with This Common Practice 


One Common Practice is to use the symbol “ [ f (x) dz” to denote any antiderivative 
of f, 


And so, under this Common Practice, we may write 
Lie 2 Bs 4. 
fede ta ; jf exde2a +1, f coswde * sine, or f cosade * sina -5.x 
And more generally, we may write 


f £@) ax 2 g(x). x 


This Common Practice is convenient and perfectly fine, so long as one keeps in mind that 
antiderivatives are unique—but only up to a constant. 


But if this is forgotten, then the Common Practice can lead to error, such as in this “proof” 
that.0 = 1: 


“Since [ or dx +x? and [ Qe dx 2 x? + 1, we have x? = x7 +1 and hence 0=1.” X 


The error in the above “proof” is easy enough to spot and hence avoid. But when we have 
more complicated problems,**° it is harder to avoid similar errors. 


It’s mostly OK to follow this Common Practice. But you should be careful. In particular, 
you should always bear in mind that antiderivatives are unique—but only up to a constant. 





485See e.g. Exercises 431 and 432. 
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103.7. The “Punctuation Mark” dz (optional) 


Actually, the “punctuation mark” dz is a little unnecessary: 


2 
Example 1332. Instead of ib ede = ~ +C’, we could more simply write 


9 
fo=See. 
Z 


It is clear from the context that x is the (dummy) variable of antidifferentiation. Hence, 
the “da” is a little unnecessary. 


Example 1333. Instead of i f (a) dx = g(x) +C, we could more simply write 


[ f@=9(@) +0. 


It is clear from the context that x is the (dummy) variable of antidifferentiation. Hence, 
the “da” is a little unnecessary. 


Indeed, even the “(x)”s are a little unnecessary. And so, we could even more simply write 


f feg+e 


Nonetheless, the “punctuation mark” dz does serve two purposes: 





1. It makes clear that the (dummy) variable of antidifferentiation is x. (This is especially 
helpful for reducing confusion when we have more than one variable or symbol.) 


2. It serves as a clear marker for where the integrand ends. (This can reduce the need for 
parentheses. ) 


Illustrations: 


1 k2y? 


Example 1334. Instead of i ke x dx a +C', we could more simply write 


Oe 
feo 6, 


However, with Zo the reader is faced with an ambiguity: It’s not obvious whether in 


: [ kx”, k or x is the variable of antidifferentiation. She is expected to figure out for 


herself that it is x and not k that is the variable of integration. 


To save the reader such troublesome ambiguity, we prefer to use the “punctuation mark” 
dx. That is, we prefer to write + rather than =. 
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Example 1335. Instead of [ Qe+1drex2+x+C , we could more simply write 


foct2ar+esc. 


However, with 2 the reader is faced with an ambiguity: It’s not obvious whether i 20 + 


I= [ 2-4 lide or [ 2r+1 = vi 2¢dx+1. To clarify matters, we may wish to add 


parentheses to 2 and instead write 


f Qe+1)2aeaee, 


Statement 2 is perfectly good and clear, but does require the use of an additional pair of 


parentheses. So, one might prefer to just stick with é. where the “punctuation mark” dx 
clearly where the integrand “22 +1” ends and eliminates any need for the parentheses. 
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Calculus may be divided into two branches, the differential calculus and 
the integral calculus—or more simply, differentiation and integration. 
These two branches correspond to two geometric problems: 


Differentiation is the problem of Integration is the problem of 
finding the derivative, finding the definite integral, 
or the gradient of a curve. or the area under a curve. 


Figure to be 


inserted here. 





So far, we’ve been looking only at differentiation. 

In the following chapters, we’ll look at integration. 

As we'll learn, differentiation and integration are two sides of the same coin. 
Specifically (spoiler alert), we’ll learn that 


Integration and antidifferentiation are the “same thing”. 


Equivalently, integration and differentiation are inverse op- 
erations. 
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104. The Definite Integral 


Integration is the problem of finding the definite integral, which is the area under a 
curve. 


Example 1336. Define f : [0,9] > R by f (v7) = Vv+1. 


Figure to be 
inserted here. 


The definite integral of f from 0 to 1 is the red area and equals 5/3. There are three ways 
to denote a definite integral: 


The definite integral of f from 2 to 4 is the blue area and equals approximately 5.45: 


4 4 4 
f f (x) dar = 5.45 f ees i eee 
2 


How do we find the red or blue areas? 


Right now, let’s pretend we have no idea (even though we actually already learnt to do 
this in secondary school). We’ll revisit this question only in the next subchapter. 





In the previous chapter, we already used the symbol [ as our antidifferentiation sym- 
bol. Here, we again use the same symbol. 


But now, in this new context, this symbol [ will instead be called the integration 
symbol. 


The symbol [ thus serves double duty, once as the antidifferentiation symbol and again 
as the integration symbol. 


For now, you should find it deeply puzzling that we use the exact same symbol 7 in two 


completely different contexts. This puzzle will be resolved in the next chapter, when we 
learn about the Fundamental Theorems of Calculus. 
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Example 1337. Define the function g: R > R by g(x) =x -2. 


Figure to be 
inserted here. 


We treat any area under the z-axis as negative. (And so, the definite integral is actually 
the signed area under a curve.) 


The definite integral of g from 0 to 1 is the red area and equals -1.5: 


1 1 1 
[ ge) dz=—l.5 i gdx = -1.5 [ g=-1.5. 
0 0 0 


The definite integral of g from 2 to 4 is the blue area and equals 2: 


4 4 4 
ff 9(2) de=2 ff, 9de=2 io 


We can actually compute these areas using primary-school geometry: 


1 1 ! Base x Height 1x1 1 
d =) d = = = = 
i Deis eas fe 2 2 2 


4 4 4 B Heigh ie aes 
ee ee ee 
2 2 2 2 Z 











You are probably most familiar with this notation for the definite integral: 


[to dx. 


We call 

e The symbol [ the integration symbol (it is simply an elongated S); 

e The numbers a and b the lower and upper limits of integration; 

e The function f to be integrated the integrand; and 

¢ The symbol dz the differential of the variable x—it tells us that the (dummy) 


variable of integration is 7. 


To repeat, the symbol [ does double duty: In the previous chapter, [ was the antidiffer- 
entiation symbol; in this chapter, ‘i is the antidifferentiation symbol. This is something 
you should find puzzling and which we’ll resolve only in the next chapter. 

As usual, x is merely a dummy variable that can be replaced with any other symbol. 
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When describing the definite integral of f from a to b, what really matters is that we 
specify (i) the function f; and (ii) the lower and upper limits a and b. The symbol we 
use to denote the dummy variable or variable of integration doesn’t really matter—it is 
customarily x but could be any other symbol like y, t, u, or even ©. So, we could write 


ft@a-= fro f roa= [sayau= [7(@) a0. 


b 
Indeed, the “(x)” and even the “dx” are somewhat superfluous. Instead of [ Gams 


we could write 


b b 
7 Oa, or even more simply, f. 


Exercise 410. XXX (Answer on p. 1048.) 
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104.1. Notation for Integration 


As we saw earlier (Ch. 89), in the differential calculus, there are (at least) three commonly 
used pieces of notation (due to Leibniz, Newton, and Lagrange). 


In contrast, in the integral calculus, the only notation that is still commonly used (and 
which we use) is Leibniz’s: 


[ f@a or [few or fe: 


Fun Fact 


To denote the derivative of f, Newton placed a dot above—f. 


| 
To denote the integral of f, he similarly placed a vertical line above—f. But as just 
mentioned, this notation is now rarely (never?) used. 
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104.2. The Definite Integral, Informally Defined 


We formally define the definite integral only in Ch. 146.22 (Appendices). Nonetheless, 
just to provide a little clarity and precision, here’s an informal definition anyway: 


Definition 224 (informal). Let a <b and f be a function that is continuous on [a, }]. 
Consider the “area” bounded by f/f, the x-axis, and the vertical lines x = a and x = b. This 
“area” is a real number that we call the definite integral of f from a to b and denote by 


fi f(a) ae f° fax or ie 


Figure to be 
inserted here. 


We call the symbol i the integration symbol; the numbers a and 6 the lower and upper 


limits of integration; the function f the integrand; x the variable of integration; and the 
symbol dx the differential of x. 





The above definition is informal because we haven’t formally defined what “area” is or how 
it might be computed. In the next subchapter, we’ll make a sketch of how this might be 
done. 


b 
Above, we’ve defined [ f in those cases where a < b. It will be convenient to also cover 
a 


those cases a = 6 or a> b: 
Definition 225. Let a<b and f be a function that is continuous on [a, }]. 


(a) The definite integral of f from a to a is denoted i f and is defined by 


et 


(b) The definite integral of f from b to a is denoted i f and is defined by 


a b 
[rf 
b a 
Example 1341. XXX 
Example 1342. XXX 
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Exercise 411. XXX (Answer on p. 1051.) 
AAl11. 
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104.3. An Important Warning 


In O-Level Additional Maths, you may have been taught that integration is, by definition, 
simply the inverse of differentiation.**° 


That approach is wrong,**” confusing, and detrimental to your understanding of calculus. 
Indeed, that approach often leaves students confused and unable to understand why the 
Fundamental Theorems of Calculus (to be covered only in the next chapter) have any 
substance. 


We will not use that incorrect approach in this textbook. To repeat, 


Differentiation is the problem of finding the gradient of a curve. 


Integration is the problem of finding the area under a curve. 


Figure to be 


inserted here. 





A priori,**® there is no reason to think that a curve’s gradient is in any way related to the 
area under the curve. Equivalently, a priori, there is no reason to think that differentiation 
and integration are in any way related. 


That they are related is established only with the two Fundamental Theorems of Cal- 
culus (FTCs). The FTCs are an important landmark in the history of mathematics and 
indeed the history of humanity. 


We will now work our way towards the FTCs. Don’t worry, we’ll omit most of the yucky 
technical details. The goal here is merely to provide you with some intuition and hence a 
better understanding of why the FTCs work. 





*86Tndeed, on your 4047 A Maths syllabus (and again on your H2 Maths syllabus), the very first mention 
of integration states, “integration as the reverse of differentiation”. 

“87Of course, a definition can never be wrong (unless self-contradictory). We can after all assign to any 
object or idea any name we like. What is wrong is to favour an approach that causes more confusion. 

488 4 priori is just a fancy Latin phrase for beforehand. 
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104.4. A Sketch of How We Can Find the Area under a Curve 


Example 1343. Define the function f :R > R by f (x) = 27+ 1. 
The definite integral of f from 0 to 16 is the red area and is denoted 


16 16 16 
ib i ae) da, i fda, or . 2 


Figure to be 
inserted here. 


16 il 
Suppose we’re asked to show that [ pat os How might we proceed? 
0 


One possibility is to try a crude approximation: Construct a rectangle with width 16 and 
height f (8) =8?+1=65. Call its area Ay. We have 


A; = 16 x f (8) = 16 x 65 = 1040. 
16 
So, our first crude approximation of i f is A, = 1040. 
0 


16 
We call the difference between A; and u f the approximation error and denote it 
0 
by Cilks 


16 1 1 
a=| f f-Ai = |13815 - 1040] = 3415, 
0 3 3 


Can we improve on this first crude approximation? (Equivalently, can we reduce the 
approximation error?) 


Sure. One obvious possibility is to use more rectangles: 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Figure to be 
inserted here. 


Construct two rectangles, each with width 8, but one with height f (4) = 47+ 1=17 and 
the other with height f (12) = 12?+1=145. Call their areas A; and Ag. We have 


Ap =8% fj (4) =8 x 17 = 136 and Asx = 8 « f (12) =8« 145 = 1200, 
So, the total area of these two rectangles is 


A» = Ao + Aoo = 136 + 1200 = 1336. 


1 
Previously, our approximation error was e€; = vale Our new approximation error is 


smaller: 


16 1 1 
a-| [ aes = |1 3815 - 1336] = 45~. 
0 3 3 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Figure to be 
inserted here. 


For our third crude approximation, we construct four rectangles, each with width 4, but 
whose heights are 


P2225, G6 ar 0) 0 10a C4 i 97, 
Call their areas Aq,, Ago, Aq3, and Ay,. We have 


Ag =4%f (2) = 20, Aw =4 x 7(6) = 148, Ay =4 = f (10) = 404 and 
Aga =A ci (14) = 7105. 


So, the total area of these four rectangles is 


Ay = Ay + Ayo + Ayg+ Aggy = 20+ 148 + 404 + 788 = 1360. 


ih 1 
Our previous two approximation errors were €1 = 3 and e€9 = HUG Our new approxi- 


mation error is smaller: 


16 1 1 
a-| [ are = |1 3815 - 1360] = 212. 
0 3 3 


As you can probably tell, we can use ever more rectangles to find ever better approxima- 


16 
tions for i f. Exercise 412 continues with this example. 
0 


Exercise 412. Continue to define the function f : R > R by f (x) = x7+1. Approximate 
f using eight rectangles, each with width 2. Let Ag be the total area of these eight 


0 
rectangles and eg be the corresponding approximation error. Is this fourth approximation 
an improvement over our previous three approximations? (Answer on p. 1930.) 





Sketched in the above example and exercise is the main idea underlying integration: 


Use thin rectangles to approximate the area under a curve. 


With more (and thinner) rectangles, our approximations improve. 
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Figure to be 


inserted here. 





More precisely, integration (which is the problem of finding the area under a curve) involves 
these steps: 


b 
Suppose we want to find i; f, the definite integral of f from a to b. 


1. Partition the interval [a,b] into n rectangles of equal width. Each rectangle’s height is 


the value taken by the function at the rectangle’s midpoint.**® 


2. Let the total area of these n rectangles be A,.°° We observe that A, serves as an 
b 
approximation for [ ae 


3. As n (the number of rectangles) increases, A, becomes an ever better approximation for 
ie 


b 
4. Indeed, it is possible to prove that as n > oo, A, > [ f, or equivalently, 
a 
b 
lim A,, = ie 
nN Co a 


b 
It turns out that formally, we shall simply define [ f—the definite integral of f from a 
to b—to be lim A,,. : 
nN co 
But don’t worry. For H2 Maths, these are technical details that you needn’t worry about. 
491 The above informal and intuitive explanation of how integration works should more 


than suffice. (For a recent A-Level exam question that asks for such an explanation, see 
N2015-I-3.) 














48°Snecifically, each rectangle has width ue and the ith rectangle has height f (« + ( - 5) be *) (for 
n n 
b- 
i=1,2,...,n). Hence, the ith rectangle has area yee (« + ( - 5) *). 
n n 
a b- a b-a 
490 A — lone 
. pe n (a+(i as n ) 


41But see Ch. 146.22 (Appendices) if you’re interested. 


1056, Contents www.EconsPhDTutor.com 


104.5. Ifa Function Is Continuous, Then Its Definite Integral 
Exists 


Figure to be 


inserted here. 





Formally, 


b 
Theorem 42. Let a<b. If the function f is continuous on [a,b], then [ f exists. 


a 








Proof. See p. 1706 (Appendices). 
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104.6. Rules of Integration 


Theorem 43. (Rules of Integration) Let a <b, c€ (a,b), and d,e¢ R. Suppose the 
b b Cc 
functions f and g are continuous on [a,b], so that by Theorem 42, [ He [ g; i ie 


b 
and f f exist. Then 


(a) ie (ftg9)= ft + foo (Sum and Difference Rules) 
(b) We = [fe ie (Adjacent Intervals Rule) 
(c) iC (qj y= afr. (Constant Factor Rule) 

b 
(d) i d=(b-a)d. (Constant Rule) 


(e) If f>g on [a,b], then i, » foo (Comparison Rule I) 
(f) Ifd< f(x) <e for every x €[a,b], then 


b (Comparison Rule IT) 
Gene i} Fe ale. 





Proof. For the formal proofs, see p. 1707 (Appendices). Here we give only informal 
“proofs”-by-picture: 


(a) Consider the function h = f+g. “Clearly”, the area under the graph of h must be equal 
to the sum of the areas under the graphs of f and g. 


Figure to be 


inserted here. 





Similarly, consider the function 7 = f —g. “Clearly”, the area under the graph of 7 must be 
equal to the difference of the areas under the graphs of f and g. 


b ¢ b 
(b) “Obviously”, [/ fe f fe f f 4% 





492\WVhile this may seem intuitively “obvious”, formally proving it is somewhat less so. 
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Figure to be 


inserted here. 





(c) Stretch the graph of f upwards by a factor d. The area under the new graph must be 
d times the area under the graph of f. 


Figure to be 
inserted here. 





b 
(d) “Clearly”, [ c is simply the area of a rectangle with base b-a and height c. So 


[c= @-ae. 


Figure to be 
inserted here. 





(e) If f is everywhere on or above g, then the area under f cannot be smaller than that 
under g. 


Figure to be 


inserted here. 





(f) The numbers c and d serve as lower and upper bounds for f on the relevant interval 


b 
(a,b). And so “obviously”, 7 f, the area under the graph of f from a to b, is bounded 


from below and above by the rectangles with base b-a and heights c and d. 
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Figure to be 


inserted here. 





Actually, it’s not difficult to formally prove (f) and you are asked to do so in Exercise 
XXX. 














Exercise 413. This exercise will guide you through a proof of Theorem 43(f) (Compar- 
ison Rule II). 


Let d,e € R. Define the functions h,i: [a,b] > R by h(a) =d and i(2) =e. 
b b 
(a) What are i h and | a? 


b b b 
(b) What can we say about i ie i h, and i a 
(c) Hence complete the proof of Comparison Rule II. (Answer on p. 1930.) 


Exercise 414. Let f : [a,b] > R bea continuous function. Suppose f > 0 on [a,b]. Prove 


b 
that i f >0. (Hint: Define h: [a,b] > R by h(x) =0.) (Answer on p. 1930.) 


Exercise 415. The Constant Factor Rule of Antidifferentiation (Theorem 40) states that 
ee) dx =kF (x) +C. 


The Constant Factor Rule of Integration (Theorem 43) states that 


fape-af's. 


Are these two Rules the exact same thing? Why or why not? (Answer on p. 1930.) 





1060, Contents www.EconsPhDTutor.com 


104.7. Term-by-Term Integration (optional) 


In Ch. 91.3, we discussed term-by-term differentiation. 
In this subchapter, we will analogously discuss term-by-term integration. 


Let a< band fo, fi, fo,... be nice functions that are continuous on [a,b]. 


b b b 
By Theorem 42, ‘A fo, i: Fis [ fo, ... exist (i.e. are equal to real numbers). 
a a a 


Suppose we define a new function g by 
g=fotfi. 
b 

Then by the Sum Rule (for Integration), [ g exists, with 

b b b 

foo-f forf on 
Similarly, suppose we define a new function h by 
h= fot fit fa. 


b 
Then again by the Sum Rule, i. h exists, with 


fore fiir fn [fp 


More generally, for any n € Z", suppose we define a new function 7 by 


i=Dfisfot fet fu 
i=0 


b 
Then by the Sum Rule, i i exists, with 


fie fey fae [ir faces [i t 


1 
In =, we 


b n 
interchange integration [ and finite summation y 
a i=0 
We see that ifa function 7 is the finite sum of n other continuous functions defined on 
[a,b], then [ 2 also exists and is equal to the sum of those n functions’ definite integrals 
from a to b. | 


Now, suppose instead we define a new function 7 by 


Se oe 


i=0 
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b 
Q1. Does [ 7 exist? 
Q2. Is it the case that 


[a- [02d [ae fae fae Pav 


In particular, is 2 legitimate? That is, can we 


CO 


d 
interchange integration = and infinite summation > 
- i=0 
Is this interchange operation legitimate? 


It turns out that the answer to Q1 and Q2 is, 


Yes, but only if certain technical conditions are met. 


In particular, if certain technical conditions are met, then, as was done in =, we can 


d foe) 
interchange differentiation dn and infinite summation Sr 
L i=0 


Fortunately, these “certain technical conditions” are beyond the scope of H2 Maths and 
we shall not discuss them in this textbook. Based on past-year A-Level exams, it would 
seem that your A-Level examiners simply assume that the interchange operation is always 
legitimate—so, at least for H2 Maths, there is probably little danger if you also do likewise. 


In Ch. 108, we'll revisit the questions just asked, but in the special case where 7 is an 
“infinite polynomial function”. 


Exercise 416. This exercise guides you through a counterexample showing that we can- 


not always interchange integration i and infinite summation > (Answer on p. 


@ i=0 


1062.) 
A416. 
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105. The Fundamental Theorems of Calculus 
105.1. Definite Integral Functions 


b 
So far, we’ve only defined an object denoted [ f and called the definite integral of f from 


b 
a to b. This object, [ f, is a real number. 


We now use this number to define a function that we call the definite integral function: 


Example 1344. Define f : [0,8] > R by f (x) = 32x? +2. 


Figure to be 
inserted here. 


The definite integral function of f from | is the function g: [0,8] — R defined by 


g(x)= ff. 


3 5 0 1 
So for example, g(3) = i gs = i J, and g(0) = i (i -f f are the areas 
1 1 1 0 
depicted above. 
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Example 1345. Define f :[-5,7] > R by f (x) = 27 +1. 


Figure to be 
inserted here. 


The definite integral function of f from 0 is the function g:[-5,7] > R defined by 


g(x)= fof 


1 3 5 
So for example, g (1) = i f=¢@)= i jf = 12; and g(5)= i f =30. Also, 


Definition 226. Let D be a closed interval,*”? f : D > R be a continuous function, and 
ce D. The definite integral function of f from c is the function g: D > R defined by 


g(x)= fof 


Exercise 417. Define f : [-2,5] > R by f(x) = 2%+1. Let h be the definite integral 
function of f from 2 and 7 be the definite integral of f from 2 to 3. 

(a) What is h? 

(b) Evaluate A at 3, 5, 0, and -2. 

(c) What is 7? 

(d) Evaluate i at 3, 5, 0, and -2. (Answer on p. 1932.) 


























*°3Somewhat strangely, we include R = (—oo, 00) as a closed interval. Also, if a ¢ R, then (—oo,a] and 
[a,co) are also included as closed intervals. You can think of these as conventions that we’ll simply 
follow. (Formally, an interval or set is closed if it includes its limit points.) 

494Unfortunately there does not seem to be any standard name for this object. This object is however key 
to understanding why the FTC1 is true, which is why I decided it should be given a name, even if that 
name is non-standard. 
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105.2. The First Fundamental Theorem of Calculus (FTC1) 


Example 1346. Define the function f :R —R by f (x) = 2? +1.” 
The definite integral function of f from 0 is the function g:R — R defined by 


g(z)= fof 


Now, our goal is to 


Solve the problem of integration. 
That is, Find the area under the graph of f 
Or equivalently, Find a general expression for g (x). 


Instead of approaching this goal directly, we’ll take a strange and indirect approach. We'll 
pose a seemingly unrelated question: 


What is the derivative of g? © 


This is a strange question to ask. Right now, we don’t even know what g is—that is, we 
don’t even know what a general expression for g(a) is. So, how could we possibly know 
what the derivative of g is? This question is thus strange—it is like asking someone who 
has no idea where Singapore is to locate Sentosa. 


Let us nonetheless see where the above question might lead us. For concreteness, let’s 
consider some arbitrary point, say, 4. We’ll try to find g’ (4), the derivative of g at 4. 


Figure to be 


inserted here. 





In the above figure, we’ve picked some point c that is “near” 4. 


The red area is g(c) - g(4), while the blue area is (c- 4) f (c). Observe that these red 
and blue areas are approximately equal. ‘That is, 


g()-a(4#(e-4) F(a) or HI -IOD Spey) 


1 
The closer c is to 4, the better the approximation *. Hence,*° 


LQ§6, &outvatentlv. gq’ (4) i f (4). www.EconsPhDTutor.co 


(... Example continued from the previous page.) 


Figure to be 
inserted here. 


ia 
Here’s a second way to see why = is true: 


Recall that informally and intuitively, g’, the derivative of g, answers this question: 
Given a small unit increase in x, by how much does g increase? 
Well, we know that g measures the area under the graph of f. So, when x increases by a 


small unit, we’d expect g(a) to increase by approximately f (7) (see above figure). So, 
at the point 4 in particular, we should have 


g' (4) = f (4). 


We’ve just shown that g’ (4) i f (4). But since the point 4 was arbitrarily chosen, the 
foregoing argument should also work for any other point x « R. That is, for any x € R, 
we also have 


g (x)= f(x). 


In other words, g’ = f. 


In ©, we asked, “What is the derivative of g?” We now have an answer: 
The derivative of g is f. 


This answer is also exactly what the First Fundamental Theorem of Calculus 
(FTC1) says: 





Very loosely, the First Fundamental Theorem of Calculus (FTC1) says that definite 
integrals are also antiderivatives. A bit more precisely, if g is a definite integral function of 
f, then g is also an antiderivative of f. Most precisely, 





496 Here we are hand-waving. 
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Theorem 44. (First Fundamental Theorem of Calculus, FTC1) Leta < b, f: 
[a,b] > R be a continuous function, and cé [a,b]. Suppose the function g: [a,b] > R is 
defined by 


g(z)= fs 


Then g' =f (i.e. the derivative of g is f or equivalently, g is an antiderivative of f ). 





Proof. The above example already discussed the intuition behind the FTC1. For a formal 
proof, see p. 1714 (Appendices). 

















Let’s illustrate the FTC1 with a familiar example from physics: 
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Example 1347. A car is moving. Below we graph its velocity v (m a0) as a function 
of time t (s). 


Figure to be 
inserted here. 


Recall that the distance d (m) travelled by the car is the area under the graph. 


5 
e For example, after 5s, the distance travelled by the car is ) OP 
0 


8 
e And after 8s, the distance travelled by the car is ii VU. 
0 


ob; 
e In general, after xs, the distance travelled by the car is i Vv. 
0 


But we already know that 
The derivative of distance (with respect to time) is velocity. 
That is, a =a): 
What we’ve just shown is precisely the FTC1: 
The derivative of the area under a function’s graph is the function itself. 


Exercise 418. This exercise guides you through a proof of Theorem 41: Every continuous 
function is antidifferentiable. (Answer on p. 1932.) 


Let a < b, f : [a,b] > R be a continuous function, and c € [a,b]. Let g be the definite 
integral of f from c. 


(a) What is g? 
(b) What does the FTC1 say about f and g? 
(c) Hence, we have shown that f has , namely (full in the blanks). 
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105.3. The Second Fundamental Theorem of Calculus (FTC2) 


Theorem 45. (Second Fundamental Theorem of Calculus, FTC2) Let a<b and 
f :[a,b] +R be a continuous function. Suppose g is an antiderivative of f. Then 


[ f=90)-9(0). 








Proof. See Exercise 419. 











Exercise 419. This exercise guides you through a proof of the FTC2. 


Let a< band f: [a,b] > R be a continuous function. suppose g is an antiderivative of f. 
Define the function h: [a,b] > R by h(x) = i a 
(a) What does the FTC1 say about h and f? 


(b) So, how are h and g related? 
(c) Now prove the FTC2 by considering g (b) - g (a). (Answer on p. 1932.) 





The FTC2 gives us a powerful tool for solving the problem of integration (i.e. the problem 
of finding the area under a curve): 


Three-Step Integration Recipe 


b 
Let a<b and f be a nice function that is continuous on [a,b]. To find i de 


1. Find any antiderivative g of f. 


2. Plug the upper limit b and lower limit a into g—that is, evaluate g(b) and g(a). 


3. Conclude: fs = g(b)-g(a). 
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3 
Example 1348. Define f :R—> R by f(x) =x”. Find ‘ ie 
0 


Figure to be 
inserted here. 


Use the Three-Step Integration Recipe: 
1. Find any antiderivative g of f. 


i 
One such antiderivative is g: R > R defined by g(x) = ge 


2. Plug the upper limit 3 and lower limits 0 into g: 
I 43 ! 33 
g(3)= 3-3 a.) and g(0)= 3-0 at; 


3 
3. Conclude: i f =9(3)-g(0) =9-0=9. 
0 


Example 1349. XXX 


Example 1350. XXX 





Exercise 420. Use the Three-Step Integration Recipe to evaluate each definite integral. 
(Answer on p. 1071.) 


(a) xxx 


(b) xxx 





A420(a) xxx 
(b) xxx 
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105.4. Why Integration and Antidifferentiation Use the Same 
Notation 


To repeat, 
Differentiation is the problem of finding the gradient of a curve. 


Integration is the problem of finding the area under a curve. 


Figure to be 


inserted here. 





A priori,**’ there is no reason to think that the area under a curve is in any way related 
to its gradient. Equivalently, a priori, there is no reason to think that integration is in any 
way related to differentiation (or antidifferentiation). 


That they are in fact related was only just established by the two FTCs: 
« The FTC1 says that any definite integral function of f is an antiderivative of f. And so, 


Loosely speaking, integration and antidifferentiation are the “same thing”. Or equiva- 
lently, integration and differentiation are inverse operations. 


e The FTC2 says that if g is any antiderivative of f, then 


b 
[ f=9@)-s(0). 
And so, to compute definite integrals, we can use the Three-Step Integration Recipe. 


Earlier, we were puzzled by why we made the symbol [ do double duty, i.e. why ‘i 


serves as both the antidifferentiation symbol and the integration symbol. We can 
now solve this puzzle: 


By the FTCs, antidifferentiation and integration are in fact the “same thing”. And so, it 


makes (some) sense to use the same symbol / fot both antidifferentiation and integration. 


We now also see why [ f (x) dz is sometimes called the indefinite integral: 


° | f (x) dx tells us about the antiderivatives of f. But since antidifferentiation and 


integration are in fact the “same thing”, it makes (some) sense to also call i}. 7 8) da 
(or indeed any antiderivative) an integral. 


° [ f (x) dx is indefinite in the sense that it can vary by up to a constant (i.e. the 


constant of integration, C). 





497 A priori is just a fancy Latin phrase for beforehand. 
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These though are mostly matters of historical convention. In my opinion, it is better to 
avoid using the term indefinite integral—and this is exactly what I’ve done in this textbook. 


It would also be better if there were simply a different symbol for antidifferentiation. By 
making the symbol [ do double duty, generations of students have been confused into 
thinking that 

e Integration is by definition the same thing as antidifferentiation. x 

e And hence, the FTCs have no substance. x 

I hope that in this textbook, we have left absolutely no room for confusion. In particular, 
we've repeatedly emphasised these important points: 

e Differentiation is the problem of finding the gradient of a curve. 

e Integration is the problem of finding the area under a curve. 

e Antidifferentiation is the inverse of differentiation. 


e A priori, there is no reason to believe that integration is in any way related to differen- 
tiation (or antidifferentiation). 


e The FTCs establish that—remarkably enough—integration is the “same thing” as an- 
tidifferentiation. 


¢ (Which is why historical convention has us using the exact symbol 7 for both integration 
and antidifferentiation. ) 


Exercise 421. Let a <b and f be a function that is continuous on [a,b]. Explain what 


b a 
i de dar, i i (@) da, and i f («) dz are and why they are different. (Answer on 
p. 1073.) . 





A421. 
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105.5. Some New Notation and More Examples and Exercises 


Let f be a nice function whose domain contains the points a and b. Then we introduce this 
new piece of notation: 


Lf (x) |} = f(b) - f (a). 


Example 1351. XXX 


Instead of [ f (x)]?, we will sometimes also more simply write [ f ie That is, 
[fla =f @]a=f(0)- F(a). 


Example 1352. XXX 


This notation we’ve just introduced is particularly convenient when using the Three-Step 
Integration Recipe: 


Example 1353. XXX 


Example 1354. XXX 





Many more exercises for you to practise the Three-Step Integration Recipe and make use 
of the new notation just introduced: 


Exercise 422. XXX (Answer on p. 1074.) 
A422. 
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106. More Techniques of Antidifferentiation 


To find the area under the graph of a function f, we will often use the Three-Step Integration 
Recipe, in which the only step that might be challenging is the first: Find an antiderivative 
of f. 

This is the reason why it’s so important to know how to find antiderivatives. In Ch. 103, 
we already learnt several rules and techniques of antidifferentiation. In this chapter and 
the next, we'll learn more. 


106.1. Factorisation 


1 
Example 1355. Find I) d= (ora? + 274120) 
zt 


eee | 


Looks tricky. But observe that 2? +2741 = (a+ ie And so, by the Power and LPC 
Rules of Integration (Theorem 41), 


i] il i] 
fase: / —«-- +C. ® 
e2+2¢+1 (x +1) c+1 


1 
pes or | 


Observe that x? + 3x2 + 32+ 1=(x+1)°. So, 


dx (for 2? + 3x? + 32+1+#0). 





Example 1356. Find [ 





1 if Gamer 
if d= {| ——,de=-;——5 +€ 
x? + 3074+ 3r+1 (x +1) 2 (a +1) 


Exercise 423. Find the following. (Answer on p. 1933.) 


: 1 
—_____ dr (for 42” - 4x + 1 #0). i—= ee 
(a) Hi — (for 42° - 42 + 1 # 0) (b) soc es dx (for 9x? + 30x 
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106.2. Partial Fractions: Finding [ os where b’ - 4ac > 0 
ax? +ba+c 


1 
In the last subchapter, we learnt to find [ —,——— dz in those cases where b?-dac=0, 
ax’ + bx+c 
so that ax? + bx +c was a perfect cana 
We now learn to also find [ ee dx in those cases where b?-4ac > 0. In such cases, 
ax +bxr+ec 


ax? + bx +c is no longer a perfect square but is still factorisable. So, this is really just 
more factorisation, except that now we'll also make use of partial fractions (Ch. ??): 
1 9 
Example 1357. 24 ag (tora = 10) 
G — 
We observe that 2? - 1 = (x + 1) (x-1) and so write 
Bo. A(a—l)+ Biel). (A+ B)a- A+B 
a oe oe SSD) x2 —1 


Comparing coefficients, we have 4+ B = 0 and -A+ B= 1. Solving, we have A = -1/2 
and B= 1/2. So, 





1 -1 1/2 
vl de= f ee / ag 
g2-1 z+1 2-1 


= / sui da + ii a da (Sum Rule) 


Was a 
+5 f— (Constant Factor Rule) 
eg 1° 2 
if 
Dx - 5 In |x + 1] + 5 an jc-1]+C — (Reciprocal and LPC Rules) 
if 
= 5 (In |e - 1] Info + 1) +e 


(Law of Logarithm) 


(Fact 10) 


Note: It would’ve been perfectly fine to leave our answer at @*. The last three steps 
were nice but not necessary. 
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Example 1358. Find i dx (for x? +2-6+#0). 


i 
r2#+2-6 
Observing that 2? + x -6 = (x +3) (x - 2), write 


oe x+x-6 








if A Bae 2) ea Aer ben ie a 


Comparing coefficients, we have A+ B =0 and 3B-2A=1. Solving, we have A = -1/5 
and B = 1/5. Thus, 


1 -1/5 1/5 
__* dg = [ d 
a= i fea f=? ‘i 


-1/5 1/5 
ike. a (Sum Rule) 


l 
ai 4 "5 -{— (Constant Rule) 


i 
ON is 5 In |a + 3] + 5 ain jc-2|+C — (Reciprocal and LPC Rules) 


i 
= = (In|z -2|-In|x +3/)+C 


ja ~ 2| 


i 
= —In (Law of Logarithm) 
5 |x+3] 
a 
i) 


2 
In|~ +C. (Fact 10) 
x + 


Again, @* would’ve sufficed as our answer. 


Exercise 424. Find the following. (Answer on p. 1933.) 
if 

(a) i el (for 5a? - 24-3 #0). 

6) faa (for x? - a? #0). 

© faa (fora? =z° 70). 
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106.3. Building a Divisor of the Denominator 


Example 1359. Find i a 
G2 ta I 


Observe that x? + 2x +1=(x+1)*. So, 


fate: fu 
v2+2¢+1 (x +1) 


1-1 
= i Gent (Plus Zero Trick) 
x + 


dx (assume x? + 24 +1+#0). 


c+1 1 
- | ~~ ,-—su 
(x+1)° (x+1) 
a 1 1 Be 
+1 (x+1) 


i} i] 
dx - i ——, dz. (Difference Rule) 
sh (x +1) 





i 
=In|x+1]+ a C, (Reciprocal, Power, and LPC Rules) 
ab 





® 
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x 
go = 307 + 3x — | 


Observe that x? - 3x2 + 3x - 1 =(x-1)*. So, 


Example 1360. Find [ dx (assume x° - 327+ 3a-1+#0). 





£ £ 
d =f d 
Wl oe eae a (x -1)° = 


ot 
= i av) da (Plus Zero Trick) 
G — 


=I eee 

(x-1) (a#-1) 

1) : me : 3 dx 
(o= Ny {ea 1) 


yea a+ [ae (Sum Rule) 





i ee | 
“2-18 
(x-1) 
22-1 
a 
2(¢—1) 


(Power and LPC Rules) 


The last step is nice but not necessary. 


Exercise 425. Find the following. (Answer on p. 1934.) 
(Geee 9 
= 1 : 
a) f ea daz (for 42° - 42 +1 +0) 
(eb ae 
(b) {/— = dr (for «? +2 ~6 +0; Hint: Use Example 1358). 


a o 
(c) iS 5 sai git (for 5a? -27-3+0; Hint: Use your answer from Exercise 424(a).) 
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106.4. More Rules of Antidifferentiation 


498 


The following rules of antidifferentiation are in List MF26 (p. 4),*’° so no need to mug: 


Proposition 16. Let a #0. 
1 1 
(a) i dz = —tan?=+C 
a a 














x? + a? 
(b) [ eee; Sein eo for |z| < |al 
Va? — x? ja) 
1 1 iI 
(c) fae-pnl +C, forx#+a 
(d) faoee-pn| = +0 for x ++a 
at—-x? ~ Qa ja-& 
(e) i tan xdzx = In|sec2|+C, for x not an odd multiple of 5 
i cot xdax =In|sinz|+C, for x not a multiple of m 
[ cosecr dz = —In|cosecx + cot z| + C, for x not a multiple of 7 
i secadax = In|secx+tanz|+C, for x not an odd multiple of 5 
d 
Proof. By Fact 215, to prove that fi (x) dz = F (x) +C, show that qa CaS Fe). 
a 
(a) By Fact 207(c) © tence a= So 
af "dee Pe 
d E pol *) 1 1 1 
<8 —_— an _— . 
dx \a a a(Z) +1 a «+a? 
(b) By Fact 207(a) ene ee for z € (-1,1) 
° de ime a 
So, for al € (-1,1) or xe (-|al, |al) or |z| < lal, if a> 0, then 
a 
d d 42 1 1 a 1 1 











498Proposition 16 is slightly more general than what’s on List MF26. Specifically, it allows for a < 0 and 
in (b)—(h), a wider range of values of x. 
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For (c) and (d), see Exercise 424. 
For (e)—(h), see Exercise 426. (We'll also prove (e) again in Example 1390.) 














Exercise 426. Prove Proposition 16(e)—(h). (Hint: In each, you'll have to examine two 


cases, similar to (b).) (Answers on p. 1935) 
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1 
106.5. Completing the Square: y —,—_—— dx where b? — dac <0 
ax’ +bx+c 


i 
Hind {= 
= ax? +br+c¢ . 


In Chs. 106.2, we learnt to solve the above problem in those cases where b? — 4ac = 0 or 
b? — dac > 0. 


We now learn to also solve the above problem in those cases where b? - 4ac < 0. It turns 
out that in these cases, the trick is to complete the square, so that we can make use of 
Proposition 16(a): 


1 1 
[ deo tan 2 +. 
x? + a? a a 


1 
Example 1361. Find [f= dz. 
re +e + | 


il 2 
Complete the square: g+n+1g (z fp 5) Be 


1 
ae / aai® 
Now, let x + 1/4 and \/3/4 take the places of “x” and “a” in @*. Then 


1 
eee ee 





In Ch. ??, we reviewed how to complete the square. In general, 


aa? + br +c=a(x+>-) ea, 
2a 4a 


But rather than to try memorise the above general formula, it’s probably easier to try to 
understand and thus easily “see” how you can complete the square in each specific case. 


1082, Contents www.EconsPhDTutor.com 


i 
Example 1362. Find (i= 


Let’s first rewrite the integrand so that the polynomial in the denominator has leading 
coefficient 1: 





i 1 ik 
ney uo 
277 237 +5 2 e+ 1.5¢+2.5 


o 2 
Complete the square: fe a 2 (z + 7} 


Let x + 3/4 and \/31/16 take the places of “x” and “a” in @*. Then 





In Ch. 106.1, Ch. 106.2, and this subchapter, we’ve learnt to find 


il 
=) ——__= 
J asm - 


in all three possible cases (b? ~4dac = 0, b? - 4ac > 0, and b? - 4ac < 0). Fact 216 summarises 
what we’ve learnt. It looks intimidating, but is really just what we’ve been doing, except 
that now we have a, b, and c instead of actual numbers. 


Fact 216. Let a,b,c¢R with a#0 and ax? +bx+c#0. If d=./|b? —4ac|, then 


jor, —4ac = 0, 


il 
| aoe Soe for b? - 4ac = 0, 


for b* - 4ac <0. 








Proof. See p. 1716 (Appendices). 
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AA27. 
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dx in the Special Case where a < 0 


il 
106.6. ‘i —$—$_______—. 
Vax? +bx+c 


2 


Example 1363. Find a x (for -x“+x2+1>0). 


1 
—————-.d 
V-a2+x04+1 


5 1\? 5 
Complete the square:—27 +re+1= ie (z = 5) = (> 


By Proposition 16(b), 


dy = sin} id (. 


il 
Lez 


Replace d and y with V5/2 and x - 1/2: 


ee) eee 
S area S (8) - (2-3) 
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Example 1364. Find y) a (lor —25 237 25-0) 


i 
———— a 
29? ae eS 


Bee ie 49 
Complet2the sguare= 2 (-2" + 52 + =| = Fe — 


By Proposition 16(b), 


y 
y = sin a 


S Tae 


Replace d and y with 7/4 and x - 3/4: 


1 
(ee ee 
V-222 +3245 a= (x - 3) 


oa eas 3/4 
va T/A 


iL poles 3 
= —=sin™ e. 
7 a 


Exercise 428. Find the following. (Answer on p. 1936.) 


1 
(a) [ ——. dx (for -327 +2 +6> 0). 
V-327+2+6 


1 
(b) i Ta aee (for ie i. —-x+2> 0): 
J-7r? — 2 +2 





Fact 217 is the general formula. Again, this is for your reference only and not meant for 
mugging (which would be foolish). 


Fact 217. Suppose a,b,c€R with a<0 and ax? +br+c>0. Then 


il a =( 27 +b) b) 
Vax? +bx+e 1 a WE = 








Proof. See p. 1718 (Appendices). 
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“But see Fact 299 (Appendices) if you’re interested. 
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106.7. Using Trigonometric Identities 


The following rules of antidifferentiation are not on List MF26, but are explicitly listed on 
your H2 Maths syllabus. Which means you'll have to know how to derive them. 


Proposition 17. Let m,néR such thatm+n#0, andm-n#0. 
1 in2 
(a) [ sin? x dz = rae — aan CG. 


1 inal 7 
(b) f costeda = grt — = +C, 


(c) ii tan?adzx =tanz-x£+C, where x is not an 


1 = 
(d) J sinme cosnx dx == 5 [etwas ott lec, 


m — 71 m+n 





(e) J sinme Sine an = 5 |e St] Lo, 


m— mT! m+n 





1 
(f) i Cosme Cosy dy = 5 | 


salman , ein) 60, 


m— 7 m+n 





Proof. We could simply verify these, i.e. differentiate the expression on RHS to get the 
integrand on LHS. However, these do not appear in List MF26 and you’re required to know 
how to derive these. So, here our proof will be the “proper” one, going from LHS to RHS. 


Here we prove only (a) and (d). You’re asked to prove (b), (c), (e), and (f) in Exercise 
429. 


(a) Use the identity cos2x = 1-2sin? x. So, 


1- 2 1 in 2 
fsmcde= | —ar-52-= ae Gi 














2 2 4 
P P- 
(d) As stated on List MF26 (p. 3), sin P +sinQ@12sin “e cos — @ 
P P- 
So, let = _ and ie ———, 
Hence, P@2(m+n)x and QO3(m-n) az. 


Plug in @1, @2, and @3: 


i! 
J sinme cosne dx = f 5 [sin (m+n) a+ sin(m-n)2] da 





5 [eens , oem) Lo, 


2 mtn m-n 
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106.8. Integration by Parts (IBP) 


Let u and v be differentiable functions.°”’ 

By the Product Rule, (u-v) =ul-vtu-v'. 
Equivalently, Uv = ful-vtuv'de= ful-ude+ f w-v'de, 
Rearranging, 7 u-v' dx =u-u- 7 ul -vdg. 


This last equation is our Integration by Parts (IBP) formula. We see then that IBP is 
simply the inverse of the Product Rule. For future reference, let’s jot it down as a formal 
result: 


Theorem 46. (Integration by Parts) Suppose u and v are differentiable functions. 


Then 
fwolde=w- fw-vas. 


Example 1365. Find Hi xe” daz. 


To use IBP, we must first decide, Which of x or e” is wu and which is v’? 


Let’s choose u =x and v’ =e”, so that u’ = 1 and v =e”. So, 


/ y 
uU U U u Uv 


eS _— ans 
feerar-re- f 1 e* dx = xe” -e* +C=e* (x#-1)+C. 


Example 1366. Find f esine ds, 


Let’s choose u = x and v’ =sinz, so that u’ = 1 and v=-—cosz. And now, 


/ Uf 


U v U Vv 


a ——— ——S—— 
[ @sina de = x (-cos)- f 1 (-cosx) dx =-xcosxt+sinx+C. 





To choose v’, use the mnemonic and rule of thumb dETAIL. That is, choose the derivative 
v’ in this order: 


Exponential, Trig., Algebraic, Inverse trig., Logarithmic. 


The above rule of thumb (usually) works because it is easiest to find an antiderivative of 
an Exponential function and hardest to find one of a Logarithmic function.*”! 


Example 1367. XXX 


°° Following common practice, when discussing IBP, our functions shall be u and v (rather than the more 
typical f and g). 

°°lKasube (1983) first gave this mnemonic as LIATE. By reversing the letters and adding the letter d in 
front, we get the actual English word dE TAIL, which is probably more memorable. 
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Example 1368. XXX 


Example 1369. Find [ V1=2? de for x ¢[-1, 1]. 
XXX 





V1l-22 sin! 
Conclude: f Vi-sae-* 5 - 25 eG, 


We'll revisit this problem in Example 1397. 
Example 1370. Find [ sin xcos x dz. 


1 
An easy way to find this antiderivative is to use the identity sin x cos x = 5 sin 2”: 


il 1 
f sinecosrde = [sin 2eda * -7cos 2x +C. 


But here as an exercise, let’s use IBP instead. 


This time the dETAIL rule of thumb doesn’t help because we have two trigonometric 
functions. So, let’s just choose u =sinx and vu’ = cosz, so that u’ =cosz and v = sing: 


U i U U U U 


s —<—— tg 7 Vv Z ‘ GN 
sin zcosxdzx = sinzsinz — |] cosxsinz dz. 
Rearranging, 
: - 929 A : lie 9 
2 | sinxcosxdz =sin“‘r+C sing cosede = = sin Cec) &) 


Can you explain why @1 and @2 are consistent with each other??? 





Sometimesm we need to apply IBP more than once: 


Example 1371. Find f oer dz. 


By dETAIL, choose v’ = e* (and u = x”), so that v = e* (and u! = 22): 


7 
Uy v ul v 


U 
me 1 eae ao 
2 dg =e = 208" dz, 


At this point, we would apply IBP a second time. But we already did this in an earlier 


example and found that | xe* dz 2 et (x-1). So let’s just plug 2 into =: 


f set da = ae? - 26" (w= 1) + C= 0? (2? - 20 +2) C. 





Sometimes, we can use IBP together with the Times One Trick: 
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Example 1372. To find i. Inadz, use the Times One Trick and IBP: 


i! 
fonzde= fine-tma-1=(Ina)e- f <ade=aine- f 1de=cine-2+C. ® 
Te 
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Exercise 430. Find (a) i ve" dx (you may use our result from Example 1371); and 
(b) ie sin x da. (Answer on p. 1937.) 


Exercise 431. Below is a five-step “proof” that 0 = 1. Identify the error. 
, 1 
1. Consider i —dz. 
af 
: ; 1 1 
2. By the Times One Trick, i —dz = i —-ldz. 
Ta i 


i =i 
3. Apply IBP, choosing v’ = 1 (and u = —), so that v = x (and u’ = —): 
Ty 


£ 
f cae= f =-1de=—-0- f Seae=1+ f “de. 
£ £ £ x x 
Al, So, f <dr=1+ f dx. 
£ £ 


1 
5. Subtracting [ —dz from both sides, we get 0 = 1. (Answer on p. 1938.) 
x 


Exercise 432. Below is a five-step “proof” that 0 = 1. Identify the error. 


2] 
i- Consider [ —dz. 
i & 


= 2 | 
. By the Times One Trick, i ip = i —-ldz. 
ee A Se 
il 
. Apply IBP, choosing v’ = 1 (and u = —), so that v = x ( 
a 


2 


= if = 
f ~-Ide=|"-2- f Sarde] 
ee us rs 1 
2 2 
oa ~dr=1+ f ane 
Lae ae, 


aul 
. Subtract i —dz from both sides to get 0 = 1. (Answer on p. 1938.) 
ie a 
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107. The Substitution Rule 


The Substitution Rule is simply another technique of antidifferentiation, but is of suffi- 
cient heft that it gets its own chapter. 


For H2 Maths, you are merely required to mug and mindlessly regurgitate what [ll call the 
Five-Step Substitution Rule Recipe, which we'll state below. But first, some examples 
to illustrate how it works: 


Example 1373. Using the substitution u = sin, find i} cotxdax (for x not an integer 


multiple of 7t). 


503 


Five-Step Substitution Rule Recipe: 


A; 





: s 1 2 , : : : : du 
. Using = and =, rewrite the integrand cot x into an expression involving u and —: 


du 
Compute — + cosa. 
dx 


dy 


Ss 1 
fcotade= [Sac f "a+ Es 
sin x U u dz 


. Magically cancel out the dz’s: 


. Find this last antiderivative: 


i] 
f <du=In|ul +c. 
U 


. Plug back = to get rid of u: 


In jul + C =In|sinz|+C. 


Conclude: i cot z dz = In|sinz|+C. 





°°3The parenthetical requirement ensures that sin x + 0 and cot x is well-defined. 
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Example 1374. Find i ce” Ie dax using the substitution wu = 2”. 


Five-Step Substitution Rule Recipe: 
du 
1. Compute — 2 2". 
da 
du 
2. Using = and é. rewrite the integrand e* 2x into an expression involving u and oe 
a 


foeferdr® f e2drt fot“ ae. 
dz 


3. Magically cancel out the dz’s: 


[evsgat= [craw 


4. Find this last antiderivative: 

[ e"'du=e"+C. 
5. Plug back = to get rid of u: 

e+ Cle? +0. 


Conclude: i e” Ie dx =e" +C. 
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Example 1375. Find i (cos ze 2x dx using the substitution u = 2. 
Five-Step Substitution Rule Recipe: 

d 
1. Compute me 

aa 


2. Using = and 2 rewrite the integrand (cos E) 2x into an expression involving wu and 
du_ 
da 


s d 
J (cosa?) 2ede® f (cosu)2ede* f cosu de. 


. Magically cancel out the dz’s: 


jf csudiae= [ cosudu. 


. Find this last antiderivative: 


J cosudu =sinw +t. 


. Plug back = to get rid of u: 


Qo Ss Q 
sinu+C =sinz?+C 


Conclude: yi (cos i) 2x dx = sin x? + C 





More examples, but now without explicitly explaining each step: 
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Example 1376. Find i [sin (x + x)| (2x +1) dz using the substitution u = 2? +2. 


Five-Step Substitution Rule Recipe: 


1. Compute ae ale 
dg 
2 f [sin(e? +2)] (27 +1) dx = f (sinu) (27 +1) de f sinuS ax 


jf snuQas- J sinudu 


—cosu+C 


Ss 
~ cos (a> +x) aC 


Conclude: i [sin (a? + x)| (2x +1) dx =-cos(2?+2)+C 


Example 1377. Find [ Ges 4 2a} (az + Aer ) aa 
One possible method is to expand the integrand, then antidifferentiate term by term: 
[ (Ge + 2x) (327 + Aer) din = ii 3a° + 4a* + 624 + 82° dx 


1 4 6 i 
= Be + gt? + pa? + 2a" +O = 52° + 2x? + 2a" + C. 


Another method is to use the substitution u = 2° + 2x?—-Five-Step Substitution Rule 
Recipe: 


d 
1. Compute ni 30? + der. 
dz 


2 [ Ge + ice (ea + 4x) dx = i U (327 + Aer) dx = Lee dx 


uot = f ua 


1 
== C 
so 


. : (ae + Qn?) IG, 


Conclude: [ (a + 2a (Bz + 4x) dz (a +24 2 +C 


eae 
=> 
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The Five-Step Substitution Rule Recipe 


Find i f (x) dx using the substitution u = u (2): 


du 
. Find —. 
dx 
. Use = and Step 1 to rewrite the integrand f (x) into an expression in terms of u and 
du 


ie 


[F@ de= fg (u) ae. 


. Magically cancel out the da’s: 


[90 Goe= [9 (w) au 


. Find this last antiderivative: 
f atu) du=G(u)+C. 
. Plug back = to get rid of wu: 


Gi) =G Gi(2)). 


Remark 171. The Key Magical Step in the above Recipe is Step 3, where we “cancel out 
the dx’s”. 


du 
This is a strange and questionable move. As we repeatedly warned earlier, — is not a 
x 
fraction and the dx’s are not numbers. So, how is it that we can just “cancel out the 
dz’s”? 
It turns out that formally, we are not actually doing anything like “cancelling out the 
dx’s”—instead, this is simply an informal and convenient mnemonic. 


Indeed, strictly and pedantically speaking, Step 3 and hence also our Five-Step Substi- 
tution Rule Recipe are wrong! 


But this isn’t something you need to worry about for H2 Maths. What matters for H2 
Maths is that the above Recipe “works” and enables us to find the correct antiderivative. 


For why Step 3 is wrong and what it should instead be, see Ch. 146.26 (Appendices). 





More examples: 


Example 1378. XXX 


Example 1379. XXX 
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Example 1380. XXX 





Exercise 433. XXX (Answer on p. 1099.) 
A433. 
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107.1. The Substitution Rule Is the Inverse of the Chain Rule 


It turns out that the Substitution Rule is simply the inverse of the Chain Rule (for 
Differentiation). Let’s illustrate why this is so with a few examples. 


Example 1381. XXX 


Example 1382. XXX 


For a formal proof that the Substitution Rule is simply the inverse of the Chain Rule, see 
Ch. 146.26 (Appendices). 


Exercise 434. XXX (Answer on p. 1100.) 
A434. 
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107.2. Skipping Steps 


Since the Substitution Rule is really just the inverse of the Chain Rule, if we can “see” 
how the Chain Rule works, then we can often skip the entire Five-Step Substitution Rule 
Recipe: 


Example 1383. Find yi (cos a \ On aa 


In Example 1375, we solved this using the Five-Step Substitution Rule Recipe and the 
substitution u = 2. 


We can now solve this more quickly. Observe that by the Chain Rule, 


— sin zr? = (cos a} 2 
dx 


Thus, i, (cos xz) Qe dx =sinz? +C. 


Implicitly, what we’ve just done is no different from the Five-Step Substitution Rule 
Recipe. Just that we’re doing it much more quickly (indeed, in a single line). 





Example 1384. XXX 


Example 1385. XXX 


Exercise 435. XXX (Answer on p. 1101.) 
A435. 
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107.3. ff flexp fda = exp f+ 


By the Exponential and Chain Rules (of Differentiation), 


xexp f = f'exp f. 


Hence, 


Fact 218. Suppose f is a differentiable function. Then 





f flexp fda =expf +C. 


And so, by recognising that an integrand is of the form f’exp f, we can simply skip the 
substitution altogether and thus solve the problem more quickly. We now revisit the above 
two examples: 


Example 1386. Find IP eM cog ¢ dz. 
Earlier in Example XXX, we solved this using the Five-Step Substitution Rule Recipe 
and the substitution u = sin x. 


We can now solve this more quickly. Observe that xsinx = cos. Hence, by the Chain 
Rule, 


— pint ~ SIN Tagg y, 


dx 
Thus, [ &™*cosxadx =e +C. 


Again, implicitly, what we’ve just done is no different from the Five-Step Substitution 
Rule Recipe. Just that we’re doing it much more quickly (indeed, in a single line). 


E le 1387. Find [oa 
xampie ‘ in J 4 ie 


Earlier in Example XXX, we solved this using the Five-Step Substitution Rule Recipe 
and the substitution u = Vz. 


il 
We can now solve this more quickly. Observe that x./z = -——=. Hence, by the Chain 


2./h 
Rule, 
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Exercise 436. XXX (Answer on p. 1103.) 
A436. 
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107.4. f Gav-mnlfl+e 


By the Natural Logarithm and Chain Rules, 


aint, for f > 0, 
d 
qa nlfl= 

(-f) =-4 for f <0. 


Hence, 


Fact 219. Suppose f is a differentiable function. Then 


f Fac=-inlsiec, jor 7 (2) a0: 


Example 1388. Find i cotx daz (for x not an integer multiple of 7) 


Earlier in Example 1373, we solved this using the Five-Step Substitution Rule Recipe 
and the substitution u = sin x. 


To solve this problem more quickly, simply observe that 


COs x cd 
cotz = — and — sin x = cos7z. 
sin 7 dx 


And so by Fact 219, jt cot x dx = In|sinz|+C. 
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2x + 
Example 1389. Find ‘i es x 

xi +sin x 
One possible method is to use the Five-Step Substitution Rule Recipe and the substitution 
u@1x? + sin z: 


du 
1. Compute — 2 Or + cosZ. 
da, 


> (— ph f PAE as Lf fie 1 du | 
v2 +sin a 7 


Ss . 
ms In | +sin2| +, 


2% +COS2 


Conclude: y/ dz =In be +sin 2| aiG: ® 


v2 4+sin ae 


A second quicker method would be to simply observe that in the integrand, the derivative 
of the denominator is the numerator: 


Hare sin x) = 27 + cosx. 


Te 


So, we can use Fact 219: 


9 x? +sinz 
pene ee de = In|? +sin2a|+C. 


v2 4+sinr v2 4+sinz 








We now prove Proposition 16(e) using Fact 219: 


Example 1390. Find f tancae. 


sin © d 
Observing that tan x = and ae cos £ = —sinx, we can use Fact 219: 
€ x 


OS & 


sin x = sing 4h COs £ 
f tanedz = {= dx=- f Par=- f ar =-Injcosal+C = Infsecal+C. 





COS L COS @ COS & 
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Exercise 437. Find the following. (Answer on p. 1939.) 


(a) (/—_— Size se x (for x® - cosx #0). 
— COS 


sin 27 
fe ) | aa ete 

10a + cose 
© f= 


5a +sine re 


Oi ll 
(d pein (for 22 +27+2+1+#0). 


(e) i cotxdzx (for z not an odd multiple of 7/2).°™ 


t 
(f) i secx dx (for x not an odd multiple of 1/2).°” (Hint: Multiply by poe el ee 
sec yx + tan 








°This was also Proposition 16(f). 
°° This was also Proposition 16(h). 
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il 
n+l 





107.5. f (fae Coens 


We reproduce from Ch. 93.3 this definition: 


Definition 210. Given the nice function f : D > R and n€R, the function (f)": D>R 





is defined by (f)" (x) = [f (z)]”. 
By the Power and Chain Rules, Xx Gan = 277, 


Hence, [ feflar®Syree, 


Example 1391. Find [ (x ~ 2a} (Gz 4 Aer) aa, 
In Example 1377, we solved this using two methods: 


1. Expand the integrand, get a fifth-degree polynomial, then integrate term-by-term. 


2. Use the Five-Step Substitution Rule Recipe (and in particular the substitution u = 
3 2 
2 blr hh 


We now solve the above problem using a third method: 
d 

Observe that aR (2 + Za) = 3x7 + 4x. Hence, the integrand is of the form f - f’ and we 
16 


1 
Can use =: 


ik (2 an) (eae = 4x) da = [ (ae + ont) ae + aE) dx + ‘(a3 “: 2a?) aC. 
aa 2 
@) i"(e) (f(a)? 


Again, this “third” method is secretly the same as our second method. The only difference 
is that we skip the step of making the explicit substitution and thus arrive at our answer 
more quickly. 





By the Power and Chain Rules, x (fy? =3 (fy -f'. 


Gave (es 


Wire 


Hence, [ (f)?- fda 2 
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Example 1392. Find i) (oe 4 Qa?) (327 + 4c) a 


Again, we can solve this using two methods: 


1. Expand the integrand, get a fifth-degree polynomial, then integrate term-by-term. 


2. Use the Five-Step Substitution Rule Recipe (and in particular the substitution u = 
3 2 
aE Dre), 


But let’s do it using the third and quickest method: 
Observe that = (2? + 25° | = 3x7 +42. Hence, the integrand is of the form Gar - f’ and 
2 
we can use =: 
i ae + Qa?) (az + 4x) ag = uf ie + 22) ia + eid dx 2 (0° + 22)" +C, 


ee x ee 
[f(x]? f(a) [f(x]? 





More generally, by the Power and Chain Rules, for any ne R, 


d n n 
a) es 
a 
Hence, 
Fact 220. Suppose f is a differentiable function andnéR. Then 


1 
n+l 


f ( fae=— (+6, 
Example 1393. Find i (2° on One ay (en + 10x - 3) ee 
Again, we can solve this using two methods: 


1. Expand the integrand to get a 152nd-degree polynomial(!), then integrate term-by- 
term. This is doable, but absurdly tedious. 

2. Use the Five-Step Substitution Rule Recipe (and in particular the substitution u = 
Cot 302) 


But let’s do it using the third and quickest method: 


Observe that “ ie +5g°— 39 + 2) = 3x7 +102 -—3. Hence, the integrand is of the form 
x 
Gale - f’ and we can use Fact 220: 


[ (ee Pop Saas 2) (an + 10x - 3) dx = (= + 5a? — 39 + ae oa 6s 





Exercise 438. XXX (Answer on p. 1108.) 
A438. 
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107.6. Building a Derivative 
Sometimes, the integrand is secretly of the form examined in the past few subchapters. 


However, it may not be obvious and we may first have to do a little work. In particular, 
we may first have to build a derivative. 


Example 1394. Find i —— 
6 


Observe that x (e +“+ 1) =2x+1. This suggests the possibility of using Fact 219: 
/ 
f Fac Pals 


In particular, we can build the numerator so that it’s a derivative of the denominator: 


f pat ne tt +0 2n(o? +041) +0. 
rt+axutl1 


(At Zo we can remove the absolute value operator because x? + +120 for all x €R.) 


Now, 


ja | ae ube Que Tac) 


Pipe 1) LA 
= = { — (Plus Zero Trick) 


rere 
1 2r+1 1 
AU be fate 
fala cea | - 
res ») l 
2 S[in(z +e+1)- f s——arl. 


We now have the separate problem of finding | - 
do this in Ch. 106.5: 


1 1 Z, PA | 
{= —_———_;—— de = Se tant “2-40, 
e+aet1 z al V3 V3 

2 





a dz. But we already learnt to 
+x£+1 





Altogether then, 





Example 1395. XXX 


Exercise 439. XXX (Answer on p. 1109.) 
A439. 
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107.7. Antidifferentiating the Inverse Trigonometric Functions 


Example 1396. Find jf tantede, 


U vy! 
l= 


jf tanteda = tan bx: 1 dz 


= etan" >In +2] +C 


= tan 'n-5In(1+22)+C 


Exercise 440. Find the following. 
(a) i sin” x da. 
(b) i cos} x dx 
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(Times One Trick) 


(IBP) 


(f Fae =1n/s") 


(1+27 >0) 


(Answer on p. 1939.) 
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107.8. Integration with the Substitution Rule 


XXX To be written. 
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107.9. More Challenging Applications of the Substitution Rule 


The examples and exercises in this subchapter are harder than what you can expect to find 
on your A-Level exams. So, if you’re able to go through these 


Example 1397. Find i V1-2? dz for x €[-1,1]. 

We actually solved this problem in Example 1369 using Integration by Parts (IBP). 
E m™ 7 

Let’s now solve it again using the substitution x = sinu or u = sin! « « PS a 


Five-Step Substitution Rule Recipe: 


1. Compute od + cosu. 
du 
2 f vi-sae! f 1-sin*udz (Plug in =) 
= ie V cos? udx ( 
= [ |cos u| da (Vi? =| 


= J coswan 


=) COS we da (Times One Trick)” 


= —d 
[ 0s Ua x 
= | costs? pe J cos" udu 


y; “ 
= i —— du (Double Angle Formula for Cosine) 


sin2u u 
= + — 
4 2 
sInuUcosuU U 

+ 
2 2 

rcosu sin x 

= = 
2 2 


V1l-2? sin” 
ed cosu = V1-sin?u= V1-2? 


V1-22 sin! 
Conclude: f Vi-Pae- eS aera g, 


sin? u + Cos? u = 1) 


we Ls | — cosu20) 


+C 


+C (Double Angle Formula for Sine) 





+C 








d 
TF uy = +7t/2, then cosu = 0 and a involves the Cardinal Sin of Dividing by Zero (see Ch. 2.2). 


dx cosu 
This is a legitimate problem that we’ll simply ignore or hand-wave away. 
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y) 
; nb q a 9 Ss 3} 
Example 1398. Find [ —__—— (7 (for 1 +27 + 0) usine the substitution @ = 1+ 2a + 
s yea ( ) : 


1 
0 or x = 5 (u?-1). 
Five-Step Substitution Rule Recipe: 


Cas 
ile te bo y?, 
PU re a 





3 3 
aol ee 4 i ae 1)’ ae (Times Olhe Ti 
U 


ie 
ls ay ok ra ri fa dudz 


1 7 (e- 1)°3 2 dua f 3 2 dy 
= ie 5u aE t= (u 1) ee 
=f i )uee 
a ore 


2 1 
- Gu + Su?) 4 C= = (5u" ~ 16u° + 20u*) + C 


“m0 a: (1+ 20)*? -16(1+ 22)? + 20(1 +20)" ]4+C 


“i = Gia 22) (5a) 16 (1 0 


= —— (1+ 2x)" (20x? - 122 +9)+C 


The last two steps (algebra) are nice but not necessary. 


2 
Conclude: [ V1l=gdz = om (1+ 2x)7/3 (20x? - 127 +9)+C 





If we want to be really anal, correct, and proper about it, we can simply treat the above as mere 
guesswork, then formally verify that our guess or conclusion is correct, like this: 
xVv1l—2? . sin‘ x 


Let D be an interval in [-1,1] and the function f : D > R be defined by f (x) = ; 5 


We can verify that f is differentiable and f’: D > R is defined by f’ (x) = V1-.?. And hence, by Fact 
215, our conclusion holds. 
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il 
Example 1399. Find i ——— dz for x € (-5.5)3 using the substitution u = 
k 1+cos? x 22 
an x. 


Five-Step Substitution Rule Recipe: 


du 
1. Compute — + sec? x. 
dx 


sec? & 


——— dj, 
sec? 7 +1 


1 du 
sec? x +1dz 

il du 1 
oe al ee ie eee 
eam irra - 


(tan? x +1=sec? z)) 


(Proposition 16(a)) 


Conclude: 








°°7Not something you need to worry about for H2 Maths, but just so you know, the condition that 


LE (-5. *} is important and should not be omitted. (Observe that for example tan would be 
undefined.) 














1 
It is possible to find i ae dx for the more general case where x € R—but this requires more 
+ COS? £ 
work and produces a different answer. See &8, Jeffrey (1994), and Jeffrey & Rich (1994). 
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Exercise 441. Consider dx for x € (1, 00). (Answer on p. 1939.) 


il 
een? -1 


4 
(a) By using the substitution x = secu or u=sec™! x for u«¢ (0. *), show that 


dz = sin (sec! z) + Ci, for x € (1, 0). 


ili 
eV? -1 


(| 


5 for we (0,1), show that 
y 


(b) By using the substitution u 2 


1 g- = 1 


Ss = 
een? -—1 

3 Ve = 1 
L 


Cy for 2 € (lyce), 


(c) Show that sin (sec! x) ion 2 €( loo), (Hint: Plic in x= 2.) 


Exercise 442. Consider i _*_ dex for «eR. (Answer on p. 153.21.) 


(a) By using the substitution u = 2? + 1, show that 


g) 
f ate = et for xe R. 
( 


x2 +1)? x +1 


T 
(b) By using the substitution «x > tanu for ue (-5, ) show that 


5 1 =I 
ese ——— + cos (tan x) + Co, for ce R. 


a2 +1)9? > os (tan! x) 


(c) Prove that cos(tan7! x) = , for all xe R. (Hint 1: Plug in x = 0.) 


1 
Vane + 
(Hint 2: If a,b #0, then 








°°SHere’s a proof of this assertion: 


1 1 xa 
Pade a “ Meda — ba? -(1+b?)a+b=0 
a 


1+BV/(1+0?)?-4b? 14+b?4\f/(b?-1)” 14b24]b2-1] 1402 4(02-1) aia 
7 2b 7 2b 7 2b 7 2b 7 


Db 





<—_—_~—s aa 
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108. Term-by-Term Antidifferentiation and Integration of a 
Power Series 


By the Sum Rule for Antidifferentiation, we can antidifferentiate a polynomial function 
term-by-term to get an antiderivative.°”? 


Example 1400. Define the function g: R — R by 
g(x) =x +ar4+1. 


By the Sum Rule for Antidifferentiation, we can simply antidifferentiate the expression 
a? +x+1 “term-by-term” to find that the antiderivatives of g are exactly those functions 


G:R-— R defined by 


G (2) = f a+a4lde 


- fades fades f ide 


1 1 
= 50+ 5a +240, for CeR. 


And as usual, by the FTC2, we can pick any of the antiderivatives G that we just found!” 
to compute definite integrals. For example, the definite integral of g from 0 to 1 is 


=== () ==. 


| ial i 
6 6 








°This chapter is the integration analogue of Ch. 101.5 (Term-by-Term Differentiation of a Power Series). 
510 As usual, the most convenient choice will be C = 0. 
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Example 1401. Define the function h: R > R by 
Miele ie lig ee oe, 


By the Sum Rule, we can simply antidifferentiate the expression 17x? + 147+ + 11x? +827 + 
5a +2 “term-by-term” to find that the antiderivatives of h are exactly those functions 
H :R-R defined by 


H(z) = i 1705 + 1404 +1123 + 802 + 5a +2de 


gh ss do+ fic? dx+ f 82” d+ f Srdx+ f 2dx 








pt oa + 20? Ee for CER. 


And as usual, by the FTC2, we can pick any of the antiderivatives H that we just found 
to compute definite integrals. For example, the definite integral of h from 0 to 1 is 


1 17 14 i 8 5 : _ 311 
fo r= [Zoos Se 72 vate Sa? 420] = 














It turns out that happily, we can also antidifferentiate “infinite polynomial functions”—or 
power series—term-by-term: 
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Example 1402. Define the function f : (-1,1) > R by 


f (x)= 


. A 


Proceeding as usual, 





[ i : dx =-In|jl-a2|+C=-In(1-2)+C2" 
—2£ 
Hence, the antiderivatives of f are the functions F’: (-1,1) > R defined by 
F(z) =-In(1-2)+C, for CeR. 


We now introduce another method for finding the antiderivatives of f. 


We know from Ch. 101.4 that f can be represented by the power series 1+x+27+2°+.... 


That is, 


1 
=l+a+2’ 





f(x) = oa stor all 2e(—Iyly= Domain, 


l=] 


It turns out that thanks to Corollary 48 (below), happily, we can antidifferentiate this 
power series term-by-term: 


fitceateads... de 


- fide+ fede+ fades f adr... 
1 = 


= 0+ <0? + Pp wee 
2 3 4 





and conclude that f’s antiderivatives are the functions F’: (-1,1) > R defined by 





F(a) 29+ 502+ 508+ Ite 46, for all Ce R. 


And as usual, by the FTC2, we can pick any of the antiderivatives F' that we just found 
to compute definite integrals. For example, the definite integral of f from 0 to 1/2 is 


1/2 il il 1 1/2 
i f=|o+ ae es ~ 0.693 - 0 = 0.693. 
0 2 3 4 0 





1 il 1 
By the way, as a bonus, together, = and 2 show that — In (1-2) and x+ 5 tat tel. a 


differ by at most a constant. That is, for all x € R, there exists some C eR such that 
yD hie ds A 


1 
“In(l-2)=a+ 52 tat tae tet, 


In fact, C =0 and the two expressions are equal for all x € R: 
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More examples: 


Example 1403. XXX 


Example 1404. XXX 
For two more examples, see Exercises 696(iii) (N2014/I/8) 712(ii)(a) (N2011/1/4). 


Exercise 443. XXX (Answer on p. 1119.) 
A443. 





°11TIn the last step, we remove the absolute value operator because 1- x > 0 for x € (-1,1). 
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108.1. Formal Results (optional) 


Below are the two formal results justifying the term-by-term integration and antidifferen- 
tiation that we did in the above examples and exercises. (Note that above, we did term- 
by-term antidifferentiation first, then used FTC2 to justify also term-by-term integration. 
The two formal results below reverse this order.) 


Theorem 47. (Term-by-Term Integration) Let R>0, a,b¢€(-R,R) with a<b, and 
Co, C1, €2,°*: ER. Suppose the function f :(-R, R) > R is defined by 


PQ=qget hor tar = =) Cea 
n=0 


Then 


bnr+ 1 —~ qr 1 








fl Fo) de =e" +64 “= Doe 


ntl 








Proof. Omitted.°!? 











Corollary 48. (Term-by-Term Antidifferentiation) Let R > 0, a,b ¢ (-R,R) with 
a<b, and c,C1,C2,°::€ R. Suppose the function f :(—R,R) > R is defined by 


f (2) = cg + C14 + co? + c92°- = Cae 
n=0 


If g: [a,b] > R is defined by g(x) = f(x), then an antiderivative of g is the function 
G:[a,b] > R defined by 


ay 2 joe ae ntl _ gntl 
CAG) = Gy + C1 ——— + ve .——— 
il 2 3 = n+l 











Proof. Apply FTC1 to Theorem 47. 














°12See e.g. Tao (Analysis IT, 2016, pp. 76-77, Theorem 4.1.6(e)) 
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109. More Definite Integrals 


Your H2 Maths syllabus explicitly includes: 
e “finding the area of a region” 


— “bounded by a curve and lines parallel to the coordinate axes”, 
— “between a curve and a line”, or 
— “between two curves”; 

e “area below the x-axis”; 

¢ “finding the area under a curve defined parametrically”; 

e “finding the volume of revolution about the x- or y-axis”; and 


e “finding the approximate value of a definite integral using a graphing calculator”. 


So, these fascinating topics are what we’ll cover in this chapter. 
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109.1. Area between a Curve and Lines Parallel to Axes 


Example 1405. Find the area bounded by the curve y = x? and the lines y = 1 and y = 2. 


It’s often helpful (and worth the time spent) to first make a quick sketch. (You can use 
your calculator, though this shouldn’t be necessary here.) 


Figure to be 
inserted here. 


Here are two methods for finding the requested area A. 
Method 1. The area of the rectangle A+ B+C + D is 


2x 2/2 = 4/2. 


The area of C is 2. 
The area of each of B and D is 


-1 Sy 
[ a de = =| = 
—/2 3 I-V/2 


Hence, the area of A is 


iv- (2429x2522) 3 (22-1). 


Method 2. The right branch of the parabola y = 2” has equation x = \/¥. 
Hence, the area of the right half of A is 


gee as eae ae 7 
J vdy= f Vu dy = = [y ],=5(2v2 NE 


=1 


Hence, the area of A is 
2 4 
Happily, our two answers coincide (we’d be worried if they didn’t). 


Exercise 444. Find the exact area bounded by the curve y = x°, the lines y = 1 and 
y = 2, and the y-axis. (Answer on p. 1943.) 
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109.2. Area between a Curve and a Line 


Example 1406. Find the area bounded by the curve y + x? and the line y 2 Ar +5. 


Find the points at which the line and curve intersect by combining + and 2: 


Figure to be 
inserted here. 


To find the points at which the line and curve intersect, combine and 2: 


Av +5 =x? =a" +47+5=0 —- 


Een 4(-1)5 4% V36 | 


= 
aC hae 
Hence, the requested area is 


5 3 125 
[ Se |-5 +20 +5 2 |--" +504 25 |- 
=ll 2A 


Exercise 445. Consider the curve y = sinx for x € [7,27]. Find the exact area bounded 
by this curve and the line y = 1/2. (Answer on p. 1943.) 
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109.3. Area between Two Curves 


Example 1407. Find the area bounded by the curves y + 7? 87-3 and y 27-7. 


As usual, first make a quick sketch: 


Figure to be 
inserted here. 


Find the points at which the line and curve intersect by combining < and 2: 
© 280-32 fa: — 22” -8r-10=0 — pode) =O 


MeV 4-40) C5) _ de V5 
2 


2(1) 





5, -1. 


Hence, the requested area is 


5 9 ' 250 2 252 
i. 2x7 —- 8x -10dx = [52° - 4x? - 100 — - 100 - 50] -|-3-4+109] = ——_156= 
= 3 7 3 3 3 
=) ® 


Exercise 446. Find the exact area bounded by the curves y = 2-27? and y = x? +1. 
(Answer on p. 1944.) 
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109.4. Area below the z-Axis 


Example 1408. Find the area bounded by the curve y = x? — 4 and the z-axis. 


As usual, first make a quick sketch: 


Figure to be 
inserted here. 


The curve intersects the z-axis at 


5 = —>- v= +2. 


As stated earlier, the definite integral gives us the signed area. So if the curve is under 
the x-axis (as is the case here), then the computed definite integral will be negative: 


2 


ae Eed-G)-B9)- 
Wee Ada = 5 polls 5 hs a. ® 


But of course, an area is simply a magnitude (i.e. we don’t care about the sign). So, our 


answer is that the requested area is Ze 


Exercise 447. Find the exact area bounded by y = x* - 16 and the z-axis. (Answer on 
p. 1944.) 
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109.5. Area under a Curve Defined Parametrically 
Example 1409. The curve C’ has parametric equations 
ee y=4-2°, for te R. 


Find the exact area bounded by C,, the x-axis, and the lines x = -2 and x = -1. 


As usual, first make a quick sketch of C: 


Now, the line x = —2 intersects C at -2 =t?-2 or t =0. 
Similarly, the line x = —-1 intersects C at -1 =t?-2 or t=1. 


So the requested area is 
(Times One Trick) 


(Substitution Rule) 


= i (4 - ) ae dé (Change of limits) 
t=0 


1 
= i) Iie oe alt 
0 


S. 1 
= 18 - =] = 
| 8 Jo 


Exercise 448. The curve C’ has parametric equations 


=) f— — for te R. 


Find the exact area bounded by the curve, the y-axis, and the lines y = 1 and y = 2. 
(Answer on p. 1944.) 





1126, Contents www.EconsPhDTutor.com 


1127, Contents www.EconsPhDTutor.com 


109.6. Volume of Rotation about the z-axis 


Example 1410. Consider the line segment y = 2, x € [0,3]. 


Rotate this line segment about the z-axis to form a 3D cylinder with height 3 and base 
radius 2. 


Q. Find the cylinder’s volume V. 


Figure to be 


inserted here. 





Method 1 (Primary-school formula). The cylinder’s base area is 7-2? = 47. So, by 
a primary-school formula, 


V = Base x Height = 471 x 3 = 127. ¥ 


It turns out that we can also obtain this volume using integration: 


Method 2 (Integration). In Ch. 104.4, we dealt with the 2D case. We wanted to find 
the area under a curve. To do so, we used the intuitive idea of slicing the area under the 
curve into infinitely many infinitely thin rectangles—then adding up the areas of these 
rectangles. 


Here in the 3D case, we want to find the volume of a cylinder. To do so, we'll use the 
similar idea of slicing the cylinder into infinitely many infinitely thin slices—then adding 
up the volumes of these slices: 


Figure to be 


inserted here. 





At each x, the corresponding infinitely thin slice has area 47t. 


Let f (a) be the volume of the cylinder from 0 to x. We now use the same intuition that 
led to the two FTCs: 


FTC1: The derivative of f must be one of these thin slices. That is, 


df _ 
dx 


FTC2: We can use an antiderivative of f to find V: 


AT. 


oa 3 
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Example 1411. Consider the line segment y = 37, x € [0,1]. 


Rotate this line segment about the x-axis to form a cone with height 1 and base radius 
3: 


Q. Find the cone’s volume V. 


Figure to be 
inserted here. 


Method 1 (Secondary-school formula). The cone’s base area is 1-3” = 97. By a 
secondary-school formula, 


i 1 
We q base x Height = a x1 Som, So 


It turns out that we can also obtain this volume using integration: 


Method 2 (Integration). Again, we’ll slice the cone into infinitely many infinitely thin 
slices, then add up the volumes of these slices: 


Figure to be 
inserted here. 


At each 2, the corresponding infinitely thin slice has area ty? = 7 (32) Son. 
Let f (x) be the volume of the cone from 0 to a. 
FTC1: The derivative of f must be one of these thin slices. That is, 


— = nx? 
Ag TX 


FTC2: We can use an antiderivative of f to find V: 


ldf 1 ! a 
Ve ax= [ Onx da = snr" | = ait, J 
Ue: 0 : 
Happily, using integration, we’ve arrived at the same volume as when we used our 
secondary-school formula. 
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Example 1412. Consider the curve y = x”. Rotate it about the z-axis to form a solid. 
Let V be the volume of the portion of the solid that is between the vertical lines x = 1 
eid = 2, 


Q. Find V. 


Figure to be 
inserted here. 


This time, we do not know of any formulae from primary or secondary school that will 
help us find V. Instead, we can only use integration: 


Again, we’ll slice the solid into infinitely many infinitely thin slices, then add up the 
volumes of these slices: 


Figure to be 
inserted here. 


2 
At each x, the corresponding infinitely thin slice has area 7ty” = 7 (2) = mx". 


Let f (7) be the volume of the cylinder from 0 to x. 
FTC1: The derivative of f must be one of these thin slices. That is, 


df _ 


Tx. 
dx 


FTC2: We can use an antiderivative of f to find V: 


y D 2 
va) a= f rat da =| Z| 
1 dx 1 5 
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Here’s the general result or formula whose intuition was illustrated in the above examples. 
This isn’t on List MF26, so yea you'll have to mug it. 


Proposition 18. Given a curve y = f (x), its volume of rotation about the x-axis from a 
to b is 


[mec or fou (x)]}° de. 








Proof. Omitted.>!8 











We can now easily find the general formulae for volumes of spheres and cones: 





513We cat prove this only after we’ve formally and precisely defined such terms as volume and volume 
of rotation. And given that this subchapter is the only place in H2 Maths Calculus where volume is 
discussed, I decided not to bother adding another 10 pages to the Appendices for that task (and that 
no one will read anyway). 
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Example 1413. Find the volume of a sphere with radius r. 


Figure to be 
inserted here. 


To do so, consider the curve x? + y* = r?, for x € [-r,r] and y > 0. Rotating this curve 
about the x-axis produces our sphere: 


Figure to be 
inserted here. 


Hence, by Proposition 18, the volume of the sphere is 


eae ae ea), rae 
[or de-n f 1-2 de =| 1x - | =n |( - 3 
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Example 1414. Find the volume of a cone with base radius r and height h. 


Figure to be 
inserted here. 


To do so, consider the line segment y = 5h for x € [0,h]. Rotating this curve about the 


x-axis produces our cone: 


Figure to be 
inserted here. 


Hence, by Proposition 18, the volume of the cone is 





More examples: 


Example 1415. XXX 


Exercise 449. Consider the curve y = sinz. Find its volume of rotation about the x-axis 
from 0 to 7. (Answer on p. 1945.) 
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109.7. Volume of Rotation about the y-axis 


In the previous subchapter, we learnt to find the volume of rotation about the x-axis. 


Of course, finding the volume of rotation about the y-axis is exactly analogous: 


Definition 228 (informal). Given a curve, rotate it about the y-axis to form a solid. 
We call the volume of the solid between y = a and y = b the curve’s volume of rotation 
about the y-axis from a to b. 


Figure to be 
inserted here. 


Proposition 19. Given a curve x = f (y), its volume of rotation about the y-axis from a 
to b is 


[i retay for (y)}° dy. 

















Proof. Omitted. 


Example 1418. Consider the curve y = x”. Rotate it about the y-axis to form a solid. 
Let V be the volume of the portion of the solid that is between the horizontal lines y = 1 
and y=. 


Figure to be 
inserted here. 





Example 1419. XXX 
Example 1420. XXX 


Example 1421. XXX 
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109.8. Finding Definite Integrals Using Your TI84 


Example 1422. Let A be the area bounded by the curve y = e*, the x-axis, and the 
vertical lines x = 1 and x = 2. Find A using your T184. 


Step 1. Step 2. Step 3. Step 4. 


Step 5. Step 6. Step 7. 


Step 9. Step 10. 


. Press to turn on your calculator. 

. Press S&S. 

. Press blue button and then e* (which corresponds to the button). Then 
press and altogether you will have entered e*™”. 

4. Now press (G80) and the calculator will (very slowly) graph the given equation. 

. Press the blue button and then CALC (which corresponds to the @QN@3 
button), to bring up the CALCULATE menu. 

. Press (7) to select the “ i f (x) dz” option. This brings you back to the graph. 


. The TI84 is now prompting you for “Lower Limit?” Simply press (1). 

. Now press and you will have told the TI84 that your lower limit is x = 1. 

. The TI84 is now similarly prompting you for “Upper Limit?” Simply press (@). 

. Now press and you will have told the TI84 that your upper limit is x = 2. 
The TI84 also informs you that “ i) f (x) dx = 2.6046615”. This is our desired area A 
(which our TI84 also crudely shades in black). 
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110. Introduction to Differential Equations 


We won't actually be doing anything new in this subchapter. Instead, all we’ll do is 
introduce the four terms differential equation, general solution, initial condition, 
and particular solution. 


For convenience, we reproduce from Ch. 103.4 this result: 


Fact 215. Let f(x) and g(x) be expressions containing the variable x. If ~4 (Qs 
fe 
Cae doe i f (x) dv =g(a)+C. 


Example 1423. Here’s a differential equation: 


dy 1 


= 7. 
dx 


Q. Solve <. (That is, express y in terms of x.) 


3 
By Fact 215, y= i x’ dx = = +C’. We call 


ls 
SC, 
ay 


the general solution to the differential equation +. It is general because the constant 
of integration C is free to vary (it can be any real number), so that there are infinitely 
many possible solutions for y. 


Suppose we are now also told that if x =0, then y= 1. That is, 
3 
(x,y) = (0,1). 


This additional piece of information 3 is often called an initial condition. Here’s why: 


Say y is the mass (g) of bacteria in a Petri dish and x is time (s). Then the initial 
condition 2 says that at time x = 0 (i.e. “initially”), the mass of bacteria y = 1. And over 


time, the mass of bacteria changes according to the differential equation z. 


03 
Plugging the initial condition 3 into the general solution 2 we get 1 = a +C'=C. Hence, 


eet 
2 


d 
We call + the particular solution to “the differential equation = + ? with initial 
x 


condition (x, y) 3 (0,1)”. (It is particular in that it corresponds to a specific, given initial 
condition. ) 
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Example 1424. Solve this differential equation: 


The general solution to 2 ig 


y= J cosade = sina +C, 


We are now given this initial condition: 
(x,y) = (0,2). 

Plugging 3 into 2 we get 2=sin0+C =C. Hence, the particular solution to 2 ig 
y * sing + 2. 


Example 1425. Solve this differential equation: 


dy 1 2 : 
== Se SS Ile. 


dx 


The general solution to + ig 
2 2 F ae 
y= fe —sinxdz = gz tcosz+l. 


We are now given this initial condition: 


(x,y) = (0,3). 


03 
Plugging 3 into 2 we get 3 = +cos0+C =1+C or C = 2. Hence, this particular 


. a 
solution to = is 


Tb 
US eee 





In the above examples, our initial condition has 7 = 0. But this need not generally be so: 
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Example 1426. Solve this differential equation: 


+exp Z, for @ = 0), 


The general solution to + is 
2 1 
v= [ —+exprdzr =In|r|+ expr+C 
a 
We are now given this initial condition: 
3 
(x,y) = (1,2). 


Plugging = into 2, we get 2=In [lJ +expl1+C=e+C or C=2-e. Hence, the particular 
solution to = is 


Fe 
y wy CU 


d 
Exercise 451. Solve — =expxsinx. Given also the initial condition (x,y) = (0,1), find 
Di 





the particular solution. (Answer on p. 1947.) 
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110.2. = f(y) 
da 


Again, this subchapter will involve nothing new. We’ll just put together stuff we’ve already 
learnt: 


Example 1427. Solve this differential equation: 


oe 
ap 


By the Inverse Function Theorem (Ch. 91.2), 


The general solution to 2 ig 


2 


1 1 1 
x2 f aay +0 or y= a (for 2 #0). 


We are now given this initial condition: 


3 
(x,y) = (0,1). 
Plugging 3 into Z| we get C= 1. Hence, the particular solution to = is 


2 1 


=e (for sl), 


Y 
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Example 1428. Solve this differential equation: 
for y € (0,71). 


By the Inverse Function Theorem (Ch. 91.2), 


a sin y, for y € (0,70). 
dy 


The general solution to + ig 


ra J sinydy = cosy +C 


We are now given this initial condition: 


Plugging 3 into Z we get C =0. Hence, the particular solution to 2 ig 
© =-cosy or y = cos (-2). 


Example 1429. Solve this differential equation: 


By the Inverse Function Theorem (Ch. 91.2), 
dx 1 1 


dy ity? 
By Proposition 16(a), the general solution to = is 
st fia im 5 dy = tan ty+C 


We are now given this initial condition: 


(x,y) = (0,1). 


Plugging 3 into 2 we get C= -71/4. Hence, the particular solution to + ig 


2 A 2 =| 
=t =] =t toe A: 
= lane ee Zi y= tan (z m 
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Example 1430. Solve this differential equation: 


2 = siny, for y € (0,71). 


By the Inverse Function Theorem (Ch. 91.2), 


dx 1 1 
—= = cosecy, for y € (0,71). 
dy siny 2 ry) 
By Proposition 16(g), 
i= i cosecy dy = — In |cosecy + cot y| + C. 


We are now given this Hint: cosecy + cot y = cot Z (Can you show this?)°“ 


Hence, x = —In|cosecy + cot y|+C =-In cot 5 C2-In (cot x +C. 


(We can remove the absolute value operator because cot 5 > 0 for y € (0,72).) 


Now rearrange 2 to also get 
exp (C'-2) = cot 5 or exp («- C) = tan 5 y 2 2tan! (exp(z-C)). 
Altogether, the general solution to 2 is 
y = 2tan”! (exp(z-C)). 


Suppose we are now given this initial condition: 


1 
Plug 3 into 2 to get 1 =-In (cot =| +C =-Inl1+C=C. Hence, the particular solution 


oe 
to = 1s 


2 2-In(cot¥) +1 y = 2tan! (exp (zx -1)). 








1 cos 1+cos 14+2cos? ¥- cos 2 
M4 cosecy + cot y = + be ae 2 = 2 = cot . 


siny siny siny 2sin 4 cos 4 sin 3 


5 
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dey 


In the last two subchapters, we’ve been looking at first-order differential equations. 
We now look at second-order differential equations: 


Example 1431. Solve this differential equation: 


3 
By Fact oie = f @az- ee oy 
dx 3 


xe i - 
Again by Fact 215, y = f Z+Cde- pe: 


Hence, the general solution to 2 ig 
4 
20x = 
=—+Cr+C. 
77 


Observe that with a second-order differential equation, our general solution will (usually) 
contain two constants of integration. And so, to get a particular solution, we need to be 
given two initial conditions. 


Suppose we’re now given these two initial conditions: 
3 4 
(x,y)=(0,1) and = (a, y) = (1,0). 


Plugging 3 into 2 we get 1=0+0+CorC 21. 
13 


il 
Plugging * and 2 into 2 we get 0= 19 +C+lorCe= ie 


Hence, the particular solution to 4 is 
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Example 1432. Solve this differential equation: 


Ga ie 
—“ =sin2. 
dx? 


The general solution to 2 ig 


y2 [ ( f sneae] dx = { (-cosx+C) dg=—=sina Cr eC. 


We’re now given these two initial conditions: 
(v,y)=(0,1) and (x,y) = (7,0). 


Plugging 3 into 2 we get 1=0+0+C or C21. 


1 
Plugging * and 2 into 2 we get 0=0+Cr+1 or C= a 


Hence, the particular solution to + ig 


6 P 40 
scale 
7 


d2 
Exercise 453. Solve = = expzsinz. Given also the initial conditions (x,y) = (0,1) 


a 
and (x,y) = (1,0), find the particular solution.x (Answer on p. 1947.) 
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110.4. Word Problems 


The H2 Maths syllabus includes: 


« “formulating a differential equation from a problem situation”; and 


- “interpreting a differential equation and its solution in terms of a problem situation”.°° 


So these are what we'll cover in this subchapter. 
Example 1433. The mass of bacteria on a plate grows at a rate that is inversely pro- 


portional to the present mass of bacteria. Express the mass of bacteria as a function of 
time. 


Let x be the mass of bacteria. Let t be time. We are given that the growth rate of x is 
inversely proportional to x. That is, 


Sen for some k € R (and zx > 0). 
dite <a 


Or, 


Hence, 


Further rearranging, we have 7 = +,\/2k(t-C). Since the mass of bacteria can’t be 
negative, we can reject the negative root and conclude that 


e=f2k(t—C). 


We are now further given the initial conditions (t,x) = (0,1) and (t,z) = (1,2). Plugging 
these into the above equation, we get 


1=\72k (0) and 2=\/ 2k (1—C). 
polving, C =—1/3 and k =3/2. Thus, 


TH=A/ ore lk 








°15T understand “problem situation” to mean more simply “word problem”. 
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Exercise 454. Suppose a small object is initially on the surface of the Earth. It is then 
shot upwards with initial velocity V. The minimum initial velocity such that the object 
travels upwards forever is called that object’s escape velocity from Earth (so called 
because the object escapes Earth’s gravity). 


In this exercise, we will work towards finding the escape velocity, V..°'° (Answer on p. 
1948.) 


(a) Newton’s Law of Gravitation states that the force of attraction F (unit: N or kg ms”) 
between two point masses / and m (kg) is proportional to the product of their masses 
and inversely proportional to the square of the distance r (m) between them. 


Write down this law as an equation. Your equation should contain a constant denoted 
G (this is the gravitational constant). 


(b) Momentum is defined as the product of mass m and velocity v. Newton’s Second 
Law of Motion states that force F is the rate of change of momentum. 
(i) Write down this law as an equation. 
(ii) Show that if mass m is constant, then F = mo. 
Let m be the mass of our small object and M be the mass of the Earth. Assume 


e The Earth is a fixed perfect sphere with radius Rm and its mass is concentrated at a 
single point at its centre. 


e Our small object is initially on the surface of the earth. At time t = Os, it is shot 
upwards with initial velocity V > 0. 


¢ Once the ball is shot upwards, the sole force acting on it is gravity. (In particular, 
there is no air resistance or any other form of friction.) 


Figure to be 


inserted here. 





Let r be the distance (m) the ball is above the centre of the Earth and h be the height 
(m) is above the surface of the Earth. (Hence, r = R +A.) 


dv GM 
Expl h —S, 
(c) Explain why — ay ce 
(d) From (c), we have i ar = {= dr. By the Substitution Rule, i wd 
. 
dr 
ae 
Ta 


(i) Show that f para gh +C. 


(ii) Find f= oF ar. 
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110.5. Yet Another Way to Define Sine and Cosine (optional) 


XXX 





because they are vectors. 


dv dv dr dt | dv dr Ch x? dv dr 
517% D ” h | = t. 
Hi ._ dt a ail! dp haa ear aw Te 
as r r dv dt rdv dt rdv 
Similarly, f Gev= f Garay” wa te 











dv 
Hence, [ ae ar = a dv. 
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111. Revisiting In, log,, and exp (optional) 


In Ch. 28, we informally defined the natural logarithm function In as giving us a 
particular area under a particular graph: 


Int is defined as 
this shaded area 














Now that we’ve learnt about integration, we can (at long last) write down its formal defi- 
nition: 


Definition 229. The natural logarithm (function) In: R* = R is defined by 


ay 
Ine= fi =a. 
1 ¢ 





By the above definition and the FTC1, we have this result: 
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1 
= 


Fact 221. The derivative of In: R* > R is the function f :R* +R defined by f (x) = 





Proof. By Definition 229, for any x € R’, 


Inx= ff. 














d 
And so, by the FTC1, aa =F 
ie 


Since In is differentiable, by Theorem 29, it is also continuous. Hence the following result 
that was given long ago in Ch. 87.4 and now reproduced: 


Fact 202. The natural logarithm function, ln, is continuous. 


With the above definition, it is not difficult to prove some basic properties of the natural 
logarithm function: 


Fact 222. Suppose x,y>0. Then 


(ey lie 0) 
(b) In(zy) =Inz+Iny. 


(c) He =-Inz. 
aL 


(d) In = lay 





Proof. (a) Simply plug x = 1 into Definition 229, then apply Definition 225(a): 


Ly 
Int= [ Eee ee 
1 & 


(b) We will play a little trick. Differentiate each side with respect to x to get 


d 1 dy 1 Idy 
i} oo a 
dx n (ry) ey (v * ost) 


~~ -apde 
d 1 

—(l ] =—+4+——, 
q, net ny) ae 


By Corollary 47, In(xy) and Inz +Iny differ by at most a constant: 
In(zy) =Inz+Iny+C, for some Ce R. 


Plug in x =1 to get Iny=Inl+ny+C=Iny+C, or C=0. Thus, In(zy) =Inz+Iny. 
(c) See Exercise 455(a). 
(d) follows from (b) and (c). 
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Exercise 455. Prove Fact 222(c) (use the same trick as in the proof of Fact 222(b)). 
(Answer on p. 1950.) 


Fact 223. Suppose x>0 andnéeR. Then Inz”=nInz. 


Proof. See p. 1724 (Appendices). 














We can now prove the following result (given long ago in Ch. 28.2 and now reproduced): 


Theorem 6. e= lim (1 + - | 
nN Co nN 
Proof. By Fact 221, In'1 4 i/l=1, 
Also, by Definition 203 (of the derivative), 


Inz-In1l 1 1\" 
In’ 1 = lim ———— = lim ae lim nin(t+—)? lim in(1 +=] , 
n n 


zl g-1 zl y— 1 143-41 N—+00 





i 1 
where at 3 we let x = 1+-— and hence alsox > 1 — 1+--1 — n- +o. 
n n 
1 n 
Together, + and 2 show that lim In (1 + - | 1 Thus, 
nN-7+00 n 
1 nr 1 nm 1 nm 
e=exp 1 * exp| lim In(1 +=] )e lim Jexp(in(1 + =) ) © lim (1+ =] ; 
N+ 00 n MN £00 Nn N>+00 n 


where 2 uses the continuity of exp and Theorem 25 (continuity allows us to “move” limits), 
while 2 uses Definition 85 (exp is inverse of In). 














1 n 
The above proof has also shown that e= lim (1 + - | . 
n—-—0o n 
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111.1. Revisiting Logarithms 
In Ch. 5.7, Informal Definition 32 of logarithms said that 
=n —= x = logy n. 


Now that we’ve formally defined the natural logarithm In, we can replace Definition 32 
with the following definition: 


Definition 230. Suppose b,n > 0 with b#1. Then the base b logarithm of n is denoted 
log, n and defined by 





In Ch. 5.7, we gave informal proofs of the Laws of Logarithms. With the above Definitions 
and results, we can now prove these rigorously and indeed more easily: 


Proposition 2. (Laws of Logarithms) Let x¢R and a,b,c>0 with b#1. Then 
(a)  log,1=0 
(b) log, b=1 
(c)) “log, 6" = 
(d)  p°&*=4 


(e) clog,a= log, a° 


(f) log, a log, a (Logarithm of Reciprocal) 
a 


(g) log, (ac) = log, a + log, c (Sum of Logarithms) 


(h) log, -= log, a — log, c (Difference of Logarithms) 
€ 
(Change of Base) 


1 
() legac= 5 8a e fora#l 





Proof. Below, = indicates the use of Definition 230 





» nl 
log, 1 = —— =0. 
(a) log, fad 
, nb 
b) log, b= — =1. 
(b) log, lab 
» nd® e«lnb 
(c) log, b” = —— = = x. (The middle step uses Fact 223.) 
Inb Inb 
, Ina 
(d) log, a= nb: Rearranging, (log, a) Inb = Ina. 
n 
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By Fact 223, In b!%? = Ina. 
Apply exp: exp (In pie) = exp (Ina) or b'°8* = a (exp is defined as inverse of In). 
(e)—(j) See Exercise 456. 














Exercise 456. Prove Proposition 2(e)—(j). (Answer on p. 1950.) 
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111.2. Revisiting the Exponential Function 


We reproduce from Ch. 28 our formal definition of the exponential function: 





We now prove certain basic properties about the exponential function: 


Fact 224. Suppose x,ye R. Then 
(a) exp0=1. 

(b) expl=e. 

(c) exp(a+y) =expxexpy. 


(d) exp(-z) = 


expr 
exp x 


(e) exp (ey) = SP. 





Proof. (a) By Fact 222(a), Inl =0. So, exp0= 1. 
(b) This is actually not a result but our definition of Euler’s number (Definition 86). 
(c) Let a=expaz and b=expy. Then by definition, x = Ina and y = Inb. So, 


exp (4 + y) = exp (Ina + Inb) + exp [In (ab)] = ab = (exp) (expy), 


where = uses Fact 222(b). 


(d) exp (-2) expx 2 exp (-2 +2) = exp0 4. Rearranging (note that exp > 0 for all 


x €R), 














(e) is immediate from (c) and (d). 





We can now also easily prove that the derivative of the exponential function is itself: 


Fact 225. as exp = exp. 
dx 


d 
Proof. By the Chain Rule, ae In (exp) + 
ze 





—exp 2. 
exp x dx . 


d 
But observing that In (exp x) = x, we also have ae In (exp x) 21. 
a 














d 
Put + and 2 together and rearrange to get oe = exp 2. 
a 
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Interestingly, the only function that is its own derivative is the exponential function (or 
constant multiples thereof): 


Proposition 20. Let f: D>R be a nice function. Suppose f is its own derivative, 1.e. 
f=f'. Then there exists some ce R such that for all x € D, 


i (@) = cexp 7: 





Proof. Define g: D > R by g(x) = f (x) exp(-z). Then for each x D, 


g' (x) = f' (x) exp (-2) — f (x) exp (-x) = 0. 


Since the derivative of g is the zero function, by Proposition 8, g must be a constant 
function. That is, there exists some c € R such that for all 7 € D, or 














g(2)=c or f (x) exp (-2) =c or ; (@) =Cexpe. 


1155, Contents www.EconsPhDTutor.com 


Part VI. 
Probability and Statistics 
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Two holes are better than one. Any mouse will tell you that. 


— Willy Wonka (1972). 


If you don’t get this elementary, but mildly unnatural, mathematics of elementary 
probability into your repertoire, then you go through a long life like a one-legged 
man in an ass-kicking contest. You’re giving a huge advantage to everybody else. 


— Charlie Munger (1994). 
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112. How to Count: Four Principles 


How many arrangements or permutations are there of the three letters in CAT? For 
example, one possible permutation of CAT is TCA. 


To solve this problem, one possible method is the method of enumeration. That is, 
simply list out (enumerate) all the possible permutations. 


ACT, ATO, CAT, CTA, TAC, TCA. 


We see that there are 6 possible permutations. 


Enumeration works well enough when we have just three letters, as in CAT. Indeed, enu- 
meration is sometimes the quickest method. 


In contrast, the 13 letters in the word UNPREDICTABLY have 6, 227,020,800 possible 
permutations. So enumeration is probably not practical. 


To help us count more efficiently, we’ll learn about four basic principles of counting: 
1. The Addition Principle (AP); 

2. The Multiplication Principle (MP); 

3. The Inclusion-Exclusion Principle (IEP); and 

4. The Complements Principle (CP). 
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112.1. How to Count: The Addition Principle 


The addition principle (AP) is very simple. 


Example 1434. For lunch today, I can either go to the food court or the hawker centre. 
At the food court, I have 2 choices: ramen or briyani. At the hawker centre, I have 3 


choices: bak chor mee, nasi lemak, or kway teow. 


Altogether then, I have 2+3=5 choices of what to eat for lunch today. 





Here’s an informal statement of the AP:°!8 


The Addition Principle (AP). J have to choose a destination, out of two possible 
areas. At area #1, there are p possible destinations to choose from. At area #2, there 


are q possible destinations to choose from. 
The Addition Principle (AP) simply states that I have, in total, p+q different choices. 





(Just so you know, the AP is sometimes also called the Second Principle of Counting 
or the Rule of Sum or the Disjunctive Rule.) 


Of course, the AP generalities to cases where there are more than just 2 “areas”. It may 
seem a little silly, but just to illustrate, let’s use the AP to tackle the CAT problem: 





°!8See section 147.1 (Appendices) (optional) for a more precise statement of the AP. 
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Example 1435. Problem: How many permutations are there of the letters in the word 
CAT? 


We can divide the possibilities into three cases: 


Case #1. First letter is an A. Then the next two letters are either CT or TC—2 possi- 
bilities. 

Case #2. First letter is a C. Then the next two letters are either AT or TA—2 possibil- 
ities. 

Case #3. First letter is a T. Then the next two letters are either AC or CA—2 possibil- 
ities. 

Altogether then, by the AP, there are 2+ 2+2 = 6 possibilities. That is, there are 6 
possible permutations of the letters in CAT. These are illustrated in the tree diagram 
below. 
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The next exercise is very simple and just to illustrate again the AP. 
Exercise 457. Without retracing your steps, how many ways are there to get from the 
Starting Point to the River (see figure below)? (Answer on p. 1951.) 


River 


—_ = 


Starting Point 


Exercise 458. How many permutations are there of the letters in the word DEED? 
Illustrate your answer with a tree diagram similar to that given in the CAT example 
above. (Answer on p. 1951.) 
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112.2. How to Count: The Multiplication Principle 


Example 1436. For lunch today, I can either have prata or horfun. For dinner tonight, 
I can have McDonald’s, KFC, or Pizza Hut. 


Enumeration shows that I have a total of 6 possible choices for my two meals today: 


(Prata, McDonald’s), (Prata, KFC), (Prata, Pizza Hut), 


(Horfun, McDonald’s), (Horfun, KFC), (Horfun, Pizza Hut). 


Alternatively, we can use the Multiplication Principle (MP). I have 2 choices for lunch 
and 3 choices for dinner. Hence, for my two meals today, I have in total 2 x 3 = 6 possible 
choices. 





Here’s an informal statement of the MP:°!% 


The Multiplication Principle (MP). I have to choose two destinations, one from 
each of two possible areas. At area #1, there are p possible destinations to choose from. 
At area #2, there are q possible destinations to choose from. 


The Multiplication Principle (AP) simply states that I have, in total, pxq different choices. 





(The MP is sometimes also called the Fundamental or First Principle of Counting 
or the Rule of Product or the Sequential Rule.) 


Of course, the MP generalities to cases where there are more than just 2 “areas”. Here’s an 
example where we have to make 3 decisions: 





°!9See section 147.1 (Appendices) (optional) for a more precise statement of the MP. 
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Example 1437. For breakfast tomorrow, I can have shark’s fin or bird’s nest (2 choices). 
For lunch, I can have black pepper crab or curry fishhead (2 choices). For dinner, I can 
have an apple, a banana, or a carrot (3 choices). By the MP, for tomorrow’s meals, I have 
a total of 2 x 2x 3 = 12 possible choices. We can enumerate these (I’ll use abbreviations): 


(SF, BPC, A), (SF,BPC,B), (SF, BPC, C), (SF, CF, A), 
(SF, CF, B), (SF,CF,C), (BN, BPC, A), (BN, BPC, B), 


(BN, BPC, C), (BN, CF, A), (BN, CF, B), (BN, CF, C). 


Example 1438. Problem: How many four-letter words can be formed using the letters 
in the 26-letter alphabet? 


Let’s rephrase this problem so that it is clearly in the framework of the MP. We have 4 
blank spaces to be filled: 


These 4 blanks spaces correspond to 4 decisions to be made. Decision #1: What letter 
to put in the first blank space? Decision #2: What letter to put in the second blank 
space? Decision #3: What letter to put in the third blank space? Decision #4: What 
letter to put in the fourth blank space? 


How many choices have we for each decision? 
For Decision #1, we can put A, B, C, ..., or Z. So we have 26 choices for Decision #1. 


For Decision #2, we can again put A, B, C, ..., or Z. So we again have 26 choices for 
Decision #2. 


We likewise have 26 choices for Decision #3 and also 26 choices for Decision #4. 


Altogether then, by the MP, there are 26 x 26 x 26 x 26 = 264 = 456,976 ways to make our 
four decisions. 


Solution: There are 26* = 456,976 possible four-letter words that can be formed using the 
26-letter alphabet. 
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Example 1439. One 18-sided die has the numbers 1 through 18 printed on each of its 
sides. Another six-sided die has the letters A, B, C, D, E, and F printed on each of its 
sides. We roll the two dice. How many distinct possible outcomes are there? 


Again, let’s rephrase this problem in the framework of the MP. Consider 2 blank spaces: 


ile 


These 2 blank spaces correspond to 2 decisions to be made. Decision #1: What number 
to put in the first blank space? Decision #2: What letter to put in the second blank 
space? 


Again we ask: How many choices have we for each decision? 
For Decision #1, we can put 1, 2, 3, ..., or 18. So we have 18 choices for Decision #1. 
For Decision #2, we can put A, B, C, D, E, or F. So we have 6 choices for Decision #2. 


Altogether then, by the MP, there are 18 x 6 = 108 ways to make our two decisions. In 
other words, there are 108 possible outcomes from rolling these two dice. 


(If necessary, it is tedious but not difficult to enumerate them: 1A, 1B, 1C, 1D, 1E, 1F, 
2A, 2B, ..., 17E, 17F, 18A, 18B, 18C, 18D, 18E, and 18F.) 


Exercise 459. A club as a shortlist of 3 men for president, 5 animals for vice-president, 
and 10 women for club mascot. How many possible ways are there to choose the president, 
the vice-president, and the mascot? (Answer on p. 1952.) 


Exercise 460. (Answer on p. 1952.) The highly stimulating game of 4D consists 
of selecting a four-digit number, between 0000 and 9999 (so there are 10,000 possible 
numbers). 


Your mother tells you to go to the nearest gambling den (also known as a Singapore Pools 
outlet) to buy any three numbers, subject to these two conditions: 


e The four digits in each number are distinct. 


e Each four-digit number is distinct. 


How many possible ways are there to fulfil your mother’s request? 
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112.3. How to Count: The Inclusion-Exclusion Principle 
The Inclusion-Exclusion Principle (IEP) is another very simple principle. 


Example 1440. For lunch today, I can either go to the food court or the hawker centre. 
At the food court, I have 4 choices of cuisine: Chinese, Indian, Malay, and Western. At 
the hawker centre, I have 3 choices of cuisine: Chinese, Malay, and Thai. 


There are 2 choices of cuisine that are common to both the food court and the hawker 
centre (Chinese and Malay). 


And so by the Inclusion-Exclusion Principle (IEP), I have in total 4+ 3-2 =5 choices of 
cuisine. The Venn diagram below illustrates. 


Why do we subtract 2? If we simply added the 4 choices available at the food court 
to the 3 available at the hawker centre, then we’d double-count the Chinese and Malay 
cuisines, which are available at both the food court and the hawker centre. And so we 
must subtract the 2 cuisines that are at both locations. 


Food Court (4 choices of cuisine) Hawker Centre 


x 3 choices) 


Chinese 


Malay 


Western 


Total: 5 choices 


Example 1441. Problem: How many integers between 1 and 20 are divisible by 2 or 5? 
There are 10 integers divisible by 2, namely 2, 4, 6, 8, 10, 12, 14, 16, 18, and 20. 

There are 4 integers divisible by 5, namely 5, 10, 15, and 20. 

There are 2 integers divisible by BOTH 2 and 5, namely 10 and 20. 


Hence, by the IEP, there are 10+ 4-2 = 12 integers that are divisible by either 2 or 5. 
(These are namely 2, 4, 5, 6, 8, 10, 12, 14, 15, 16, 18, and 20.) 





Here’s an informal statement of the IEP:>2” 





°20See section 147.1 (Appendices) (optional) for a more precise statement of the IEP. 
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112.4. How to Count: The Complements Principle 


The Complements Principle (CP) is another very simple principle. 


Example 1442. The food court has 4 types of cuisine: Chinese, Malay, Indian, and 
Other. 


I’m at the food court but don’t feel like eating Malay or Chinese. So by the Complements 
Principle (CP), I have 4 - 2 = 2 possible choices of cuisine (Indian and Other). 





Here’s an informal statement of the CP:°?! 


The Complements Principle (CP). There are p possible destinations. I must choose 
one. I rule out q of the possible destinations. 


The Complements Principle says that I am left with p-q possible choices. 


Exercise 462. There are 10 Southeast Asian countries, of which 3 (Brunei, Indonesia, 
and the Philippines) are not on the mainland. How many mainland Southeast Asian 
countries are there that a European tourist can visit? (Answer on p. 1953.) 








°21See section 147.1 (Appendices) (optional) for a more precise statement of the CP. 
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113. How to Count: Permutations 


In this chapter, we’ll use the MP to generate several more methods of counting. 


But first, some notation you should find familiar from secondary school: 


Definition 231. Let n¢ Zj. Then n-factorial, denoted n!, is defined by n! = nx (n- 1) x 
ox itor m2 Wand: (l= te 


Example 1443) 0!=] 1, 1!es 1,2) S222 sla3x2x126, 4) =4%3 «2% We 24. 
Sea x4 x ax 2x 1= 120; 





Exercise 463. Compute 6!, 7!, and 8!. (Answer on p. 1954.) 
We now revisit the CAT problem, using the MP: 


Example 1444. Problem: How many permutations (or arrangements) are there of the 
three letters in the word CAT? 


Let’s rephrase this problem in the framework of the MP. Consider three blank spaces: 


These 3 blank spaces correspond to 3 decisions to be made. Decision #1: What letter to 
put in the first blank space? Decision #2: What letter to put in the second blank space? 
Decision #3: What letter to put in the third blank space? 


Again we ask: How many choices have we for each decision? 
For Decision #1, we can put C, A, or T. So we have 3 choices for Decision #1. 


Having already used up a letter in Decision #1, we are left with two letters. So we have 
2 choices for Decision #2. 


Having already used up a letter in Decision #1 and another in Decision #2, we are left 
with just one letter. So we have only 1 choice for Decision #3. 


Altogether then, by the MP, there are 3 x 2 x 1 = 3! = 6 possible ways of making our 
decisions. This is also the number of ways there are to arrange the three letters in the 
word CAT. 





Let’s now try the UNPREDICTABLY problem. 


1168, Contents www.EconsPhDTutor.com 


Example 1445. Problem: How many ways permutations are there of the 13 letters in 
the word UNPREDICTABLY? 


Again, let’s rephrase this problem in the framework of the MP. Consider 13 blank spaces: 





b2345 6 7 oe 910111213 


These 13 blanks spaces correspond to 13 decisions to be made. Decision #1: What letter 
to put in the first blank space? Decision #2: What letter to put in the second blank 
space? ... Decision #13: What letter to put in the 13th blank space? 


Again we ask: How many choices have we for each decision? 


First an important note: In the word UNPREDICTABLY, no letter is repeated. (Indeed, 
UNPREDICTABLY is the longest “common” English word without any repeated letters. ) 
For Decision #41, we can put U, N, P, R, E, D, I, C, T, A, B, L, or Y. So we have 13 
choices for Decision #1. 


For Decision #2, having already used up a letter in Decision #1, we are left with 12 
letters. So we have 12 choices for Decision #2. 


For Decision #3, having already used up a letter in Decision #1 and another letter in 
Decision #2, we are left with 11 letters. So we have 11 choices for Decision #3. 


For Decision #13, having already used up a letter in Decision #1, another in Decision 
#2, another in Decision #8, ..., and another in Decision #12, we are left with one letter. 
So we have 1 choice for Decision #13. 


Altogether then, by the MP, there are 13 x 12 x---x 2x 1 = 13! = 6, 227,020, 800 possible 
ways of making our decisions. This is also the number of ways there are to arrange the 
13 letters in the word UNPREDICTABLY. 
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The next fact simply summarises what should already be obvious from the above examples: 


Fact 226. There are n! possible permutations of n distinct objects. 


Here is an informal proof of the above fact.°”? 


Consider n empty spaces. We are to fill them with the n distinct objects. 





123 n 


For space ##1, we have n possible choices. For space ##2, we have n- 1 possible choices 
(because one object was already placed in space #1). ... And finally for space #n, we have 
only 1 object left and thus only 1 choice. By the MP then, there are nx (n-1)x---x1l=n! 
possible ways of filling in these n spaces with the n distinct objects. 


Example 1446. The word COWDUNG has seven distinct letters. Hence, there are 


7! = 5040 permutations of the letters in the word COWDUNG. 








522 This is informal because, amongst other omissions, we haven’t yet given a precise definition of the term 
’ ’ 
per mutation. 
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113.1. Permutations with Repeated Elements 


In the previous section, we saw that there are 3! permutations of the three letters in the 
word CAT and 13! permutations of the 13 letters in the word UNPREDICTABLY. We 
made an important note: In each of these words, there was no repeated letter. 


We now consider permutations of a set where some elements are repeated. 


Example 1447. How many permutations are there of the three letters in the word SEE? 


A naive application of the MP would suggest that the answer is 3! = 6. This is wrong. 
Enumeration shows that there are only 3 possible permutations: 


EES, ESE, SEE. 


To see why a naive application of the MP fails, set up the problem in the framework of 
the MP. Consider 3 blank spaces: 


These 3 blanks spaces correspond to 3 decisions to be made. Decision #1: What letter 
to put in the first blank space? Decision #2: What letter to put in the second blank 
space? Decision #3: What letter to put in the third blank space? 


Again we ask: How many choices have we for each decision? 
For Decision #1, we can put E or 8. So we have 2 choices for Decision #1. 


But now the number of choices available for Decision #2 depends on what we chose 
for Decision #1! (If we chose E in Decision #1, then we again have 2 choices for 
Decision #2. But if instead we chose S in Decision #2, then we now have only 1 choice 
for Decision #2.) This violates the implicit but important assumption in the MP that 
the number of choices available in one decision is independent on the choice made in the 
other decision. Hence, the MP does not directly apply. 


The reason SEE has only 3 possible permutations (instead of 3! = 6) is that it contains a 
repeated element, namely E. But why would this make any difference? 


A 


To understand why, let’s rename the second E as E, so that the word SEE is now trans- 
formed into a new word SEE. From the three letters of this new word, we’d again have 
3! = 6 possible permutations: 


EES, EES, ESE, ESE, SEE, SEE. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Restricting attention to the two letters EE, we see that there are 2! = 2 ways to permute 
these two letters. Hence, any single permutation (in the case where we do not distin- 
guish between the two E’s) corresponds to 2 possible permutations (in the case where 
we do). The figure below illustrates how the 3 permutations of SEE correspond to the 6 
permutations in SEE. 


2.EES E 6.SEE 


Hence, when we do not distinguish between the two E’s, there are only half as many 
possible permutations. 





We next consider permutations of SASS. 


Example 1448. How many permutations are there of the four letters in the word SASS? 
The answer is 4!/3! = 4. Let’s see why. 


If we distinguish between the three S’s, perhaps by calling them 5, S, and S, then we’d 
have 4! = 24 possible permutations of the letters in the word SASS. 


But amongst the three S’s themselves, we have 3! = 6 possible permutations: Sss, Sss, 


number of possible permutations. Working backwards, the word SASS thus has one-sixth 
as many permutations as SASS. That is, SASS has 4!/3! = 4 possible permutations. 


The figure below illustrates how the 4 possible permutations of SASS correspond to the 


24 possible permutations of SASS. 


148585 7.SASS 13.SSAS 19.SSSA 


2.ASSS<—1.ASSS /48.SASS 14.SSAS<« 3.SSAS /420.SSSA 
3.A8S8*/, [pp 3-805% 15. 8S A5*/, /[7p23 885 
eA888e// (00.8 83S 16.85. /n22. 35.8.6 
5.ASSS*/ 2.SASS—11.SASS17.SSAS*¥/ 4.S5SSA—*23.SSSA 
6.ASSS 12.SASS18.SSAS 24.SSSA 
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Example 1449. How many permutations are there of the four letters in the word DEED? 


4! 


Answer: 


In the numerator, the 4! corresponds to the total of 4 letters. In the denominator, the 2! 
corresponds to the 2 D’s and the 2! corresponds to the 2 E’s. Where do these numbers 
come from? 


Let x be the number of permutations of DEED (i.e. x is our desired answer). 


If we distinguish between the two D’s, then we’d increase by 2!-fold the number of possible 
permutations, to x-2!. If, in addition, we distinguish between the 2 E’s, then we’d increase 
again by 2!-fold the number of possible permutations, to x-2!-2!. But we know that if 
all 4 letters are distinct, then there are 4! possible permutations. Therefore, 


Sig A Ae aa 


Rearrangement yields the answer: 


A! 


a a 


Lv 


You can go back and check that this answer is consistent with our answer for Exercise 
458 (above). 





We next consider permutations of ASSESSES. 
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Example 1450. Problem: How many permutations are there of the eight letters in the 
word ASSESSES? 


Answer: 


8! 


In the numerator, the 8! corresponds to the total of 8 letters. In the denominator, the 2! 
corresponds to the 2 E’s and the 5! corresponds to the 5 S’s. Where do these come from? 


Let y be the number of permutations of ASSESSES (i.e. y is our desired answer). 


If we distinguish between the two E’s, then we’d increase by 2!-fold the number of possible 
permutations, to y-2!. If, in addition, we distinguish between the 5 S’s, then we’d increase 
again by 5!-fold the number of possible permutations, to y-2!-5!. But we know that if 
all 8 letters are distinct, then there are 8! possible permutations. Therefore, 


y-2!-5! = 8! 


Rearrangement yields the answer: 
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In general, 


Fact 227. Consider n objects, only k of which are distinct. Let r,, ro,..., and rz be the 
numbers of times the 1st, 2nd, ..., and kth distinct objects appear. (So r1+rg+-+:+rzp =n.) 


Then the number of possible ways to permute these n objects is 


n! 
Tyltol te 





More examples: 


Example 1451. How many permutations are there of the six letters in the word BA- 
NANA? 


We have three distinct letters—B, A, and N. The letter B appears 1 time. The letter A 
appears 3 times. The letter N appears 2 times. Hence, by the above Fact, the number of 
possible permutations of these 6 letters is 


6! 
—— = 60 
1!3!2! 
Of course, 1! is simply equal to 1. So for the denominator, we shall usually not bother to 


write out any 1!. So we will normally instead write that the number of permutations of 
BANANA is 


Example 1452. How many permutations are there of the 11 letters in the word MISSIS- 
SIPPI? 


We have four distinct letters—M, I, 8, and P. The letter M appears 1 time. The letter I 
appears 4 times. The letter S appears 4 times. The letter P appears 2 times. Hence, by 
the above Fact, the number of possible permutations of these 11 letters is 


11! 
qiatal = 34, 650. 


Exercise 464. There are 3 identical white tiles and 4 identical black tiles. How many 
ways are there of arranging these 7 tiles in a row? (Answer on p. 1954.) 
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113.2. Circular Permutations 


Informal Definition. Two circular permutations are equivalent if one can be trans- 
formed into another by means of a rotation. 


Example 1453. There are 3! = 6 (linear) permutations of CAT. That is, there are 3! = 6 
possible ways to fill them into these 3 linearly arranged spaces: 


In contrast, there are only 2! = 2 circular permutations of CAT. That is, there are only 
2! = 2 possible ways to fill them into these 3 circularly arranged spaces: 


Let’s see why there are only 2 circular permutations of CAT. 





(Example continues on the next page ...) 
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... Example continued from the previous page.) 


The three seemingly different arrangements above are considered to be the same circular 
permutation. This is because any arrangement is simply a rotation of another. Take the 
left red arrangement, rotate it clockwise by one-third of a circle to get the middle green 
arrangement. Repeat the rotation to get the right blue arrangement. 


The second and only other circular arrangement of CAT is shown below. Again, these 
three seemingly different arrangements are considered to be the same circular permuta- 
tion. This is because any arrangement is simply a rotation of another. Take the left 
black arrangement, rotate it clockwise by one-third of a circle to get the 

. Repeat the rotation to get the right orange arrangement. 


Note importantly, that the arrangement (or three arrangements) below cannot be rotated 
to get the arrangement (or three arrangements) above. Hence, the arrangement below is 
indeed distinct from the arrangement above. 


It turns out that in general, if we have n distinct objects, there are (n-1)! ways to 
arrange them in a circle. So here there are only (3 — 1)! = 2! = 2 ways to arrange CAT in 
a circle. 





In general: 


Fact 228. n distinct objects have (n-1)! circular permutations. 


Proof. Given n distinct objects, any 1 circular permutation can be rotated n times to obtain 
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n distinct (linear) permutations. Hence, there are n times as many (linear) permutations 
as there are circular permutations. 


But we already know that there are n! (linear) permutations of n distinct objects. Hence, 
there are n!/n = (n-1)! circular permutations of n distinct objects. 














Exercise 465. How many ways are there to seat 10 people in a circle? (Answer on p. 
1954.) 


Note that if there are repeated objects, then the problem is considerably more difficult. See 
Ch. 147.2 (Appendices) for a brief discussion. 
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113.3. Partial Permutations 


Example 1454. Using the 26-letter alphabet, how many 3-letter words can we form that 
have no repeated letters? This, of course, is simply the problem of filling in these 3 empty 
spaces using 26 distinct elements. For space #£1, we have 26 possible choices. For space 
#2, we have 25. And for space #2, we have 24. 


By the MP then, the number of ways to fill the three spaces is 26 x 25 x 24. This is also 
the number of three-letter words with no repeated letters. 





Problems like the above example crop up often enough to motivate a new piece of notation: 


Definition 232. Let n,k be positive integers with n >k. Then P(n,k), read aloud as n 
permute k, is defined by 


| 


P(n,k) = rot 





P(n,k) answers the following question: “Given n distinct objects and k spaces (where 
k <n), how many ways are there to fill the k spaces?” 


Just so you know, P(n,k) is also variously denoted nPk, Pi’, "Px, etc., but we'll stick 
solely with the P(n,k) in this textbook. 


Example 1179 (continued from above). The number of 3-letter words without 
repeated letters is simply P (26,3) = 26!/23! = 26 x 25 x 24. 


Example 1455. Problem: Using the 22-letter Phoenician alphabet, how many 4-letter 
words can we form that have no repeated letters? 


This, of course, is simply the problem of filling in these 4 empty spaces using 22 distinct 
elements. So the answer is P (22,4) = 22!/18! = 22 x 20 x 19 x 18 words. 





Exercise 466. Out of a committee of 11 members, how many ways are there to choose 
a president and a vice-president? (Answer on p. 1954.) 
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113.4. Permutations with Restrictions 


Example 1456. At a dance party, there are 7 heterosexual married couples (and thus 
14 people in total). Problem #1. How many ways are there of arranging them in a 
line, with the restriction that every person is next to his or her partner? 


Think of there as being 7 units (each unit being a couple). There are 7! ways to arrange 
these 7 units in a line. Within each unit, there are 2 possible arrangements. Hence, in 
total, there are 7! x 2° possible arrangements. 


Problem #2. Repeat the above problem, but now for a circle, rather than a line. 


There are 6! ways to arrange the 7 units in a circle. Within each unit, there are 2 possible 
arrangements. Hence, in total, there are 6! x 2’ possible arrangements. 


Problem #3. How many ways are there of arranging them in a circle, with the restric- 
tion that every man is to the right of his wife? 


There are 6! ways to arrange the 7 units in a circle. Within each unit, there is only 1 
possible arrangement. Hence, in total, there are 6! possible arrangements. 


Example 1457. (I assume you’re familiar with the standard 52-card deck.) 
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(Example continues on the next page ... 
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(... Example continued from the previous page.) 


Problem #1. Using a standard 52-card deck, how many ways are there of arranging 
any 3 cards in a line, with the restriction that no two cards of the same suit are neat to 
each other? 


This is the problem of filling in 3 spaces with 52 distinct objects. For space #1, we have 
52 possible choices. 


For space ##2, having picked a card of suit X for space #1, we must pick a card from some 
other suit Y. And so there are only 39 possible choices (we have three suits available— 
that’s 3 x 13 = 39). 


For space #3, having picked a card of suit Y for space #2, we must pick a card from 
some other suit Z. Note that suit Z can be the same as suit X. And so there are 38 
possible choices (we have three suits available, less the card used for space #1—that’s 
3x 13-1 = 38). 

Altogether then, there are 52 x 39 x 38 possible arrangements. 

Problem #2. Repeat the above problem, but now for a circle, rather than a line. 


One subtle thing is that, in addition to space #1 being of a different suit from space #2 
and space #2 being of a different suit from space #3, we must also have that space #3 
is of a different suit from space #1. Thus, there are 52 x 39 x 26 possible ways to fill in 
these three spaces, if they were in a line. 


Since they are instead in a circle, there are 52 x 39 x 26+ 3 possible ways to arrange three 
cards in a circle, with the condition that no two cards of the same suit are next to each 
other. 


Exercise 467. (Answer on p. 1954.) There are 4 brothers and 3 sisters. In how many 
ways can they be arranged ... 
(a) in a line, without any 2 brothers being next to each other? 


b) in a line, without any 2 sisters being next to each other? 


( 
(c) in a circle, without any 2 brothers being next to each other? 
( 


d) in a circle, without any 2 sisters being next to each other? 
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114. How to Count: Combinations 


P(n,k) is the number of ways we can fill & (ordered) spaces using n distinct objects. 


In contrast, C'(n,k) is the number of ways of choosing k out of n distinct objects. Equiva- 
lently, it is the same problem of filling k spaces using n distinct objects, except that now 
order does not matter. 


Example 1458. Suppose we have a committee of 13 members and wish to select a 
president and a vice-president. This is equivalent to the problem of filling in 2 spaces, 
given 13 distinct objects. 


eZ 


The answer is thus simply P (13,2) = 13 x 12. 


Suppose instead that we want to choose two co-presidents. How many ways are there of 
doing so? 


This is simply the same problem as before—again we want to fill in 2 spaces, given 13 
distinct objects. The only difference now is that the order of the 2 chosen objects 
does not matter. So the answer must be that there are P (13,2) /2! ways of choosing 
the two co-presidents. 


Example 1459. How many ways are there of choosing 5 cards out of a standard 52-card 
deck? 


First, how many ways are there to fill 5 spaces using 52 distinct objects (where order 
matters)? Answer: P (52,5) = 52 x 51 x 50 x 49 x 48 = 311, 875, 200. 


And so if we don’t care about order, we must adjust this number by dividing by 5! to get 
P (52,5) /5! = 2,598,960. So the answer is that to choose 5 cards out of a 52-card deck, 
there are 2,598,960 ways. 





The above examples suggest that, in general, to choose k out of n given distinct objects, 
there are P(n,k)/k! possible ways. This motivates the following definition: 
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Definition 233. Let n,k be positive integers with n >k. Then C(n,k), read aloud as n 
choose k, is defined by 


Pink) n! 


CAO eae os eae 





It turns out that C'(n, &) appears so often in maths that it has many alternative notations— 


one of the most common is 


“n choose k” also has several names, such as the combination, the combinatorial 
number, and even the binomial coefficient. Shortly, we'll see why the name binomial 
coefficient makes sense. 


Exercise 468 gives an alternate expression for C(n,k) which you'll often find very useful. 
Exercise 468. (Answer on p. 1956.) Show that 


Rie en See ee) 


Exercise 469. Compute C(4, 2), C(6,4), and C(7,3). (Answer on p. 1956.) 


Exercise 470. We wish to form a basketball team, consisting of 1 centre, 2 forwards, 
and 2 guards. We have available 3 centres, 7 forwards, and 5 guards. How many ways 
are there of forming a team? (Answer on p. 1956.) 





Here’s a nice symmetry property: 


Fact 229. (Symmetry.) C(n,k) =C(n,n-k). 


Proof. Choosing k out of n objects is the same as choosing which n-k out of n objects to 
ignore. 
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Example 1460. We have a group of 100 men. 70 are needed for a task. The number of 
ways to choose these 70 men is 


100! 
C(100, 70) = so 


0!70! 
This is the same as the number of ways to choose the 30 men that will not be used for 


the task: 


100! 


C(100, 30) = an. 
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114.1. Pascal’s Triangle 


Pascal’s Triangle consists of a triangle of numbers. If we adopt the convention that the 
topmost row is row 0 and the leftmost term of each row is the Oth term, then the nth row, 
kth term is the number C(n, k): 


1 
1 1 
1 2 ih 
1 3 3 1 
1 4 6 A 1 
i sy) 10 10 sy) i| 


It turns out that beautifully enough, each term is equal to the sum of the two terms above 
it. The next exercise asks you to verify several instances of this: 


Exercise 471. Verify the following: (a) C(1,0)+C(1, 1) = C(2, 1); (b) C(4, 2)+C(4, 3) = 
C(5,3); (c) C(17, 2) + C(17, 3) = C(18, 3). (Answer on p. 1956.) 


Fact 230. (Pascal’s Rule/Identity/Relation.) C(n+1,k) =C(n,k)+C(n,k-1). 





Proof. C(n+1,k) is the number of ways of choosing & out of n+ 1 distinct objects. 


Suppose we do not choose the last object, ie. the (n+ 1)th object. Then we have to choose 
our k objects out of the first n objects. There are C(n,k) ways of doing so. 


Suppose we do choose the last object. Then we have to choose another k - 1 objects, out 
of the first n objects. There are C(n,k-1) ways of doing so. 


Altogether then, by the Addition Principle, there are C(n,k) +C(n,k-1) ways of choosing 
k; out of n+ 1 distinct objects. 
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114.2. The Combination as Binomial Coefficient 


[L]a mathématique est art de donner le méme nom a des choses différentes. 


[M]athematics is the art of giving the same name to different things. 


— Henri Poincaré (1908, 1914) 


Poincaré’s quote is especially true in combinatorics. In this section, we’ll learn why C'(n, k) 
can be called the combination and also the binomial coefficient. 


Verify for yourself that the following equations are true: 


l+2)=14+2r+ 27, 


( 

( 

( 

(l+2)=14+32+ 327+ 2°, 
(Lea)=1+4¢4 627 +47? 4o*, 
(l+7)=14+52+4 102? +102? + 524+ 2°, 
(1+2)=1+ 624 15x? + 20x + 15x* + 6x° + 2°, 
( 


(4a) 147e 4 2le? £350" 4 357°4 212° 4 Te ee". 


Each of the expressions on the RHS is called a binomial series. Each can also be called 
the binomial expansion of (1+). 


Notice anything interesting? No? Try this exercise: 


moon som) ESCHGHGN GL) = 


pare these to the coefficients of the binomial expansion of (1+). What do you notice? 
(Answer on p. 1957.) 





It turns out that somewhat surprisingly, the coefficients of the binomial expansions of 
n 


n 
1+) are simpl : 
tsz)owemt(*) 


yourself that this is also true for n = 0 through n = 6. 


} ..{" |. As an additional exercise, you should verify for 
n 


There are several ways to explain why the combinatorial numbers also happen to be the 
binomial coefficients. Here we'll give only the combinatorial explanation: 
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Consider (1+). Expanding, we have 
(l+z)=(l+az)(1+az)=1-14+1l-c+az-lt+a-z. 


Consider the 4 terms on the right. 


For 1-1, we “chose” 1 From the two (1+)’s in the 
from the first (1+) and1 — _ product, there is C(2,0) =1 
from the second (1+ 2). way to choose 0 of the x’s. 


For 1-x, we “chose” 1 
from the first (1+ 2) and x 
from the second (1+ 2). From the two (1+ 2)’s in the 
product, there are C(2,1) =2 
For x-1, we “chose” x ways to choose 1 of the x’s. 
from the first (1+ 27) and 1 


from the second (1+). 


Finally, for x-x, we “chose” From the two (1+ )’s in the 
x from the first (1+) and — _ product, there is C(2,2) =1 
x from the second (1+ 2). way to choose 2 of the x’s. 


Altogether then, the coefficient on x° is C(2,0) (“choose 0 of the z’s”), that on x! is C(2, 1) 
(“choose 1 of the x’s”), and that on x? is C(2,1) (“choose 2 of the x’s”). That is, 


(1+2) = C(2,0)x° + C(2,1)2' + C(2,2)2? = 14+ 22427. 


Exercise 473. (Answer on p. 1957.) Mimicking what was just done above, explain why 


(1+ 2) = C(3,0)x° + O(3, Iz! + C(3, 2)x? + C(3, 3)z°. 





More generally, we have 


Fact 231. LetneZ. Then 


(n+y)"=>° 


i=0 
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114.3. The Number of Subsets of a Set is 2” 


By plugging x = 1, y = 1 into the last fact, we see that (1 +1) = 2” is the sum of the terms 
in the nth row of Pascal’s triangle: 


Fact 232. Letne Z*. Then 


(s)he) 


There’s a nice combinatorial interpretation of the above fact (Poincaré’s quote at work 
again). 


Consider the set S = {A,B}. S has 2 = 4 subsets: @ = {}, {A}, {B}, and S = {A,B}. 


Now consider the set T = {A,B,C}. T has 2? = 8 subsets: @ = {}, {A}, {B}, {C}, {A, B}, 
{A,C}, {B,C}, and T = {A, B,C}. 


In general, if a set has n elements, how many subsets does it have? We can couch this in 
the framework of the Multiplication Principle—this is really a sequence of n decisions of 
whether or not to include each element in the subset. There are 2 choices for each decision. 
Thus, there are 2” choices altogether. In other words, using a set of n elements, we can 
form 2" subsets. 





But of course, this must in turn be equal to the sum of the following: 


« C’(n,0) ways to form subsets with 0 elements; 
¢ C(n,1) ways to form subsets with 1 element; 


¢ C'(n,2) ways to form subsets with 2 elements; 


¢ C'(n,n) ways to form subsets with n elements. 


Thus, 


Exercise 474. Verify that 2’ = ( i }-( ‘ }-( : aaa d } (Answer on p. 1957.) 


Exercise 475. Using what you’ve learnt, write down (3+. x)*. (Answer on p. 1958.) 


Exercise 476. (Answer on p. 1958.) (a) The Tan family has 4 sons and the Wong 
family has 3 daughters. Using the sons and daughters from these two families, how many 
ways are there of forming 2 heterosexual couples? 


(b) The Lee family has 6 sons and the Ho family has 9 daughters. Using the sons and 
daughters from these two families, how many ways are there of forming 5 heterosexual 
couples? 
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115. Probability: Introduction 


115.1. Mathematical Modelling 


All models are wrong, but some are useful. 


— George Box (1979). 


Whenever we use maths in a real-world scenario, we have some mathematical model in 
mind. Here’s a very simple example just to illustrate: 


Example 1461. We want to know how much material to purchase, in order to build a 
fence around a field. We might go through these steps: 


. Formulate a mathematical model: Our field is the shape of a rectangle, with length 
100 m and breadth 50 m. 


. Analyse: The rectangle has perimeter 100 + 50 + 100 + 50 = 300 m. 


. Apply the results of our analysis: We need to buy enough material to build a 
300-metre long fence. 





The figure below depicts how mathematical modelling works. 


(1) Formulate. 


>» 







(2) Analyse. 





Starting with some real-world scenario, we go through these steps: 
1. Formulate a mathematical model. 


That is, describe the real-world scenario in mathematical language and concepts. 


This first step is arguably the most important. It is often subjective—not everyone will 
agree that your mathematical model is the most appropriate for the scenario at hand. 


To use the above example, the field may not be a perfect rectangle, so some may object 
to your description of the field as a rectangle. Nonetheless, you may decide that all things 
considered, the rectangle is a good mathematical model. 


2. Analyse the model. 
This involves using maths and the rules of logic. (A-Level maths exams tend to be mostly 


concerned with this second step.) 
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In the above example, this second step simply involved computing the perimeter of the 
rectangle—100 + 50 + 100 + 50 = 300 m. Of course, for the A-Levels, you can expect the 
analysis to be more challenging than this. 


Note that this second step, in contrast to the first, is supposed to be completely watertight, 
non-subjective, and with no room for disagreement. After all, hardly anyone reasonable 
could disagree that a perfect rectangle with length 100 m and breadth 50 m has perimeter 
300 m. 


3. Apply your results. 


Now apply the results of your analysis to the real-world scenario. 


In the above example, pretend you’re a mathematical consultant hired by the fence-builder. 
Then your final report might simply say, “We recommend the purchase of 300 m worth of 
fence material.” 


This third and last step is, like the first, subjective and open to debate. It involves your 
interpretation of what the results of your analysis mean (in the real world) and your rec- 
ommendation of what actions to take. 


For example, you find that the fence will have perimeter 300 m and thus recommend that 
300 m of fence material be purchased. However, someone else, looking at the same result, 
might point out that the corners of the fence require additional or special material; she 
might thus make a slightly different recommendation. 


Secretly, we’ve always been using mathematical modelling; we just haven’t always been 
terribly explicit about it. The foregoing discussion was placed here, because with probability 
and statistical models, we want to be especially clear about that we are doing mathematical 
modelling. 
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115.2. The Experiment as a Model of Scenarios Involving Chance 


Real-world scenarios often involve chance. We can model such scenarios mathemati- 


cally. For this purpose, we’ll use a mathematical object named the experiment, typically 
denoted E.°?8 


An experiment FE = (S,,P) is an ordered triple’** composed of three objects, called the 


sample space S, the event space © (upper-case sigma), and the probability function 
P, where 


e The sample space S is simply the set of possible outcomes. 


e« An event is simply any set of possible outcomes. In turn, the event space © is simply 
the set of all events. 


¢« The probability function P simply assigns to each event some probability between 0 
and 1. This probability is interpreted as the likelihood of that particular event occurring. 


Examples: 





°23 An experiment is often instead called a probability triple or probability space or (probability) 
measure space. 

°24Previously, in the only ordered triples we encountered, the three terms were always simply real numbers. 
Here however, the first two terms are sets and the third is a function. Nonetheless, this is all the same 
an ordered triple, albeit a more complicated one. 
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Example 1462. We model a coin-flip with the experiment EF = (S,%,P). What are the 
sample space S,, the event space /, and the probability function P? 


ce eeiea, 


The sample space is simply the set of possible outcomes. 


The choice of the sample space belongs to Step #1 (Formulate a mathematical model) 
in the process of mathematical modelling. It is subjective and open to disagreement. 


For example, John (another scientist) might argue that the coin sometimes lands exactly 
on its edge. This is exceedingly unlikely but nonetheless possible—one empirical estimate 
is that the US 5-cent coin has probability 1 in 6000 of landing on its edge when flipped 
(source). So John might denote this third possible outcome X and his sample space 
would instead be S = {H,T,X}. 


2. Event space © = {@,{H},{T},{4,T}}. 


An event is simply any subset of S. In other words, an event is simply some set of 
possible outcomes. So here, {1} is an event. So too is {7}. But there are also two other 
events, namely @ = {} (this is the event that never occurs) and S = {H,T} (this is the 
event that always occurs). 


The event space is simply the set of events. In other words, the event space is the set 
of all subsets of S$.°?° 


As we saw in Ch. 114.2, given any finite set S, there are 2!5! possible subsets of S. In 
general, given a finite sample space S, the corresponding event space © always simply con- 
tains 2/5! events. And so here, since there are 2 possible outcomes, there are, altogether, 
2? = 4 possible events. 


If the real-world outcome of the coin flip is Heads, then our interpretation (in terms of 
our model) is that “the events {H} and {H,T} occur”. If the real-world outcome of the 
coin flip is Tails, then our interpretation (in terms of our model) is that “the events {T} 
and {H,T} occur”. 


The event @ never occurs, whatever the real-world outcome is. And the event S = {H,T} 
always occurs, whatever the real-world outcome is. 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


The mathematical modeller is free to select the sample space S she deems most ap- 
propriate. However, once she has selected the sample space S, the event space » is 
automatically determined by the rules of maths. There is no room for interpretation. 
Hence, the selection of the event space © belongs to Step #2 (Analysis) in the process of 
mathematical modelling. 


So likewise, John, who chooses S = {H,T,X} as his sample space, 
has no freedom to choose his event space ». It is automatically Uo = 
{@,{H},{T},{X},{H,T}, {H, X}, {T,X}, S} (consists of 8 elements). 


3. Probability function P: -R. 


The probability function simply assigns to each event a number (between 0 and 1) 
called a probability. So here, if heads and tails are “equally likely” (or the coin is 
“unbiased” or “fair”), then it makes sense to assign 


Pil a0 ne (en ye Pi =05 ied 


The mathematical modeller has no freedom over the domain ©) and codomain R of the 
probability function. However, she does have freedom to choose the mapping rule she 
deems most appropriate. Hence, the act of choosing the mapping rule belongs to Step 
#1 (Formulation) in the process of mathematical modelling. 


So here, if told that heads and tails are “equally likely” (or that the coin is “unbiased” 
or “fair”), the mathematical modeller would naturally choose to assign probability 0.5 to 
each of the events {H} and {T}. 


John, who chooses S = {H,T,X} as his sample space, might instead assign probability 
1/6000 to the event {X} and probability 5999/12000 to each of the events {H} and {T}. 
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Example 1463. A real-world die-roll can be modelled by an experiment E = (S,™,P), 
where 


1. S = {1,2,3,4,5, 6}. 


2. Event space: 

Bt ee Ot 2 ay ai aeoy Oran ae eo ey ooh. 

There are 6 possible outcomes and thus 2° = 64 possible events. The event space, given 
above, is simply the set of all possible events. 


If the real-world outcome of the die roll is 3, then the interpretation (in terms of our 
model) is that the following 32 events occur: {3}, {1,3}, {2,3}, ..., {1,2,3}, {1,3,4}, 
..., 5 = {1,2,3,4,5,6}. (These are simply the events that contain the outcome 3.) 


Similarly, if the real-world outcome of the die roll is 5, then the interpretation is that 32 
events occur. You should be able to list all 32 of these events on your own. 


3. Probability function P: > R. 


If the die is “unbiased” or “fair”, then it makes sense to assign 


PO Seah) ) = Pa) Bie) eae) — . 


What about the other 58 events? It makes sense to assign, for example, P ({1,3,5,6}) = =. 


In general, the mapping rule of the probability function can be fully specified as: For any 
event Ae &, 


A A 
Peay fal 


In words, given any event A, its probability P(A) is simply the number of elements it 
contains, divided by 6. 
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Here’s the formal definition of an experiment: 


Definition 234. An experiment is an ordered triple (5, %,P), where 


S', the sample space, is simply any set (interpreted as the set of possible outcomes in 
a real-world scenario involving chance). 


ui, the event space, is the set of possible events. 
P, the probability function, has domain %, codomain R, and must satisfy the three 
Kolmogorov axioms (to be discussed below in Definition 235). 


Given any event A ¢%, the number P(A) is called the probability of A. 





For the probability function P, the mathematical modeller is free to choose the mapping 
rule she deems most appropriate. The only restriction is that P satisfies three axioms, 
called the Kolmogorov Axioms, to be discussed in the next section. 


Exercise 477. (Answers on pp. 1959, 1960, and 1961.) Consider each of the following 
real-world scenarios. 


(a) You pick, at random, a card from a standard 52-card deck. 
(b) You flip two fair coins. 


(c) You roll two fair dice. 


Model each of the above real-world scenarios as an experiment, by following steps (i) - 


(iii): 


(i) Write down the appropriate sample space S. 


(ii) How many possible events are there? Hence, how many elements does the event space 
“contain? If it is not too tedious, write out © in full. 


(iii) What are the domain and codomain of the probability function P? Write down the 
probabilities of any three events. Given any event A € %, what is P(A)? 


(iv) In each scenario, explain briefly how John, another scientist, might justify choosing 
a different sample space, event space, and probability function. 
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115.3. The Kolmogorov Axioms 


An axiom (or postulate) is a statement that is simply accepted as being true, without 
justification or proof. 


Example 1464. Euclid’s parallel axiom says that “Two non-parallel lines in the plane 
eventually intersect”. Historically, this axiom was accepted as a “self-evident truth”, 
without need for justification or proof. 


However, in the 19th century, mathematicians discovered “non-Euclidean geometries”, in 
which the parallel axiom did not hold. These turned out to have significant implications 
for maths, philosophy, and physics. 





The above example illustrates that an axiom is not an eternal and immutable truth. Instead, 
it is merely a statement that some mathematicians tentatively accept as being true. Having 
listed a bunch of axioms, mathematicians then study their implications. 


In probability theory, we impose three axioms on the probability function. These can be 
thought of as restrictions on what the probability function looks like. Informally: 

1. Probabilities can’t be negative. 

2. The probability of an outcome occurring is 1. 


3. The probability that one of two disjoint events occurs is the sum of the their individual 
probabilities. 





Formally: 


Definition 235. We say that a function P satisfies the three Kolmogorov axioms if: 


1. Non-Negativity Axiom. For any event E ¢ S, we have P(E) > 0. 


2. Normalisation Axiom. P(s) = 1. 


3. Additivity Axiom.°?° Given any two disjoint events E),E, ¢ S, we have 
P(E, U FE) = P (£1) +P (£5). 





In case you’ve forgotten, two sets are disjoint if they have no elements in common. 


1196, Contents www.EconsPhDTutor.com 


115.4. Implications of the Kolmogorov Axioms 


Obviously, P(@) = 0 (the probability that the empty event occurs is 0). Previously, you’ve 
probably taken this and other “obvious” properties for granted. Now we’ll prove that they 
follow from the Kolmogorov axioms. 


Recall that given any set A, its complement A° (sometimes also denoted A’) is defined to 
be “everything else”—more precisely, A‘ is the set of all elements that are not in A. 


Proposition 21. Let P be a probability function and A, B be events. Then P satisfies 
the following properties: 


. Complements. P(A) =1-P(A‘). 
. Probability of Empty Event is Zero. P(@) = 0. 


. Monotonicity. If Bc A, then P(B) < P(A). 
. Probabilities Are At Most One. P(A) <1. 
. Inclusion-Exclusion. P(Au B) = P(A) + P(B) - P(An B). 





You may recognise that the Complements and the Inclusion-Exclusion properties are anal- 
ogous to the CP and IEP from counting. 


Proof. 1. Complements. By definition, An A‘ are disjoint. And so by the Additivity 
Axiom, P(A) + P(AS) = P(AU A’). 


Also by definition, AU A®=S. And so P(Au A‘) = P(s). 
By the Normalisation Axiom, P(s) = 1. 


Altogether then, P(A) + P(A‘) = P(AU A‘) = P(s) =1. Rearranging, P(A) =1-P(A’°), as 
desired. 


The remainder of the proof is continued on p. 1730 (Appendices). O 


Venn diagrams are helpful for illustrating probabilities. Those below help to illustrate the 
four of the above five properties. 
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P(A‘) = 1 — P(A). (Complements.) 


©’ 
P(A) <1. 


P(B) = P(A): Ge ‘i 
(Monotonicity.) Os e. 


P(A UB) = P(A) + P(B) — P(ANB). 
(Inclusion-Exclusion.) 


Exercise 478. Prove each of the following properties and illustrate with a Venn diagram: 
(a) “If two events A and B are mutually exclusive, then P(An B) = 0.” (b) “Let A, B, 
and C’ be events. Then P(AU BUC) = P(A) +P(A°N B)+P(A°nN BonC).” (Answer on 
p. 1962.) 
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116. Probability: Conditional Probability 


Example 1465. Flip three fair coins. Model this as an experiment E = (S,%,P), where 





e The sample space is S={HHH,HHT,HTH,HTT,THH,THT,TTH,TTT}. 
¢ The event space } has 2° = 256 elements. 


e The probability function P : i> R has mapping rule: 





P(A AA) =P(B AT) ==]PCTT)= 


1 
7 


A 

and more generally, for any event A¢ X, P(E) = — ; 

Problem: Suppose there is at least 1 tail. Find the probability that there are at least 2 
heads. 


There are 7 possible outcomes where there is at least 1 tail: HHT, ATH, ATT, THA, 
THT, TTH, and TTT. Each is equally likely to occur. Of these, 3 outcomes involve 
at least 2 heads (HHT, HTH, and THH). Thus, given there is at least 1 tail, the 
probability that there are at least 2 heads is simply 3/7. 





The above analysis was somewhat informal. Here is a more formal analysis. 
Let A be the event that there are at least 2 heads: A= {HHT,HTH,THH,HHH}. 


Let B be the event that there is at least 1 tail: B 
(oo A Pe Be Pe eae 


AnB is thus the event that there are at least 2 heads and 1 tail: AnB 
VHA APE Ta 


Our problem is equivalent to finding P(A|B)—the conditional probability of A given 
B, which is given by 











P(AnB) 3/8 3 


pee P(B) 7/8 7 
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Example 1466. Let P be a probability function and A, B € © be events. 


e P(A) =0.5 (the probability that A occurs is 0.5). 
¢ P(B) =0.6 (the probability that B occurs is 0.6). 
¢ P(AnB) =0.2 (the probability that both A and B occur is 0.2). 


Hence, given that B has occurred, the probability that A has also occurred is simply 
0.2/0.6 = 1/3. (The information that P(A) = 0.5 is irrelevant.) Formally: 


P(AnB) | 
P(B) 


P(A|B) = 





The foregoing examples motivate the following definition: 


Definition 236. Let P be a probability function and A,B € % be events. Then the 
conditional probability of A given B is denoted P(A|B) and is defined by 


So i) 


P(B) 
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116.1. The Conditional Probability Fallacy (CPF) 


Definition 237. The conditional probability fallacy (CPF) is the mistaken belief that 


P (A|B) = P (BIA) 





is always true. 
Informally, the CPF is the fallacy of leaping from 
“If A, then probably B” to “Since B, then probably A.” 
But in general, it is not true that P (A|B) = P (BIA). Instead: 


Fact 233. (a) If P(A) < P(B), then P(A|B) < P(BIA). 
(b) If P(A) > P(B), then P(A|B) > P(BIA). 


(c) If P(A) = P(B), then P(A|B) = P(BIA). 





Proof. By definition, P (A|B) = oS and P (B|A) = P Sa 
Thus, P (A|B) = am (BA). And so, 


P(A) < P(B) == P(A|B) < P(BIA), 
P(A) > P(B) = P(A|B) > P(BIA), 
P(A) = P(B) => P(A|B) = P(BIA). 














The CPF is also known as the confusion of the inverse or the inverse fallacy. In 
different contexts, it is also known variously as the base-rate fallacy, false-positive 
fallacy, or prosecutor’s fallacy. 
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Example 1467. Suppose the following statement is true: “If Mary has Ebola, then Mary 
will probably vomit today.” Formally, we might write P (Vomit|Ebola) = 0.99. 


Mary vomits today. One might then reason, “Since P (Vomit|Ebola) = 0.99, by the CPF, 
we also have P (Ebola|Vomit) = 0.99. Thus, Mary probably has Ebola.” 


Formally, this reasoning is flawed because P(Vomit) is probably much larger than 
P(Ebola). Thus, P (Vomit|Ebola) is probably much larger than P (Ebola|Vomit). 


Informally, the reasoning is flawed because: 


e Ebola is extremely rare, so it is extremely unlikely that Mary has Ebola in the first 
place. 


e Besides Ebola, there are many other alternative explanations for why Mary might have 
vomitted. For example, she might have had motion sickness or food poisoning. 


Example 1468. Sally buys a 4D ticket every week. One day, she wins the first prize. 
To her astonishment, she wins the first prize again the following week. 


Her jealous cousin Ah Kow makes a police report, based on the following reasoning: 


“Without cheating, the probability that Sally wins the first prize two weeks in a row is 
1 in 100 million. Given that she did win first prize two weeks in a row, the probability 
that she didn’t cheat must likewise be 1 in 100 million. In other words, there is almost 
no chance that Sally didn’t cheat.” 


Let’s rephrase Ah Kow’s reasoning more formally. Let A and B be the events “Sally 
wins the first prize two weeks in a row” and “Sally didn’t cheat”, respectively. We 
know that P(A|B) = 0.00000001. By the CPF, we have P(A|B) = P(B|A). Hence, 
P (B|A) = 0.00000001. Equivalently, there is probability 0.99999999 that Sally cheated. 


Formally, this reasoning is flawed because P(B) is probably much larger than P(A). 
Thus, P (B |) is probably much larger than P (4|B ). 


Informally, the reasoning is flawed because: 


e Cheating in 4D is extremely rare (and difficult), so it is extremely unlikely that Sally 
cheated in the first place. 


¢ Besides cheating, there are many other alternative explanations for why there exists 
an individual who won first prize two weeks in a row. 


One important alternative explanation is that so many individuals buy 4D tickets regu- 
larly that there will invariably be someone as lucky as Sally. Suppose that only 100, 000 
Singaporeans (less than 2% of Singapore’s population) buy one 4D number every week. 
Then we’d expect that about once every 20 years, one of these 100,000 Singaporeans 
will have the fortune of winning the first prize on consecutive weeks. Rare, but hardly 
impossible. 
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The next example uses concrete numbers to illustrate how large the discrepancy between 
P(A|B) and P(B|A) can be. 


Example 1469. A randomly chosen person is given a free smallpox screening. We know 
that 1 out of every 1,000,000 people has smallpox. The test is very accurate: If you have 
smallpox, it correctly tells you so 99% of the time. (Equivalently, it gives a false negative 
only 1% of the time.) And if you don’t have smallpox, it also correctly tells us so 99% of 
the time. (Equivalently, it gives a false positive only 1% of the time.) 


Formally, let S, +, and - denote the events “the randomly chosen person has smallpox”, 
“the test returns positive”, and “the test returns negative”. ‘Then 


il 999999 
Ps) = ——_— P(S°) = 
ce) 1000000’ (S°) 1000000’ 


P (+|S)) = 0.99, P(-|S) = 0.01, 


P(-|S°) = 0.99, P(+|S°) = 0.01. 


The test result returns positive (i.e. it says that the randomly chosen person has small- 
pox). What is the probability that this person actually has smallpox? 


In words, it is easy to confuse “the probability of a positive test result conditional on 
having smallpox” with “the probability of having smallpox conditional on a positive 
test result”. Formally, this is the CPF. One starts with P(+|S') = 0.99 and confusedly 


concludes that P (s +) = 0.99—this person almost certainly has smallpox. 


In fact, as we now show, despite testing positive, the person is very a to have 


smallpox. The correct answer is P (s + +) x ! In the steps below, each = simply 


10, 000 
uses the definition of conditional probability (Definition 236): 


Gh aoe ee ee 
P(+) P(+) P(+nS)+P(+nS°) 


: P(S)P (4S) 
P(S)P(+|S)+P(S°) P (+|S°) 





ae 
TONDO = 0.00009899029 x 


1 nqq. 29999 Vat 
TRI  e gecnenn nOe 10, 000 


This example illustrates how far off the CPF can lead one astray. 





Now an actual, real-world example: 
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Example 1470. The British mother who murdered her two babies. In 1996, 
Sally Clark’s first-born died suddenly within a few weeks of birth. In 1998, the same 
happened to her second child. Clark was then arrested on suspicion of murdering her 
babies. 


At her trial, an “expert” witness claimed that in an affluent, non-smoking family such 
as Sally Clark’s, the probability of an infant suddenly dying with no explanation was 
1/8543. Hence, he concluded, the probability of two sudden infant deaths in the same 


family was (1/8543)° or approximately 1 in 73 million. 


The “expert” then committed the CPF. He argued that since 


1 


P (Two babies suddenly die|Mother did not murder babies) = 73 000.000" 


it therefore follows that 


1 


P (Mother did not murder babies|Two babies suddenly die) = anal, 


This erroneous reasoning led to Sally Clark being convicted for murdering her two babies. 


(Some of you may have noticed that the “expert” actually also made another mistake. 
But we'll examine this only in the next chapter.) 





It turns out that not only laypersons and court prosecutors commit the CPF. As we’ll see 
later, even academic researchers also often commit the CPF, when it comes to interpreting 
the results of a null hypothesis significance test (Chapter 131). 


Exercise 480. (Answer on p. 1963.) At a murder scene, a sample of a blood stain is 
collected. Its DNA is analysed and compared to a database of DNA profiles. A match 
with one John Brown is found. Say there is only a 1 in 10 million chance that two random 
individuals have a DNA match. 


Does this mean that there is probability 1 in 10 million that the DNA match with John 
Brown is merely a coincidence, and thus a near-certainty that the blood stain is really 
his? Explain why or why not, with reference to the following conditional probabilities: 


P (Blood stain is not John Brown’s|DNA match) , 


P (Blood stain is not John Brown’s|DNA match). 
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116.2. Two-Boys Problem (Fun, Optional) 
This is a famous puzzle, first popularised by Martin Gardner in 1959. 
Example 1471. Consider all the families in the world that have two children, of whom 


at least one is a boy. Randomly pick one of these families. What is the probability that 
both children in this family are boys? 


Think about it (set aside this book) before reading the answer below. 


We already know that one child is a boy. So intuition might suggest that “obviously”, 
P (Both boys) = P (The other child is a boy) = 0.5. 


Intuition would be wrong. Intuition goes astray by failing to recognise that there are three equally likely 
ways that a family with two children can have at least one boy: BB, BG, or GB. The answer is in fact 
1 


P(BBn "At least one boy”) _ P(BB) 


P(BB\At | = = 
CB BIS leer CBe BO) P(At least one boy) P(At least one boy) 





= P(BB) a 
~ P(BB)+P(BG)+P(GB) © 








In 2010, the following variant of the above Martin Gardner problem was presented. 
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Example 1472. Consider all the families in the world that have two children, of whom 
at least one is a boy born on a Tuesday. Randomly pick one of these families. What is 
the probability that both children in this family are boys? 


Those familiar with the previous problem might think, “Well, this is exactly the same as 
the two-boys problem, except with an obviously irrelevant bit of information about the 
boy being born on a Tuesday. So the answer must be the same as before: 1/3.” 


It turns out though that, surprisingly, the Tuesday bit of information makes a big differ- 
ence. The answer is 13/27 = 0.481. This is much closer to 0.5 than to 1/3! 


Consider all the “two-child, at-least-one-boy-born-on-a-Tuesday” families in the world. 
The four mutually exclusive possibilities are 


Child #1 Child #2 Probability 


1 


BrB Boy born on Tuesday — Boy (born on any day) P(BrB) = ai 


1 


BrG Boy born on Tuesday Girl P (BrG) = a 


ee 
14 14 196 





By Br Boy not born on Tuesday Boy born on Tuesday P(ByBr) = 


eet 7 
GBr Girl Boy born on Tuesday P(GBr) = ls Oe 


Altogether then, amongst two-child families with at least one boy born on a Tuesday, the 
proportion that have two boys is 


P(BB|”At least one Tuesday boy”) 


P (Both boys, at least one of whom born on Tuesday) 
P (At least one Tuesday boy) 


7 iP (BrB) aol (By Br) 
~ P(BrB)+P(BrG)+P(ByBr) +P(GBr) 
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117. Probability: Independence 


Informally, two events A and B are independent if the probability that both occur is 
simply the product of the probabilities that each occurs. Independence is thus analogous 
to the MP from counting. Formally: 


Definition 238. Two events A, B¢ are independent if 


P(An B) = P(A)P(B). 





There is a second, equivalent perspective of independence. Informally, two events A and B 
are independent if the probability that A occurs is independent of whether B has occurred. 
Formally: 


Fact 234. Suppose P(B) #0. Then A,B are independent events <= > P(A|B) = P(A). 


Proof. By definition of conditional probabilities, P(A|B) = P(An B)/P(B). By definition 
of independence, P(An B) 2 P(A)P(B). Plugging 2 into +, we have P(A|B) = P(A), as 
desired. 
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Example 1473. Flip two fair coins. Model this with the usual experiment, where 





« S={HH AT TH TE}. 
¢ Y contains 2+ = 16 elements, and 
«PHRAD=aPRAT aPQl Harri) = (iA. 








Let H; be the event that the first coin flip is Heads—that is, H, = {HH,HT}. Analo- 
gously define 7, Ho, and 7». 


The intuitive idea of independence is easy to grasp. If we say that the two coin flips are 
independent, what we mean is that the following four conditions are true: 


. H, and Hp are independent. (The probability that the second flip is heads is independent 
of whether the first flip is heads.) 


. H, and T> are independent. (The probability that the second flip is tails is independent 
of whether the first flip is heads.) 


. T, and Hp are independent. (The probability that the second flip is heads is independent 
of whether the first flip is tails.) 


. T, and JT) are independent. (The probability that the second flip is tails is independent 
of whether the first flip is tails.) 


Formally: 
I PCA os) = PCA) =P CAP Ao) = POR A AP PCR Fal) = 0.6205 = 
5. 











0. 
2. P(A, nT)) = P({HT}) = P(A1) P (Th) = PHA, HT})P({HT, TT}) = 0.5-0.5 = 0.25. 
3. P (Tn Ho) = P({TH}) = P(T,) P (He) = PTH, TT})P({HH, TH}) = 0.5-0.5 = 0.25. 
4. P(T, Ty) = PUTT}) = P (Ty) P(T)) = PU{TH, TT})P({HT, TT}) = 0.5-0.5 = 0.25. 





Example 1474. Flip a fair coin and roll a fair die. This can be modelled by an experi- 
ment, where 


Sat AAs nae HO 2. eS, eas Pa, Oy 
¢ » consists of 2!” events. 
e P(A) =|A|/12, for any event A¢ &. 








Now consider the event “Heads” E, = {H1,H2, H3,H4,H5, H6}, and the event “Roll an 
odd number” E> = {H1, H3, H3,T1,73,75}. These two events E, and EF», are indepen- 
dent, as we now verify: 





p(s) = 2B) 22.2 pee), 


More broadly, we can even say that the coin flip and die roll are independent. Informally, 
this means that the outcome of the coin flip has no influence on the outcome of the die 
roll, and vice versa. 





The idea of independence is a little tricky to illustrate on a Venn diagram. I'll try anyway. 
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Example 1475. The Venn diagram below illustrates a sample space with 100 equally 
likely outcomes (represented by 100 small squares). The event A is highlighted in red. 
The event B is highlighted in blue. 


P(A) =0.2 (A is made of 20 small squares). P(B) = 0.1 (B is made of 10 small squares). 
The event An B, coloured in green, is made of 2 small squares, so P(An B) = 0.02. 


We compute 


P(AnB) 0.02 _ 
P(B) ~~ 0.1. 


P(A|B) = ne, 


We observe that P(A) = 0.2 = P(A|B). And so by Fact 234, we conclude that the events 
A and B are independent. 
























































Exercise 481. Symmetry of Independence. In Fact 234, we showed that “A,B 
independent <=» P(A|B) = P(A)”. Now prove that “A, B are independent events <> 
P(B\A) = P(B).” (Answer on p. 1964.) 


Exercise 482. (Answer on p. 1964.) An example of a transitive relation is equality: 
If A= Band B=C, then A=C. Another example is <: If A< Band B< C, then A<C. 


In contrast, independence is not transitive, as this exercise will demonstrate. That 
is, even if A and B are independent, and B and C' are independent, it may not be that 
A and C are also independent. 


Flip two fair coins. Let H; be the event that the first coin flip is heads, H> be the event 
that the second is heads, and 7; be the event that the first flip is tails. Show that 


(a) H, and Hy are independent. 
(b) Hy and T, are independent. 
(c) H, and T; are not independent. 
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117.1. Warning: Not Everything is Independent 


The idea of independence is intuitively easy to grasp. Indeed, so much so that students 
often assume that “everything is independent”. This is a mistake. Unless you’re explicitly 
told, NEVER assume that two events are independent. 


Here are two examples where the assumption of independence is plausible: 


Example 1476. The event “coin-flip #1 is heads” and the event “coin-flip #2 is heads” 
are probably independent. 


Example 1477. The event “die-roll #1 is 3” and the event “die-roll #2 is 6” are probably 
independent. 





Here are two examples where the assumption of independence is not plausible: 


Example 1478. The event “Google’s share price rises today” is probably not independent 
of the event “Apple’s share price rises today”. 


Example 1479. The event “it rains in Singapore today” is probably not independent of 
the event “it rains in Kuala Lumpur today”. 





Nonetheless, the assumption of independence is frequently—and incorrectly—made even 
when it is implausible. One reason is that the maths is easy if we assume independence— 
we can simply multiply probabilities together. 


We now revisit the Sally Clark case. Previously, we saw that the court’s “expert” witness 
committed the CPF. Now, we’ll see that he also made a second mistake—that of assuming 
independence. 


Example 1480. The “expert” witness claimed that in an affluent, non-smoking family 
such as Sally Clark’s, the probability of an infant suddenly dying with no explanation 
was 1/8543. Hence, he concluded, the probability of two sudden infant deaths in the 


same family was (1/8543) or approximately 1 in 73 million. 
Can you spot the error in the reasoning? 


By simply multiplying together probabilities, the “expert” implicitly assumed that the two 
events—“sudden death of baby #1” and “sudden death of baby #2”—are independent. 
But as any doctor will tell you, if your family has a history of heart attack, diabetes, or 


pretty much any other ailment, then you may be at higher risk (than the average person) 
of suffering the same. 


And so, it may well be that in any given year, a random person has probability 0.001 
of dying of a heart attack. It does not however follow that in any given year, a random 
family has probability 0.0017 = 0.000001 of two deaths by heart attack. 


Similarly, it may be that if one baby in a family has already suddenly died, a second baby 
is at higher risk (than the average baby) of suddenly dying. 
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117.2. Probability: Independence of Multiple Events 


Definition 239. Let P be a probability function and A, B,C’ € = be events. 


A,B,C are pairwise independent if all three of the following conditions are true: 


P(An B) = P(A)P(B), 
P(BaAC) = P(B)P(C), 
P(AnC) =P(A)P(C). 


A, B,C are independent if in addition to the above three conditions being true, it is also 
true that 


P(An BnC) = P(A)P(B)P(C). 





It is tempting to believe that pairwise independence implies independence. That is, if the 
first three conditions listed above are true, then so is the fourth. Alas, this is false, as the 
next exercise demonstrates: 


Exercise 484. (Pairwise independence does not imply independence.) (Answer 
on p. 1964.) 


Flip two fair coins. Let H; be the event that the first coin flip is heads, Ty be the event 
that the second is tails, and X be the event that the two coin flips are different. Show 
that 


(a) These three events are pairwise independent. 





(b) These three events are not independent. 
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118. Fun Probability Puzzles 


118.1. The Monty Hall Problem 


The Monty Hall Problem is probably the world’s most famous probability puzzle. It takes 
less than a minute to state. Yet its counter-intuitive answer confuses nearly everyone. 


You’re at a gameshow. There are three boxes, labelled #1, #2, and #3. One box contains 
one year’s worth of a Singapore minister’s salary. The other two are empty. 


You are asked to pick one box (but you are not allowed to open it yet). 


The host, who knows where the minister’s salary is, opens one of the other two boxes, to 
reveal that it is empty. Important: The host is not allowed to open the box that contains 
the minister’s salary; he must always open a box that is empty. 


You’re now given a choice: Stay (with your original choice) or switch (to the other unopened 
box). What should you do? 


To illustrate: 


Example 1481. Say you pick Box #2. The host then opens an empty Box #1. You’re 
now given a choice: Stay (with Box #2) or switch (to Box #3). Which do you choose? 


Your original choice Should you switeh? 





Take as long as you need to think about this problem, before turning to the 
next page for the answer. 
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A magazine columnist named Marilyn vos Savant®2’ gave the correct answer: 


Yes; you should switch. The first door has a 1/3 chance 
of winning, but the second door has a 2/3 chance. 


Here are two informal explanations: 


1. The probability that the minister’s salary is in the box you picked is 1/3. The probability 
that the minister’s salary is in either of the other two boxes is 2/3. Of the other two boxes, 
the gameshow host (who knows where the salary is) helps you eliminate one of them. So 
the remaining unopened box still has probability 2/3 of containing the minister’s salary. 


2. Imagine instead that there are 100 boxes, of which one contains the minister’s salary 
and the others are empty. You pick one. Of the remaining 99, the gameshow host opens 
98. You are again given the choice: Should you stay or switch? In this more extreme 
version of the game, it is perhaps more obvious that your originally picked box has only 
probability 1/100 of containing the minister’s salary, while the only other unopened box 
has probability 99/100 of the same. Therefore, you should switch. 


Here’s a more formal explanation using the method of enumeration: 


3. Say you originally pick Box #1. There are three possible cases, each occurring with 
probability 1/3: 





Case Box #1 Box #2 Box #3 Host opens 
A | Minister’s salary Empty Empty Box #2 or Box #3 
B Empty Minister’s salary Empty Box #3 
C Empty Empty Minister’s salary Box #2 





Not switching wins you the minister’s salary only in Case A (1/3 probability). 
Switching wins you the minister’s salary in Cases B and C (2/3 probability). 





°27Marilyn vos Savant was, briefly, on the Guinness Book of Records as the person with the world’s highest 
IQ, until Guinness retired this category because IQ tests were considered to be too unreliable. 
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Even with the above explanations, some of you may remain unconvinced. Don’t worry, you 
are not alone. After Marilyn’s initial response, 10,000 readers sent in letters telling her she 
was wrong. Some were from Professors of Mathematics and PhDs. A few examples:°?° 


As a professional mathematician, I’m very concerned with the general 


public’s lack of mathematical skills. Please help by confessing your error 
and in the future being more careful. 


There is enough mathematical illiteracy in this country, and we don’t need 
the world’s highest IQ propagating more. Shame! 


Maybe women look at math problems differently than men. 


Unfortunately for the above letter writers, Marilyn was correct and they were wrong. 


The best way to convince the sceptical is through simulations—try this Google spreadsheet. 
Or if you don’t trust computers, do an actual experiment: 








528Vou can read more of these letters at her website. 
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118.2. The Birthday Problem 


Example 1482. (The birthday problem.) What is the smallest number n of people 
in a room, such that it is more likely than not, that at least 2 people in the room share 
the same birthday?” 


Fix person #1’s birthday. Then 


The probability that person #2’s birthday is different (from person #1) is 364/365. 


The probability that person #3’s birthday is different (from persons #1 and #2) is 
363/365. 


The probability that person #4’s birthday is different (from persons #1, #2, and #3) 
is 362/365. 


The probability that person #£n’s birthday is different (from persons #1 through #n-1) 


is (366 — n)/365. 


Altogether, the probability that no 2 persons share the same birthday is 


364 363 362 366 -—n 
—— xX — xX —x:::x 
365 3865 365 365 


Hence, the probability that at least 2 persons share the same birthday is 


364 363 362 366 -— n 
= x x See x : 
365 365 365 365 





The smallest integer n for which the above probability is at least 0.5 is 23. * That is, 
perhaps surprisingly, with just 23 people, it is more likely than not that at least 2 persons 
share a birthday. 
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119. Random Variables: Introduction 


Informally, a random variable is a function that assigns a real number (you can think of 
this as a “numerical code”) to each possible outcome s. We call any such real number an 
observed value of X. 


Example 1483. Model a fair coin-flip with the usual experiment F = (S,,P), where 


ean. 
o Sa{C, {ah Ar} sh: 
« P:R is defined by P(@) =0, P({H}) = P({H}) =0.5, and P(s) =1. 





Let X : S > R be the random variable that indicates whether the coin-flip is heads. 
That is, the observed value of X is X(H) =1 if the outcome is heads and_X (t) = 0 if 
the outcome is tails. 





Formally: 


Definition 240. Let E = (S,%,P) be an experiment. A random variable X (on the 
experiment F) is any function with domain S and codomain R. 


Given any random variable X and any outcome s € S, we call X (s) the observed (or 
realised) value of the random variable X. We often denote a generic observed value 
X (s) by the lower-case letter x. 
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119.1. A Random Variable vs. Its Observed Values 


Students often confuse a random variable with an observed value of the random variable. 
This confusion is, of course, simply the confusion between a function and the value taken 
by the function. 











Example 1483 (continued from above). X is a function with domain S and codomain 
R. X is therefore a random variable. 


If the outcome of the coin-flip is heads, we do not say that X is 1. Instead, we say that 
the observed value of X is 1. 


If the outcome of the coin-flip is tails, we do not say that X is 0. Instead, we say that 
the observed value of X is 0. 





Remember: A random variable X is a function that can take on many possible real 
number values. Each such value x = X (s) is called an observed value of X. 
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119.2. X =k Denotes the Event {s¢S: X(s) =k} 


Definition 241. Given a random variable X : S > R, the notation “X =k” denotes the 





event {se S: X(s) =k}. 


The notation “X >k”, “X >k”, “X <k”, “X <k”, “a< X <b”, etc. are similarly defined. 
Example 1483 (continued from above). X(H)=1 and X (t) =0. So we can write 


X =1 denotes the event {se S:X(s)=1}={H}, 


X =0 denotes the event {seS:X(s)=0}={T}. 
Moreover, P ({H}) =0.5 and P({T}) =0.5. So we can also write 
P(X =1)=0.5 and P(X=0)=0.5. 


Now let’s try some other arbitrary number like 13.71. Notice there is no outcome s such 
ihat X (s)= 13.71. Thus, 


X = 13.71 denotes the event {s¢S:X(s)=13.71}=@, and P(X =13.71) =0. 
Indeed, for any k #0,1, there is no outcome s such that X (s) =k. Thus, 


X =k denotes the event {se S:X(s)=k}=@, and P(X =k)=0. 


Since P (@) = 0, we also have P(X =k) = P(@) =0, for any k # 0,1. 


Define Y : S > R by Y(#) = 15.5, Y (t) = 15.5. Y is an example of a constant random 
variable. We may write 


Y =15.5 denotes the event {se S:X(s)=15.5}={H,T}, and P(X =15.5)=1. 
Moreover, for any k # 15.5, 


Y =k denotes the event {seS:X(s)=k}=@, and P(Y =k) =0. 
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119.3. The Probability Distribution of a Random Variable 


We call a complete specification of P (X = k) for all values of k the probability distribu- 
tion (or probability law or probability mass function) of X. In the above example, 
we gave the probability distributions of both X and Y. 


More examples of random variables and their probability distributions: 


Example 1484. Flip two fair coins. Model this with the usual experiment, where S = 
ebskesw us Mae 


Let X : S > R indicate whether the two coin flips are the same and Y : S > R count the 
number of heads. That is, 


X(HH)=1, X(HT) =0, X(TH)=0, X(TT) =1, 


Y(HH) =2, Y(HT) =1, Y(TH)=1, Y(TT) =0. 


PX =0)=0.5, POX Sl] 0:5, and PX =—) =0 ter any A= 0), I, 


P(Y =0) =0.25, P(Y =1)=0.5, P(Y =2) =0.25, and P(X =k) =0, for any k #0, 1,2. 





Another example: 
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Example 1485. Pick a random card from the standard 52-card deck. Model this with 
the usual experiment, where 


S = {A@, K4,,...,24, AV, KY¥,...,24,A%, K%,...,20, Ad Kee... 2h}. 
X:S—>R is the High Card Point count (used in the game of bridge). Le., 


X(A of any suit) = 4, X(K of any suit)=3, %(Q of any suit) = 2, 
X(J of any suit)=1, |X (Any other card) = 0. 


Thus, 


36 4 4 
P(X =0)= =, P(X =1)= 55, P(X =2)= 57, 


4 4 
P(X=3)=55, P(X=4)=—, P(X =k)=0, 


for any k= U0, 1,2,3,4, 
Y : S > R indicates whether the picked card is a spade (4). Le., 


Y(Any @)=1, Y(Any other card) = 0. 


39 
P(Y =0) == 


13 
5 P(Y =1)= 55, PY =k) =O ter any «+0, 1 
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Example 1486. Roll two fair dice. Model this with the usual experiment, where 


g- Gere) 4 
oo eer eric ae 


X:5—R is the sum of the two dice. And so for example, 


«(2-7 er «(8 Jes 


The table below says that P (X = 2) = 1/36, because there is only one way the event X = 2 
can occur. And P(X = 3) = 2/36, because there are two ways the event X = 3 can occur. 


You are asked to complete the table in the next exercise. 


ks s such that X (s)=k P(X =k) 
if 
36 
2 
36 








Exercise 485. (Continuation of the above example.) (Answer on p. 1965.) (a) Complete 
the above table. 


Consider the event E, described in words as “the sum of the two dice is at least 10”. 
(b) Write down the event F in terms of X. 
(c) Calculate P(E). 
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119.4. Random Variables Are Simply Functions 


Example 1486 (continued from above). Continue with the same the roll-two-fair- 
dice example, with X again being the random variable that is the sum of the two dice. 


We had 
XX a =f ee = a 
G) 


Let Y :S > R be the product of the two dice. And so for example, 


(SB) a ¥(B)-s 


Remember: random variables are simply functions. And thus, we can manipulate random 
variables just like we manipulate any functions. 


So for example, consider the function X + Y: 5 > R. It is also a random variable. We 
have 


0 @ 
aon Ba and oon(8} <0 


Similarly, consider the function XY: S > R. It is also a random variable. We have 


fs) GC) \ 
on(B]-m and x9)(& Jao 


Finally, consider the function 4X -5Y :S >R. It is also a random variable. We have 


O a 
ax-sr) g)=-2 and ax-sr)(#)-0 
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Exercise 486. Continue with the above roll-two-fair-dice example. Let P: S > R be the 
greater of the two dice. Let @ : S > R be the difference of the two dice. Evaluate the 


funewions), ©, and O at and = . (Answer on p. 1966.) 


Exercise 487. (Answer on p. 1966.) Model a fair die-roll with the usual experiment 
EF ={S,%,P}. Define the function X : S > R by the mapping rule X (1) = 1, X (2) = 2, 
Ni(G)=e, 4 (4) 24.5 (9) =5 and X(6) =6. 

Is X a random variable on E’ Why or why not? 


If X is indeed a random variable on E, then write down also P(X =k), for all possible k. 
Exercise 488. For each of the following real-world scenarios, write down, in precise 
mathematical notation (i) the experiment F = {S,,P}; (ii) what the random variable 
X is; and (iii) P(X =k), for all possible k. (Answers on pp. 1966 and 1967.) 


(a) Flip 4 (fair) coins. Let the random variable X be a count of the number of heads. 
(b) Roll 3 (fair) dice. Let the random variable X be the sum of the three dice. (Tedious. ) 
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120. Random Variables: Independence 


Definition 242. Given random variables X : S > R and Y: S > R, the notation 
“X =2,Y =y” denotes the event {se S:X(s)=2,Y (s) =y}. 


Example 1487. Flip two fair coins. Model this with the usual experiment where S' = 
AelEH Oriel a legates 


Let X : S > R indicate whether the two coin flips were the same and Y : S > R count 
the number of heads. That is, 


MHA a1, > XAT) 0,7 XT) a0 XT) Hd, 
and Y (HA y=2, ¥ CHE) =, V(TH) =, YET) 20, 
Then X =0,Y =0 is the event that the two coin flips were not the same AND the number 
of heads was 0. By observation, this event is the empty set. Thus, P(X =0,Y =0) = 
IE (2) 10), 


X =1,Y =0 is the event that the two coin flips were the same AND the number of heads 
was 0. By observation, this event is {TT}. Thus, P(X =1,Y =0)=P({TT}) =0.25. 


Exercise: Verify for yourself that 





P(X =0,Y =1)=0.5, P(X=1,Y=1)=0, 


P(X =0,Y=2)=0, P(X =1,Y =2) =0.25. 





1225, Contents www.EconsPhDTutor.com 


Informally, two random variables are independent if knowing the value of one does not 
tell us anything about the value of the other. 


Example 1487 (continued from above). Flip two fair coins. We say the two coin-flips 
are independent. Informally, the outcome of one doesn’t affect the other. Knowing that 
the first coin-flip is heads tells us nothing about the second coin-flip. 


A little more formally, let A and B be the random variables indicating whether the first 
and second coin-flip are heads (respectively). That is, A = 1 if the first coin-flip is heads 


and A = 0 otherwise; and B = 1 if the second coin-flip is heads and B = 0 otherwise. Then 
the informal statement “the two coin-flips are independent” may be translated into the 
formal statement “the random variables A and B are independent”. 


Informally, knowing the observed value of A tells us nothing about whether B = 0 or 
B=1. (And vice versa.) 





Formally: 


Definition 243. Given random variables X :S > R and Y : S > R, we say that X and 
Y are independent if for all x, y, 


Rik = t=) PO =a Pe oa), 





Let’s restate the above definition more explicitly. Suppose X can take on values x1, %2,...,2%n 
and Y can take on values yj, y2,-.--,Ym- Then to say that X and Y are independent is to 
say that all of the following n x m pairs of events are independent 


KS Tiny HOt. AS Xie Y¥ =e, we ee ae ee 
A Sto, Y Hoi. A =i, Y Sys, As A Ste, Y SY: 
Naa, SH A=, VY = op, ae Say Soe. 


Independence between two random variables is thus equivalent to independence between 
many pairs of events. 
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Example 1487 (continued from above). We now verify, in more formal and precise 
language, that “the two coin-flips are indeed independent”. 


Again, A and B are the random variables indicating whether the first and second coin-flips 
are heads (respectively). 


We now verify that indeed, P(A =a, B = b) = P(A=a)P(B =5) for all possible values of 
a and b: 


P(A=a,B=b) P(A=a)P(B =D) 

P({TT}) = 0.25 P ({TH,TT}) P ({HT,TT}) = 0.5 x 0.5, 
P ({HT}) =0.25 P ({HH, HT})P ({HT,TT}) = 0.5 x 0.5, 
P ({TH}) =0.25 P ({TH,TT}) P ({HH,TH}) =0.5 x 0.5, 
P({HH})=0.25  P({HH,HT})P({HH,TH}) =0.5~x0.5. 








Exercise 489. Flip two fair coins. Let X : S > R indicate whether the two coin flips 
were the same and Y : S > R count the number of heads. Are X and Y independent 
random variables? (Answer on p. 1969.) 





Earlier we warned against blithely assuming that any two events are independent. Here we 
can repeat this warning: Unless explicitly told (or you have a good reason), do not assume 
that two random variables are independent. 


The assumption of independence is a strong one. There are many scenarios where it is 
plausible. For example, the flips of two coins are probably independent. The rolls of two 
dice are probably independent. 


There are, however, also many scenarios where it is not plausible. Today’s changes in 
the share prices of Google and Apple are probably not independent. Today’s rainfall in 
Singapore and in Kuala Lumpur are probably not independent. 


Nonetheless, the assumption of independence is frequently—and incorrectly—made even 
when it is implausible. The reason is that the maths is easy if we assume independence—we 
can simply multiply probabilities together. Unfortunately, incorrectly assuming indepen- 
dence has sometimes had tragic consequences, as we saw in the Sally Clark case. 
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121. Random Variables: Expectation 


Example 1488. Let X be the outcome of a fair die roll. 


What is the expected value (or the mean) of X? In other words, on average, what’s 
the expected outcome of a fair die roll? 


Note that X takes on a value 1 with probability 1/6. Similarly, it takes on a value 2 with 
probability 1/6. Etc. Hence, the expected value of X, denoted E[X] is given by 








fee Mee el 


= 3.9. 
6 6 


E[X] is thus simply a weighted average of the possible values of X, where the weights 
are the probability weights. 





We'll use the following slightly incorrect definition of a discrete random variable:**” 





That is, a random variable is discrete if it takes on finitely many possible values. 


We can now formally define the expected value of a discrete random variable: 


Definition 244. Let E = (S,%, P) bean experiment. Then the corresponding expectation 
operator, denoted E, is the function that maps any discrete random variable X :S > R 
to a real number, according to the mapping rule 


S P(X =k)-k. 
keRange X 


We call E[X] the expected value (or mean) of X. We often write wx = E[X] or even 
u=EL[X] (if it is clear from the context that we’re talking about the mean of X). 


Example 1489. Let X be the outcome of a fair die roll. The range of X is Range (x) = 
{1,2,3,4,5,6}. So 


> P(X =k)-k 
keRange(z) 


=P(X =1)-1+P(X =2)-2+P(X =3)-3+P(X =4)-4+P(X =5)-5+P(X =6)-6. 


i] 1 il 1 il il 
Sasie (pe) arn ee ir nee Bee oe ae 
5 ae i ae i Dar oo 








°30The correct definition is this: A random variable is discrete if its range is finite or countably infinite. 
I avoid giving this correct definition because this would require explaining what “countably infinite” 
means. 
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Example 1490. Let Y be the sum of two fair die-rolls. 


The range of Y is Range(y) = {2,3,4,5,6,7,8,9, 10, 11,12}. In Exercise 485, we worked 
out that P{¥ =2)=1/36, P(Y =3)=2/36, etc, Vhus, 


y. P(Y=k)-k 


keRange(y) 
=P(Y =2)-2+P(Y =3)-3+P(Y =4)-4+P(Y =5)-5+---+P(Y =12)-12 


1 2 3 4 5 6 5 4 3 2 1 
ea ere emia ean ee ane reels <r Meseeen gets ee ayy reece | | OUe=eUN | prey ae 
— oe 86 9 66° 6 se 6 a 


5 Ue ee 
Sy ae ee a Oe oe 


Example 1491. XXFlip two fair coins and roll two fair dice. Let X be the number of 
heads and Y be the number of sixes. 


Problem: What is E[X +Y]? 


As it turns out, it is generally true that E[X +Y]=E[X]+E[Y] (as we'll see in the next 
section). So if we knew this, then the problem would be very easy: 


ELX +Y]=E[X]+E[Y]=1+=- 


But as an exercise, let’s pretend we don’t know that E[X +Y]=E[X]+E[Y]. We thus 
have to work out E[LX + Y] the hard way: 

First, note that Range(X + Y) = {0,1,2,3,4}. P(X + Y =0) is the probability of 0 heads 
and 0 sixes. And P(X + Y =1) is the probability of 1 head and 0 sixes OR 0 heads and 
1 six. We can compute 





51_ 50, 10 _ 60 
66 5144) 144 


You are asked to complete the rest of this problem in the exercise below. 


Exercise 490. Complete the above example by following these steps: (a) Compute 
P(X +Y =2). (b) Compute P(X + Y =3). (c) Compute P(X + Y =4). (d) Now com- 
pute E[X + Y]. (Answer on p. 1969.) 
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121.1. The Expected Value of a Constant R.V. is Constant 


Example 1492. Let 5 be a constant random variable on some experiment FE = (S,™,P). 
That is, 5: S > R is the function defined by s + 5. (Note that the symbol 5 does double 
duty by denoting both a function and a real number.) Then not surprisingly, 


Function Number 


| 
E [5] 


That is, on average, we expect the random variable 5 to take on the value 5. 





We can easily prove the above observation: 


Fact 235. Jf the constant random variable c maps every outcome to the number c, then 
falke| ave 


Proof. The PMF of the constant random variable c is given by P (c=c) = 1 and P(c=k) =0 
for any k #c. Hence, E|c] =P (c=c)+c=1-c=c. 
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Exercise 491. In the game of 4D, you pay $1 to pick any four-digit number between 
0000 and 9999 (there are thus 10,000 possible choices). There are two variants of the 4D 
game—“big” and “small”. The prize structures are as given below. Let X be the prize 
received from a $1 stake in the “big” game and Y be the prize received from a $1 stake 
in the “small” game. (Answer on p. 1970.) 


(a) Write down the range of X and the range of Y. 

(b) Write down the probability distributions of X and Y. 
(c) Hence find E[X] and E[Y]. 
( 


d) Which game—“big” or “small”—is expected to lose you less money? 


Prize Structure for Ordinary, 4-D Roll and System Entries 


(a) Prize Amounts and Winning Numbers for 4-D Game (Big) 


Number of 4-digit Prize Amount (for 
Prize Winning Numbers every $1 stake) 
Ist Prize one number $ 2,000 
2nd Prize one number $ 1,000 
3rd Prize one number $ 490 
Starter Prizes ten numbers $ 250 
Consolation Prizes ten numbers $ 60 


(b) Prize Amounts and Winning Numbers for 4-D Game (Small) — applicable to 
Ordinary Entry, 4-D Roll and System Entry 


Number of 4-digit Prize Amount (for 
Prize Winning Numbers every $1 stake) 
lst Prize one number $ 3,000 
2nd Prize one number $ 2,000 
3rd Prize one number $ 800 


There shall be no starter prize or consolation prize for 4-D Game (Small). 


(Source: Singapore Pools, “Rules for the 4-D Game”, Version 1.11, 17/11/15. PDF.) 
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121.2. The Expectation Operator is Linear 


Definition 245. Let f: A > B be a function, x,y ¢ A, and ke R. We say that f isa 
linear transformation if it satisfies the following two conditions: 


(a) Additivity: f(x +y) = f (x) +f (y); and 


(b) Homogeneity of degree 1: f(kx) =kf (x). 


Example 1493. The summation operator » is an example of a linear transformation. 
Because it satisfies both additivity and homogeneity of degree 1: 


ae) = yore and 5 (bay) = On 
in a, i=l i=l 


d 
Example 1494. The differentiation operator — is an example of a linear transformation. 


z 
Because it satisfies both additivity and homogeneity of degree 1: 


<(F(0)+9(2))=<f(2)+< g(a) and © (bf (2) = bof (2). 





A common mistake made by students is to believe that “everything is linear”. 
Here are two examples of operators that are not linear transformations. 


Example 1495. The square-root operator \/- is not a linear transformation. In general, 
we do not have 


Jety=Vitvy or Vio =kyZ. 


Example 1496. The square operator -? is not a linear transformation. In general, we 
do not have 


(nty)/=a2+y? or (kx) =ke?. 
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It turns out that the expectation operator is a linear transformation. 


Proposition 22. The expectation operator E is linear. That is, if X and Y are random 
variables and c is a constant, then 


(a) Additivity: E[X +Y]= E[X]+E[Y], 
(b) Homogeneity of degree 1: E[cX ] = cE[X]. 














Proof. Optional, see p. 1731 (Appendices). 





The linearity of the expectation operator is a powerful property, especially because it is 
true even if independence is not satisfied. 


Example 1497. I stake $100 on each of two different 4D numbers for Saturday’s drawing 
(“big” game). (So that’s $200 total.) 


Let X and Y be my winnings (excluding my original stake) from the first and second 
numbers (respectively). Now, X and Y are certainly not independent because for exam- 
ple, if my first number wins first prize, then my second number cannot possibly also win 
first prize. 


Nonetheless, despite X and Y not being independent, the linearity of the expectation 
operator tells us that 


E[X +Y]=E[X]+E[Y] = $65.90 + $65.90 = $131.80. 
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122. Random Variables: Variance 


Example 1498. Consider a random variable X that is equally likely to take on one of 5 
possible values: 0,1,2,3,4. Its mean is 





1 1 i 
He Leh 0 


Now consider another random variable Y that is equally likely to take on one of 5 possible 
values: —8,-3,2,7,12. Coincidentally, its mean is the same: 





1 1 1 1 1 
= i 2 a a2. 
py = DP (V =k) k= 3-(-8) 45: (-8) + 5-24 2-74 2 


The random variables X and Y share the same mean. However, there is an obvious 
difference: Y is “more spread out”. 





What, precisely, do we mean when we say that one random variable is “more spread out” 
than another? 


Our goal in this section is to invent a measure of “spread-outness”. We’ll call this the 
variance and denote the variance of any random variable X by Var [X']. 


It’s not at all obvious how the variance should be defined. One possibility is to define the 
variance as the weighted average of the deviations from the mean. 
Example 1234 (continued from above). (Our first proposed definition of variance.) 


For X, the weighted average of the deviations from the mean is 


V[X] =) P(X =k)-(k-p) 
iI | | 1 1 
Se ele (a) i (ea) 


1 1 1 1 1 
OO I) 


ea ‘i 
jeeee ae 
"5 5 


eae ey 0. 
5 5 


Hmm. This works out to be 0. Is that just a weird coincidence? Let’s try the same for 
Ne. 


Vi Ph) re) 


=-(-8-p)+2-(-8-u)+2-(Q-m)+e-(7-w) +2 -(12-H) 
1 


1 1 1 | 
gp (-8-2)+ 5 (8-2) +e (2-2) +e (7-2) 4 (12-2) 


=-2-14+0+1+2=0. 


Hmm. Again it works out to be 0. 





This is no mere coincidence. It turns out that }° P(X = k)-(k-j) is always equal to 0. 
k 
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This is because 
at a a a a 


k 
= - py P(X =k) = 0. 
k 
eV 
=] 

So our first proposed definition of the variance—the weighted average of the deviations 
from the mean—is always equal to 0. Intuitively, the reason is that the negative deviations 
(corresponding to those values below the mean) exactly cancel out the positive deviations 
(corresponding to those values above the mean). 


This proposed definition is thus quite useless. We cannot use it to say things like Y is 
“more spread out” than X. 


This suggests a second approach: define the variance to be the weighted average of the 
absolute deviations from the mean. 

Example 1234 (continued from above). (Our second proposed definition of vari- 
ance. ) 


For X, the weighted average of the absolute deviations from the mean is 


VIX]=DP(X=h) |= a 
1 1 
Behe yen eee 
Sieh e 5 
Se eee ee ne 
5 5 5 5 
2 1 


=-+4+- 
5 9 


1 1 1 
Ib-pl+e (2-wl+ 5 Bul +e a 


+Q+ 


2_6 
5 5 
YY: 


1 
—s 
5 
And now let’s work out the same for 
Vi = Pe =k) i) 
| il i | i 
a ale = =p aa atl 2a = == 
E | Bl +5 | p+ =| B+ =| B+ = [12 — ps] 
J i i il | 
= —-|-8 —2|/ + —-|-3 -—2/ + —-|2-2|+ —-|7-2/+—-|12-2 
= -|-8-2)+2-|-3-2 +2 -2-I +e -|7-+e- 
see eee eee ato, 


Wonderful! So we can now use this second proposed definition of the variance to say 
things like “Y is more spread out than X”. 





This second proposed definition seems perfectly satisfactory. Yet for some bizarre reason, 
we won’t use it! Instead, we’ll define the variance to be the weighted average of the 
squared deviations from the mean. 
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Example 1234 (continued from above). (The actual definition of variance.) 


For X, the weighted average of the squared deviations from the mean is 


V[X] = SP (X =k) -(k- pn) 
ee 0- peeves eens renee aS ia 
a BY +5 by +5 H) +. by +s LL 


“(0-2)” 


i 
5 


(1-2) 5-2-2)? +5. (8-2)? + 5 (4-2) 


And now let’s work out the same for Y: 


VIY]=>)P(Y =k): (kn) 
=e (-B- pn) te (8p) e-O-w) +e (T- Hy + = (12-1) 
1 9 1 2 il 2 i y 1 2 
Seat =?) aa) +e (2-2) + (ie) +e (12-2) 

= 20+54+0+5+20 = 50. 


2 





Formally, 


Definition 246. Let u = E[X]. Then the variance operator is denoted Var and is the 
function that maps each random variable X to a real number c, given by the mapping 
rule 


V(X] =E|(X-»)). 


We call Var [|X] the variance of X. This is often also instead written as 0% or even more 
simply as o? (if it is clear from the context that we’re talking about the variance of X). 





So to calculate the variance, we do this: Consider all the possible values that X can take. 
Take the difference between these values and the mean of X. Square them. Then take the 
probability-weighted average of these squared numbers. 


More examples: 


1236, Contents www.EconsPhDTutor.com 


Example 1499. Let the random variable X be the outcome of the roll of a fair die. We 
already know that p = 3.5. Hence, 


V [X]=E[(X - p)*] = E[(X -3.5)*] 


= P(X =1)-(1-3.5)? + P(X =2)-(2-3.5)?+---+ P(X =6)- (6 - 3.5)? 


1 
a (2.57 + 1.5? +0.5° + 0.5? + 1.5? + 2.5”) = = x 2.92. 


35 
So the variance of the die roll is as 2.92. ‘This means that the expected squared 


deviation of X from its mean p = 3.5 is oe 2.92, 


Example 1500. Roll two fair dice. Let the random variable Y be the sum of the two 
dice. We already know from Example 1490 that w = 7. So, using also our findings from 
Exercise 485, 


VIY]=E[(Y-4)]=E[(¥-7)'] 


=P(Y =2)-(2-7)?+P(Y =3): (3-7)? +-:-+P(Y =12)- (12-7)? 


_ 2(25 + 32427+16+5) 210 _ 70 2, 
> 36 oe ie ae 


70 
So the variance of the sum of two dice is — » 5.83. This means that on average, the 


square of the deviation of Y from its mean p = 7 is oo 5.83. 





As the above examples suggest, calculating the variance can be tedious. Fortunately, there 
is a shortcut: 
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Fact 236. Let X be a random variable with mean . Then Var[|X | = E[X?] i 


Proof. Using the definition of variance, the linearity of the expectation operator (Proposi- 
tion 22), and the fact that py is a constant, we have 


V [X] =E[(X - p)*] = E[ X24? - 2X py] = E[X?]+E[p?] - 2B [Xp] 














= E[X?] +p? -2uELX]=E[X?]4p?-2u-y=E[X?]-p’. 
We now redo the previous two examples using this shortcut: 


Example 1499 (continued from above). Let the random variable X be the outcome 
of the roll of a fair die. We already know that p = 3.5. So compute 


1 91 
E[X?]=P(X=1)-?+P(X =2)-24--4+P(X =6) 6 =2(P +2 4-46) ==. 


ls2 147 35 
H X]=E[X?]-p? = ‘Se rs 
ence, Var [X ] [ | Ll 3.9 Pm WwW 12 


Example 1500 (continued from above). Let the random variable Y be the sum of 
two rolled dice. We already know from Example 1490 that p = 7. So, using also our 
findings from Exercise 485, 


EY?) =P(Y =2)22?4 P(Y 23): +--+ P(Y = 12): 12? 


1 2 3 1 
Pesan) 2 ier? (a genie gree epee (9 
36° 36° 36. 36 


_ 4418+ 48 +100 + 294 + 320 + 324+ 300+ 242+144 1974 658 
. 36 eeo oe 
_ 658 12 _ 658 588 _ 70 

ne Ds dD: 

This is still tedious, but arguably quicker than before. 


Hence, Var[Y] =E | =the 





Exercise 492. Let the random variable Z be the sum of three rolled dice. Find Var [Z]. 
(Answer on p. 1971.) 
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122.1. The Variance of a Constant R.V. is 0 


A constant random variable cannot vary. So not surprisingly, the variance of a constant 
random variable is 0. 








Proof. Use Fact 1238: Var [c] = E[c?] - (E [ce]? =2-2 =0. 
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122.2. Standard Deviation 


Let X be a random variable. Then E[X] has the same unit of measure as X. In contrast, 
Var [X] uses the squared unit. 


Example 1501. There are 100 dumbbells in a gym, of which 30 have weight 5 kg and 
the remaining 70 have weight 10 kg. Let X be the weight of a randomly chosen dumbbell. 
Then the mean of X is 


E[X]==0.3x5 ke+0.7x 10 kg =8.5 kg. 


And the variance of X is 


V [X] =0.3x (5 kg-8.5 ke)” +0.7x (10 kg-8.5 ke)” 
= 0.3 x 12.25 kg? +0.7 x 2.25 kg? = 5.25 kg”. 





To get a measure of “spread” that uses the original unit of measure, we simply take the 
square root of the variance. This is called the standard deviation as a measure of spread. 


Definition 247. Let X be a random variable and Var[X] be its variance. Then the 
standard deviation of X is defined as 


SD[X] = J/V[X]. 





The variance of a random variable X is often denoted o% or even more simply as o? (if it 
is clear from the context that we’re talking about the variance of X). 


Correspondingly, the standard deviation of X is often denoted ox or a. 


Example 1240 (continued from above). We calculated the variance of X to be 
Vari) 26° = 5.25. 


Hence, the standard deviation of X is simply o = V5.25 » 2.29 kg. 


Exercise 493. There are 100 rulers in a bookstore, of which 35 have length 20 cm and 
the remaining 65 have length weight 30 cm. Let Y be the weight of a randomly chosen 
dumbbell. Find the mean, variance, and standard deviation of Y. (Be sure to include 
the units of measurement.)(Answer on p. 1971.) 
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122.3. The Variance Operator is Not Linear 


The variance operator is not linear. However, given independence, the variance operator 
does satisfy additivity and homogeneity of degree 2. 





Proposition 23. Let X and Y be independent random variables and c be a constant. 
Then 


(a) Additivity: Var_X +Y]= Var[X]+ Var[Y], 
(b) Homogeneity of degree 2: Var[cX] = c? Var X]. 














Proof. Optional, see p. 1732 (Appendices). 





With the above, it becomes much easier than before to find the variance of the sum of 2 
dice, 3 dice, or indeed n dice. 
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Example 1502. Let X be the outcome of a fair die-roll. We showed earlier that Var [X ] = 
30 


12° 
Now roll two fair dice. Let X, and X» be the respective outcomes. Let Y be the sum of 
the two dice (i.e. Y = X1 + X2). Assuming independence, we have 

70 


V[Y]=V[X14+ Xo] =V[Xi] + V [2] = 7. 


Compare this quick computation to the work we did in Example 1500! 
Now roll three fair dice. Let X3, X4, and X; be the respective outcomes. Let Z be the 


sum of the three dice (i.e. Z = X3 + X4+X;5). Again, assuming independence, we have 


V[Z] = V[X5 + Xa + X5] = V[X5] + V [Xa] + V [Xs] = <2. 


Again, compare this quick computation to the work you had to do in Exercise 492! 


Now, let A be double the outcome of a die roll (i.e. A = 2X). Note importantly that 
A#Y. Y is the sum of two independent die rolls. In contrast, A is double the outcome 
of a single die roll. Indeed, by Proposition 23, we see that 








V[A] = V[2X] =4V [X] = = eV[Y]. 


Similarly, let B be triple the outcome of a die roll (i.e. B= 3X). Note importantly that 
B+Z. Z is the sum of three independent die rolls. In contrast, B is triple the outcome 
of a single die roll. Indeed, by Proposition 23, we see that 


V[B] = VI3X]=9V [X]= 52 + V[Z]. 








Exercise 494. The weight of a fish in a pond is a random variable with mean pw kg and 
variance o” ke*. (Include the units of measurement in your answers.) (Answer on p. 
1971.) 


(a) If two fish are caught and the weights of these fish are independent of each other, 
what are the mean and variance of the total weight of the two fish? 


(b) If one fish is caught and an exact clone is made of it, what are the mean and variance 
of the total weight of the fish and its clone? 


(c) If two fish are caught and the weights of these fish are not independent of each other, 
what are the mean and variance of the total weight of the two fish? 
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122.4. The Definition of the Variance (Optional) 


Why is the variance defined as the weighted average of squared deviations from the mean? 


1. First, we tried defining the variance as the weighted average of deviations from the mean, 
i.e. Var[X]=E[X - py]. But this was no good, because this quantity would always be 
equal to 0.°°! 

2. Next, we tried defining the variance as the weighted average of absolute deviations from 
the mean, i.e. Var[X] = E[|X - |]. This seemed to work well enough. But yet for some 
bizarre reason, we choose not to use this definition. 


3. Instead, we choose to use this definition: 
2 
V [X] = E[(X - )']. 


Why do we prefer using squared (rather than absolute) deviations as our definition of 
variance? The conventional view is that the squared deviations definition is superior to 
the absolute deviations definition (but see Gorard (2005) and Taleb (2014) for dissenting 
views). Here are some reasons for believing the squared deviations definition to be superior: 


e The maths works out more nicely. For example: 


— The algebra is easier when dealing with squares than with absolute values. 
— Differentiation is easier (serve that x? is differentiable but |z| is not). 


— Variances are additive: If X and Y are independent, then Var[X +Y] = Var[X] + 
Var [Y ]. In contrast, if we use the definition Var [|X] = E[|X — |], then variances are 
no longer additive. 


¢ Tradition (inertia). 


— A century or two ago, some Europeans preferred using squared to absolute deviations. 
And so we’re stuck with using this. 


See also these five SE discussions: &%, &, [mathoverflow, &, @. 





°31This is easily proven: ELX - uw] =E[X]-E[p] =u-p=0. 
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123. The Coin-Flips Problem (Fun, Optional) 


Here’s another example of a probability problem that can be stated very simply, yet have 
counter-intuitive results. 


Example 1503. Keep flipping a fair coin until you get a sequence of HH (two heads in 
a row). Let X be the number of flips taken. 


Now, keep flipping a fair coin until you get a sequence of HT’. Let Y be the number of 
flips taken. 
Which is larger px = E[X] or py = E[Y]? 


Intuition might suggest that “obviously”, wx = wy. Intuition would be wrong. It turns 
out that, surprisingly enough, zy = 6 and py = 4! 


Example 1504. Now suppose we flip a fair coin 10,001 times. This gives us a sequence 
of 10,000 pairs of consecutive coin-flips. 


For example, if the 10,001 coin-flips are HHTAT'..., then the first four pairs of consec- 
utive coin-flips are HH, HT, TH, and AT. 


Let A be the proportion of the 10,000 consecutive coin-flips that are HH. Let B be the 
proportion of the 10,000 consecutive coin-flips that are HT. 
Which is larger 4 = E[A] or wg = E[B]? 


In the previous example, we saw that it took, on average, 6 flips before getting HH and 
4 flips before getting HT. So “obviously”, we’d expect a smaller proportion to be H H’s. 
That is. fA < fee. 


Sadly, we would again be wrong! It turns out that w4 = wp = 1/4! This Google spreadsheet 
simulates 10,001 coin-flips and calculates A and B. 





If you’re interested, the results given in the above two examples are formally proven in Fact 
300 (Appendices). 
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124. The Bernoulli Trial and the Bernoulli Distribution 


A Bernoulli trial is an experiment (5,%,P). A coin flip is an example of a Bernoulli 
trial. 


Example 1505. Flip a coin. We can model this with a Bernoulli trial with probability 
of success (heads) 0.5: 


e Sample space S = {T, H}, 
e Event space © = {@,{T},{H}, S}, 


e Probability function P({T}) = 0.5 and P({H}) =0.5. 


The corresponding Bernoulli random variable is simply the random variable X : S > 
R defined by X ({T7}) = 0 and X ({H}) = 1. Its probability distribution is given by 
PX =0) =0.5 and P(X =1) =0.5, 





Formally: 


Definition 248. A Bernoulli trial with probability of success p is an experiment (S,¥, P) 
where 


e S={0,1}. (The sample space contains 2 elements.) 


eo {2. (0) (st. 
e P:.=>R is defined by P({0}) = 1-p and P({1}) =p. (And as usual P(@) = 0 and 
P(sj\=1,) 


The corresponding Bernoulli random variable is simply the random variable X : S > 
R defined by X ({0}) = 0 and X ({1}) = 1. Its probability distribution is given by 
Px 20) 21 prand Pix = 1) aap: 





Note that we can denote the two elements of the sample space with any symbols. We could 
use 0—standing for failure—and 1—standing for success. Or we could use 7’ and H, as was 
done in the example above. 


Example 1506. On any given day, our refrigerator at home has probability 0.001 of 
breaking down. We can model this with a Bernoulli trial with probability of success 
0.001: 


¢ Sample space S = {0,1}, 
e Event space © = {@, {0}, {1}, S}, 


¢ Probability function P({0}) = 0.999 and P({1}) = 0.001. 


The corresponding Bernoulli random variable is simply the random variable T’: S > R 
defined by T ({0}) =0 and T ({1}) =1. 

Its probability distribution is given by P (T = 0) = 0.999 and P(T = 1) = 0.001. In words, 
the probability of no failure is 0.999 and the probability of a failure is 0.001. 
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Example 1507. 90% of H2 Maths students pass their H2 Maths A-Level exams. We 
randomly pick a H2 Maths student and see if she passes her H2 Maths A-Level exam. 


We can model this with a Bernoulli trial with probability of success 0.9: 
e Sample space S = {F, P}, 

e Event space © = {@,{F},{P},S}, 

¢ Probability function P({F'}) =0.1 and P({P}) =0.9. 


The corresponding Bernoulli random variable is simply the random variable Y : S > 


R defined by Y ({F'}) = 0 and Y ({P}) = 1. Its probability distribution is given by 
P(Y =0)=0.1 and P(Y = 1) =0.9. 





The following two statements are equivalent: 


1. T is a Bernoulli random variable with probability of success p. 


2. The random variable JT’ has Bernoulli distribution with probability of success p. 
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124.1. Mean and Variance of the Bernoulli Random Variable 


Fact 238. A Bernoulli random variable T with probability of success p has mean p and 
variance p(1-p). 


Proof. E[T] =P(T =0)-0+P(T=1)-1=(1-p)-0+p-1=p. 


For the variance, first compute 


E[T?]=P(T=0)-0?+P(T=1)-=(1-p)-0+p- =p. 





Hence, Var [T] = E[T?] - (E[T])° = p-p? = p(1-p). 
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125. The Binomial Distribution 


Informally, the binomial random variable simply counts the number of successes in a 
sequence of n identical, but independent Bernoulli trials. 


Example 1508. Flip 3 fair coins. Let X be the number of heads. 


1 
X is an example of a binomial random variable X with parameters 3 and 5 


X can take on values 0, 1, 2, or 3 (corresponding to the number of heads). 


The probability distribution of X is given by 
Sey ly ie eal 
P(X =0)= —~| {-] =- P(x =ljye 
\ (*J0)@) 8 ( ) (’ 


rov=2)-(3)(j =) UEC 











Formally: 


Definition 249. Let 7), 7>, ..., JT, be n identical, but independent Bernoulli random 
variables, each with probability of success p. Then the binomial random variable X with 
parameters n and p is defined as: 


X =7,4+754+---+T). 





The following three statements are entirely equivalent: 


1. X is a binomial random variable with parameters n and p. 
2. The random variable X has the binomial distribution with parameters n and p. 
3. X ~ B(n,p). 
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Example 1509. 90% of H2 Maths students pass their A-Level exams. 


Let Y be the number of passes among two randomly chosen students. Then Y is a 
binomial random variable with parameters 2 and 0.9. Its probability distribution is given 
by 


P= 0)= ( ; Joos = 0.01, 


yer ( ; Joao. = 0.18, 


Py a2) = ( ; Joo." =10,81) 


In words, the probability that both fail is 0.01, the probability that exactly one passes is 
0.18, and the probability that both pass is 0.81. 
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125.1. Probability Distribution of the Binomial R.V. 


Let X ~ B(n,p). What is P(X =k)? 
Observe that P(X = k) is simply the probability that in a sequence of n independent 
Bernoulli trials, each with probability of success p, there are exactly k successes. 


First consider instead the probability that in a sequence of n trials, the first k trials are 
successes and the remaining n -k are failures. We know that the probability of a success 
is p and the probability of a failure is 1-p. Hence, by the Multiplication Principle, this 
probability is simply p*(1-p)"™. 


The above is the probability of k successes and n-k failures, but where exactly the first k 
trials are successes and exactly the last n —k trials are failures. But we don’t care about 
where the successes are. We only care that there are k successes. And there are C(n,k) 
ways to have exactly k successes in n trials. Thus, 


P(X =k) = ( . } tans 


In summary: 


Fact 239. Let X ~ B(n,p). Then for any k =0,1,...,n, 


P(X =k) = ( Jan 


Example 1510. Let X be the number of heads when 10 fair coins are flipped. 
Then X ~ B(10,0.5). And the probability that exactly 8 coins are heads is 


45 
P(X =8) = ( = Joss = a 


Example 1511. 90% of H2 Maths students pass their A-Level exams. 


Let Y be the number of passes among 20 randomly chosen students. Then Y ~ B(20,0.9). 
And the probability that at least 18 pass is 


P(Y > 18) = P(Y = 18) +P(Y =19) +P(Y = 20) 


wee 0.020.117 + ot Oo 0 ay 0.9290.19 = 0.677. 
18 19 20 
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125.2. The Mean and Variance of the Binomial Random Variable 


Example 1512. Problem: Three machines each have, independently, probability 0.3 of 
failure. What is the expected number of failures? What is the variance of the number of 
failures? 


Solution: Let Z ~ B(3,0.3) be the number of failures. Then 


(Za) -( ; Jostor. 4 =D) ( ; Jo.vo.. CZ =o ( : Joon 


Hence, E[Z]=P(Z=1)-1+P(Z=2)-2+P(Z=3)-3 


-( : Jost ( : Josto.24| ; Joo 


= 0.441 + 0.378 + 0.081 = 0.9. 
That is, the expected number of failures is 0.9. 


Now,E[Z?] =P(Z=1)-1?+P(Z=2)-2?+P(Z =3)-3? 


= ( ' Josie ile ( ; Jostoz' a ( ; Joo 9 


= 0.441 + 0.756 + 0.243 = 1.44. 


Hence, V[Z]=E[Z?]-(E[Z])° = 1.44-0.9? = 0.63. 


That is, the variance of the number of failures is 0.63. 





It turns out though that there is a much quicker formula for finding the mean and variance 
of any binomial random variable. 
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Fact 240. If X ~ B(n,p), then E[X]=np and Var[X] = np(1-p). 


(You can verify that this formula works for the last example: n = 3, p = 0.3, and thus 
E [Z| =np = 0.9.) 


Proof. Let T\, To, ..., T;, be identical, but independent Bernoulli random variables with 
parameter p. Then X =7) + 75)+---+7),. Hence, 


E[X]=E[%+7)+---+7,]=E[NM]+E[]+---+E[T,] =p+pt---+pe=np. 


V[X]=V[T. + 4+---+T,])=V[T]+V [i] +---+ V [Th] 
=p(1-p)+p(1-p) +--+ p(1-p) =np(1 -p). 














Exercise 495. (Answer on p. 1972.) Plane engine #1 contains 20 components, each 
of which has probability 0.01 of failure. Plane engine #2 contains 35 components, each 
of which has probability 0.005 of failure. The probability that any component fails is 
independent of whether any other component has failed. 


An engine fails if and only if at least 2 of its components fail. What is the probability 
that both engines fail? 
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126. The Continuous Uniform Distribution 


So far, all examples of random variables we’ve seen have been discrete. For example, the 
binomial random variable X ~ B(n,p) is discrete, because Range (X) = {0,1,2,...,n} is 
finite. 


We'll now look at continuous random variables. Informally, a random variable Y is con- 
tinuous if its range takes on a continuum of values. 


For H2 Maths, you need only learn about one continuous random variable: the normal 
random variable (subject of the next chapter). 


Nonetheless, we’ll first look at another continuous random variable that is not in the syl- 
labus. This is the continuous uniform random variable. It is much simpler than 
the normal random variable and can thus help build up your intuition of how continuous 
random variables work. 


1253, Contents www.EconsPhDTutor.com 


126.1. The Continuous Uniform Distribution 


A line measuring exactly 1 metre in length is drawn on the floor. It is about to rain. Let 
X be the position of the first rain-drop that hits the line. X is measured as the distance 
(in metres) from the left-most point of the line. 

So for example, if the first rain-drop hits the left-most point of the line, then x = 0. If it 
hits the exact midpoint of the line, then x = 0.5. And if it hits the right-most point, then 
ea 

Assume we can measure X to infinite precision. 


Then, assuming the first rain-drop is equally likely to hit any point of the line, we can 
model X as a continuous uniform random variable on [0,1]. This says that 


e The range of X is [0,1] (the first rain-drop can hit any point along the line); and 
e X is equally likely to take on any value in the interval [0,1] (the first rain-drop is equally 
likely to hit any point along the line). 


The following three statements are entirely equivalent: 


1. X is a continuous uniform random variable on [0,1]. 
2. X is a random variable with the continuous uniform distribution on [0,1]. 


a eres (Cae 


Recall that previously with any discrete random variable Y, we could find its probability 
distribution. That is, we could find P(Y =k) (the probability that Y takes on the value 
k). For example, if Y ~ B(3,0.5) modelled the number of heads in three coin-flips, then 


3 
the probability that there was one heads was P (Y = 1) = ( ' }osios = 3 


Now, in contrast, for any continuous random variable X, strangely enough, there is 
zero probability that X takes on any particular value! For example, if X ~ U[0,1], then 
P(X =0.37) = 0. That is, there is zero probability that X takes on the value of 0.37! 


At first glance, this may seem strange. 


But remember: There are infinitely many real numbers in the interval [0,1]. So it makes 
sense to say that the probability of X taking on any particular value is zero.”” 


So for any continuous random variable X, it is pointless to try to write down P(X =k) for 
different possible values of k, because P (X =k) is always equal to zero (regardless of what 
k is). Instead, we shall try to write down P(a< X <b), for different possible values of a 
and 0. 


Now, if X ~ U[0,1], then the probability that X takes on values between 0.3 and 0.7 is 
simply 0.7 - 0.3 = 0.4. That is, 





°2But strangely enough, zero probability is not the same thing as impossible. For example, we’d 
say that 


¢ There is zero probability, but it is not impossible that X ~ U[0,1] takes on the value 0.37. 
¢ There is zero probability and it is impossible that X ~ U[0,1] takes on the value 1.2. 
(Actually, rather than use the word “impossible”, mathematicians prefer saying “almost never”, which 


has a precise definition.) 
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P (0.3 < X <0.7) =0.7-0.3=0.4. 


Similarly, the probability that X takes on values between 0.16 and 0.35 is simply 0.35-0.16 = 
0.19. That is, 


P (0.16 < X < 0.35) = 0.35 - 0.16 = 0.19. 


The above observations suggest that it may be useful to define a new concept, called the 
cumulative distribution function. 
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126.2. The Cumulative Distribution Function (CDF) 


The CDF simply tells us the probability that X takes on values less than or equal to k, for 
every k € R. Formally: 


Definition 250. The cumulative distribution function (CDF) of a random variable X is 
the function Fy :R — R given by the mapping rule 


Fy(k) =P(X <k). 





It turns out that every random variable can be uniquely defined by giving its 
CDF. For example, the continuous uniform random variable is formally defined thus: 


Definition 251. X is the continuous uniform random variable on [0,1] if its CDF Fy: 
R = R is defined by 


0, Mi ie <1) 
Fy(k)={k, if e [0,1], 
iF He feo, 





Armed with the concept of the CDF, the formal definition of a continuous random variable 
can be simply stated: 





We can now summarise the three possible types of random variables. 


1. Discrete random variables. A random variable is discrete if its range is finite.°°* 


Examples: Bernoulli, binomial. 


2. Continuous random variables. A random variable is continuous if its CDF is con- 
tinuous. Examples: Continuous uniform, normal. 


3. Other random variables. There are random variables that are neither discrete nor 
continuous. But you will not study any of these for the A-Levels. 


Note that every random variable (discrete, continuous, or otherwise) has a cumulative 
distribution function (CDF). 





°33Or countably infinite. 
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126.3. Important Digression: P(X <k)=P(X <k) 
For any continuous random variable X, we have 
P(X <h)=P(X <8). 


That is, whether an inequality is strict makes no difference. The reason is that by the third 
Kolmogorov axiom (additivity), 


P(X <k)=P(X <k)+P(X =k) =P(X <k)+0=P(X<k). 


Thus, for continuous random variables, it doesn’t matter whether inequalities are strict or 
weak. 


Example 1513. Let X ~ U[0,1]. Then 


P(0.2< X<0.5)=P(0.2<X<0.5)=P(0.2<X <0.5)=P(0.2<X<0.5). 
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126.4. The Probability Density Function (PDF) 


The PDF is simply defined as the derivative of the CDF.°*4 


Definition 253. Let X be a random variable whose CDF F'y is differentiable. Then the 
probability density function (PDF) of X is the function fx : R > R defined by 


fx(k) = —Fx(h) 





The PDF has an intuitive interpretation. The area under the PDF between points a and 
b is equal to P(a< X <b). This, of course, is simply a consequence of the Fundamental 
Theorems of Calculus: 


ff fee)ak = Spe k)dk PEC By (6) ~ Fy (a) = P(X $b) ~ P(X <a) = P(as X <b). 


The PDF of X ~ U[0,1] (graphed below) is simply the function fx : R > R defined by 


fx(k)=1, ifke[0,1], and fx(k)=1, otherwise. 





— —_ _ 


_( 1if0<k<1 
~ | 0, otherwise. 
P(0.2 < X < 0.3) =0.1 








0.25 k 


For any a < b, the area under the PDF between a and b is precisely P(a< X <b). For 
example, there is probability 0.25 (red area) that X takes on values between 0.5 and 0.75. 
There is probability 0.1 (blue area) that X takes on values between 0.2 and 0.3. 


Exercise 496. The continuous uniform random variable Y ~ U[3,5] is equally likely to 
take on values between 3 and 5, inclusive. (a) Write down its CDF Fy. (b) Write down 


and graph its PDF fy. (c) Compute, and also illustrate on your graph, the quantities 
P(3.1<Y <4.6) and P(4.8< Y < 4.9). (Answer on p. 1973.) 








°34Note that although every random variable has a CDF, not every random variable has a PDF. In 
particular, if the random variable’s CDF is not differentiable, then by our definition here, the random 
variable does not have a PDF. 
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127. The Normal Distribution 


The standard normal (or Gaussian) random variable (SNRV) is very important. In 
fact, it is so important that we usually reserve the letter Z for it, and the Greek letters @ 
and ® (lower- and upper-case phi) for its PDF and CDF. 


The following three statements are entirely equivalent: 

1. Zisa SNRV. 

2. Z is arandom variable with the standard normal distribution. 
3. Z~N(0,1). 

Here’s the formal definition: 


Definition 254. Z is called a standard normal random variable (SNRV) if its PDF 
@:R- R is defined by 


2 
e7 9-54 ; 


(a) = 


1 
V2 





For the A-Levels, you need not remember this complicated-looking PDF. Nor need you 
understand where it comes from. 


The normal PDF is often also referred to as the bell curve, due to its resemblance to a 
bell (kinda). 









P(0.5 < Z < 0.75) 
= 0.0819 


Pi=1 <2 = 03) 
= 0.4593 














As with the continuous uniform, for any a < b, the area under the normal PDF between 
a and 6 gives us precisely P(a< X <b). For example, there is probability 0.25 (red area) 
that X takes on values between 0.5 and 0.75. There is probability 0.1 (blue area) that X 
takes on values between 0.2 and 0.3. 
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As usual, the CDF ®: R > R is defined by 


®(a)=P(Z<a)= [. ob (x) dx = [. see te. 


Unfortunately, this last integral has no simpler expression (mathematicians would say that 
it has no “closed-form expression”). Instead, as we’ll soon see, we have to use the so-called 
Z-tables (or a graphing calculator) to look up values of ®(k). 


The next fact summarises the properties of the normal distribution. Some of these proper- 
ties are illustrated in the figure that follows. 


Fact 241. Let Z ~ N(0,1) and @ and © be its PDF and CDF. 


. B(oo) = 1. (As with any random variable, the area under the entire PDF is 1.) 


. d(a) > 0, for alla eR. (The PDF is positive everywhere. This has a surprising 
implication: however large a is, there is always some non-zero probability that Z >a.) 


38 Za ihe mean; Za: 0.) 


4. The PDF @ reaches a global mazimum at the mean 0. (In fact, we can go ahead and 
compute o (0) = x 0.399. 
pute (0) oe ) 


. Var| Z| = 12 (the variance of Z 15 1.) 


. P(Z<a)=P(Z<a). (We've already discussed this earlier. It makes no difference 
whether the inequality is strict. This is because P(Z = a) = 0.) 


. The PDF @ is symmetric about the mean. This has several implications: 
(a) P(Z >a) = P(Z < -a) = ®(-a). 
(b) Since P(Z >a) =1- P(Z <a) =1-(a), tt follows that ®(-a) = 1- ®(a) or, 
equivalently, ® (a) =1-®(-a). 
(c)O( 0) = 1 = © (0) = 0.5. 
. P(-1<Z<1)=@(1)-®(-1) x 0.6827. (There is probability 0.6827 that Z takes on 
values within 1 standard deviation of the mean.) 


. P(-2< Z <2) = ®(2) - ®(-2) + 0.9545. (There is probability 0.9545 that Z takes on 
values within 2 standard deviations of the mean.) 


. P(-3< Z <3) = ®(3) - ®(-3) x 0.9973. (There is probability 0.9973 that Z takes on 
values within 3 standard deviations of the mean.) 


. The PDF ¢ has two points of inflerion, namely at +1. (The points of inflexion are one 
standard deviation away from the mean.) 














Proof. Optional, see p. 1735 (Appendices). 
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— —  P(-3 <Z <|3) = 0.9973 ————————> 





; 1 
Inflection points ——- = 0.399 
c= 


21 


P=127 <1) 
= 0.6827 





P(Z < -1.5) P(Z > 1.5) 
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Example 1514. Let’s use the T184 to find ®(2.51). 


1. Press the blue button and then DISTR (which corresponds to the QUA button). 
This brings up the DISTR menu. 


2. Press (2) to select the “normalcdf” option. 


The TI84 is now asking for your lower and upper bounds. Since ©(2.51) = ®(2.51) - 
®(-oo), your lower bound is —oo and your upper bound is 2.51. 


3. But there’s no way to enter —co on your TI84. So instead, you'll enter -10°’, which is 
simply a very large negative number. To do so, press |(-)|, the blue button, EE 
(which corresponds to the g button), and then (9) 9). (Don’t press (AND) yet!) 

. Now to enter your upper bound. First press g (this simply demarcates your lower and 


upper bounds). Then enter your upper bound 2.51 by pressing (2) o ©) (1). Then press 
NVA. Your TI84 says that the answer is (2.51) x 0.99396. 


Step 1. Step 2. Step 3. Step 4. 


P(Z < 2.51) = 0.9940 











1262, Contents www.EconsPhDTutor.com 


Example 1515. To find ®(-2.51), (1.372), and P(-4< Z <4), the steps are very 
similar. So for each, I'll simply give the screenshot from the T184: 


b(-2.51) (1.372) P(-4<Z<4) 





-4 -3 -2 -1 0 








PZ = 1,372) 
= 0.9150 
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Example 1516. We'll find ®(2.51), ®(-2.51), ®(1.372), and P(-4< Z< 4) using Z- 
tables. 


Refer to the Z-tables on p. 1265. (These are the exact same tables that appear on the 
List of Formulae (MF 26).) 


¢ To find 6(2.51), look at the row labelled 2.5 and the column labelled 1—read off the 
number 0.9940. We thus have ©(2.51) = 0.9940. 


To find ®(-2.51), note that the table does not explicitly give values of ®(z), if z <0. 
But we can exploit the fact that the standard normal is symmetric about the mean 
y.=0. This fact implies that 6(-z) = 1-®(z). Hence, 6(-2.51) = 1-6(2.51) = 0.0060. 


To find 6(1.372), first look at the row labelled 1.3 and the column labelled 7—read off 
the number 0.9147. This tells us that (1.37) = 0.9147. Now look at the right end of the 
table (where it says “ADD”). Since the third decimal place of 1.372 is 2, we look under 
the column labelled 2—this tells us to ADD 3. Thus, ®(1.372) = 0.9147+0.003 = 0.9150. 


To find P(-4 < Z <4), the Z-tables printed are actually useless, because they only go 
to 2.99. So you can just write P(-4< 7 <4) #1. 
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THE NORMAL DISTRIBUTION FUNCTION 


If Z has a normal distribution with mean 0 and 
variance | then, for each value of z, the table gives 
the value of ®(z), where 


D(z) =P(Z< z). 


For negative values of z use ®(—z) = 1—(z). 


D(z) 












































0 Zz 
123 45678 9 
Zz 0 1 2 3 4 5 6 7 8 9 ADD 
0.0 | 0.5000 | 0.5040 0.5080 0.5120]0.5160 0.5199 0.5239 |0.5279 0.5319 0.5359} 4 8 12/16 20 24/28 32 36 
0.1 | 0.5398 | 0.5438 0.5478 0.5517 | 0.5557 0.5596 0.5636 | 0.5675 0.5714 0.5753} 4 8 12/16 20 24/28 32 36 
0.2 | 0.5793 | 0.5832 0.5871 0.5910 | 0.5948 0.5987 0.6026 | 0.6064 0.6103 0.6141} 4 8 12/15 19 23/27 31 35 
0.3 | 0.6179 | 0.6217 0.6255 0.6293 | 0.6331 0.6368 0.6406 | 0.6443 0.6480 0.6517) 4 7 11/15 19 22/26 30 34 
0.4 |0.6554/ 0.6591 0.6628 0.6664 |0.6700 0.6736 0.6772 | 0.6808 0.6844 0.6879} 4 7 11/14 18 22/25 29 32 
0.5 | 0.6915 | 0.6950 0.6985 0.7019 | 0.7054 0.7088 0.7123 | 0.7157 0.7190 0.7224|3 7 10/14 17 20/24 27 31 
0.6 | 0.7257] 0.7291 0.7324 0.7357 | 0.7389 0.7422 0.7454 | 0.7486 0.7517 0.7549} 3 7 10/13 16 19/23 26 29 
0.7 | 0.7580] 0.7611 0.7642 0.7673 | 0.7704 0.7734 0.7764 | 0.7794 0.7823 0.7852}3 6 9/12 15 18/21 24 27 
0.8 | 0.7881 |0.7910 0.7939 0.7967 | 0.7995 0.8023 0.8051 | 0.8078 0.8106 0.8133}3 5 8/11 14 16/19 22 25 
0.9 | 0.8159} 0.8186 0.8212 0.8238 | 0.8264 0.8289 0.8315 | 0.8340 0.8365 0.8389}3 5 8/10 13 15/18 20 23 
1.0 | 0.8413 | 0.8438 0.8461 0.8485 |0.8508 0.8531 0.8554 |0.8577 0.8599 0.8621}2 5 7] 9 12 14/16 19 21 
1.1 | 0.8643 | 0.8665 0.8686 0.8708 | 0.8729 0.8749 0.8770} 0.8790 0.8810 0.8830}2 4 6] 8 10 12/14 16 18 
1.2 | 0.8849 | 0.8869 0.8888 0.8907 | 0.8925 0.8944 0.8962 /0.8980 0.8997 0.9015}2 4 6) 7 9 11/13 15 17 
1.3 | 0.9032 | 0.9049 0.9066 0.9082} 0.9099 0.9115 0.9131 /0.9147 0.9162 0.9177}2 3 5] 6 8 10/11 13 14 
1.4 | 0.9192 | 0.9207 0.9222 0.9236/0.9251 0.9265 0.9279 | 0.9292 0.9306 0.9319} 1 3 4] 6 7 8/10 11 13 
1.5 | 0.9332 | 0.9345 0.9357 0.9370 | 0.9382 0.9394 0.9406 | 0.9418 0.9429 0.9441} 1 2 4/5 6 7] 8 10 11 
1.6 | 0.9452 | 0.9463 0.9474 0.9484/0.9495 0.9505 0.9515/0.9525 0.9535 0.9545}1 2 3/4 5 6/7 8 9 
1.7 | 0.9554 | 0.9564 0.9573 0.9582 /0.9591 0.9599 0.9608 | 0.9616 0.9625 0.9633} 1 2 3) 4 4 5|/6 7 8 
1.8 | 0.9641 | 0.9649 0.9656 0.9664 | 0.9671 0.9678 0.9686 | 0.9693 0.9699 0.9706} 1 1 2/3 4 4|/5 6 6 
1.9 | 0.9713] 0.9719 0.9726 0.9732 |0.9738 0.9744 0.9750/0.9756 0.9761 0.9767} 1 1 2] 2 3 4|/ 4 5 5 
2.0 | 0.9772 | 0.9778 0.9783 0.9788 | 0.9793 0.9798 0.9803 |0.9808 0.9812 0.9817/0 1 1] 2 2 3}3 4 4 
2.1 | 0.9821 | 0.9826 0.9830 0.9834 | 0.9838 0.9842 0.9846 | 0.9850 0.9854 0.9857}0 1 1/2 2 2/3 3 4 
2.2 | 0.9861 | 0.9864 0.9868 0.9871 | 0.9875 0.9878 0.9881 | 0.9884 0.9887 0.9890}0 1 1/1 2 2/2 3 3 
2.3 | 0.9893 | 0.9896 0.9898 0.9901 | 0.9904 0.9906 0.9909 | 0.9911 0.9913 0.9916};0 1 1/1 1 2}/2 2 2 
2.4 |0.9918 | 0.9920 0.9922 0.9925 | 0.9927 0.9929 0.9931 | 0.9932 0.9934 0.9936/0 0 1/1 1 1/1 2 2 
2.5 | 0.9938 | 0.9940 0.9941 0.9943 |0.9945 0.9946 0.9948 | 0.9949 0.9951 0.9952}0 0 O; 1 1 1/1 1 ~=1 
2.6 | 0.9953 | 0.9955 0.9956 0.9957 | 0.9959 0.9960 0.9961 | 0.9962 0.9963 0.9964}0 0 0; 0 1 1/1 1 ~=+1 
2.7 | 0.9965 | 0.9966 0.9967 0.9968 | 0.9969 0.9970 0.9971 | 0.9972 0.9973 0.9974|}0 0 0; 0 0 1/1 1 =+1 
2.8 | 0.9974 | 0.9975 0.9976 0.9977 | 0.9977 0.9978 0.9979 |0.9979 0.9980 0.9981}0 0 0; 0 0 0/0 1 1 
2.9 | 0.9981 | 0.9982 0.9982 0.9983 | 0.9984 0.9984 0.9985 | 0.9985 0.9986 0.9986}0 0 0} 0 0 0} 0 O O 
Critical values for the normal distribution 
If Z has a normal distribution with mean O and 
variance | then, for each value of p, the table 
gives the value of z such that 
P(Z< z) =p. 
Pp 0.75 0.90 0.95 0.975 0.99 0.995 0.9975 0.999 0.9995 
Zz 0.674 1.282 1.645 1.960 2.326 2.576 2.807 3.090 3.291 




















127.1. The Normal Distribution, in General 


Let Z ~ N(0,1) be the SNRV and o,€ R be constants. 
Consider oZ + 4, itself a random variable. We know that since E[Z] = 0 and Var[Z] = 1, 
it follows that 

E[oZ+pl=cE[Z]+pwap and V[oZ+p)]=0°V[Z] =o’. 


It turns out that oZ +: is a normal random variable with mean j and variance o7: 


Definition 255. X is called a normal random variable with mean ts and variance o? if 
its PDF fx :R-— R is defined by 


1 
fx (a) = ie 





Once again, for the A-Levels, you need not remember this complicated-looking PDF. Nor 
need you understand where it comes from. 


The following three statements are entirely equivalent: 


1. X is anormal random variable with mean j: and variance o?. 


2. X isa random variable with normal distribution of mean 1 and variance o”. 
3. X ~ N (u,07). 
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Example 1517. The normal random variables A ~ N(-1,1), B ~ N(1,1), and C ~ N(2,1) 
have variance 1 (just like the SNRV), but non-zero means. Their PDFs are graphed below. 
(Included for reference is the standard normal PDF in black.) 


We see that the effect of increasing the mean p is to move the graph of the PDF rightwards. 
And decreasing the mean moves it leftwards. 

















1267, Contents www.EconsPhDTutor.com 


Example 1518. The normal random variables D ~ N(0,0.1), E ~ N(0,2), and F ~ 
N(0,3) have mean 0 (just like the SNRV), but non-unit variances. Their PDFs are 
graphed below. (Included for reference is the standard normal PDF in black.) 


The effect of changing the variance o? is this: 


e The larger the variance, the “fatter” the “tails” of the PDF and the shorter the peak. 


e Conversely, the smaller the variance, the “thinner” the “tails” of the PDF and the 
taller the peak. 
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Example 1519. The normal random variables G ~ N(-1,0.1), H ~ N(1,2), and J ~ 
N(2,3) have non-zero means and non-unit variances. Their PDFs are graphed below. 
(Included for reference is the standard normal PDF in black.) 
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In general, normality is preserved under linear transformations: 


Fact 242. Let X ~ N(p,07) and a,b€R be constants. Then aX +b ~ N(au + b,a’o). 

















Proof. Optional, see p. 1737 (Appendices). 


Thus, we can easily transform any normal random variable into the SNRV: 


ee 
Corollary 49. If X ~ N(p, 0°), then ane N(0,1). Equivalently, X =o0Z + wu. 
o 





(Just to be clear, two random variables are identical if their CDFs are identical.) 














Proof. The next exercise asks you to prove this corollary. 


Xe 
Exercise 499. Using Fact 242, prove that if X ~ N(x, a), then ogee N(O, 1). 





(Answer on p. 1975.) 


The above corollary gives us an alternative method for computing probabilities associated 
with normal random variables. In general, if X ~ N (nu, a), then 


oO 


P(X <e)=P(oZ+ysc)=P(Z<—") -0(—#)_ 
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The properties that we listed for the SNRV also apply, with only a few modifications, to 
any NRV. I highlight any differences in red. The figure that follows illustrates. 


Fact 243. Let X ~ N(n, 07) and let fx and Fy be the PDF and CDF of X. 


1. B(co) =1. (The area under the entire PDF is 1. This, of course, is true of any random 
variable. ) 


. (a) >0, for alla eR. (The PDF is positive everywhere. This has the surprising 
implication that no matter how large a is, there is always some non-zero probability 
that Z >a.) 


3: E[X =e (ihe mean of Z is iu) 
4. The PDF fx reaches a global maximum at the mean ps. (In fact, we can go ahead and 


_ 0. ae 
ov 27 o 


. Var[X]=07. (The variance of X is o?.) 


.P(Z<a)=P(Z<a). (We've already discussed this earlier. It makes no difference 
whether the inequality is strict. This is because P(Z =a) = 0.) 


compute fx (U1) = 


. The PDF @ is symmetric about the mean. This has several implications: 


(a) P(X >p+a) = P(X <p-a) = Fx(u-a). 

(b) Since P(X >+a)=1-P(X <p+a)=1-Fx(p+<a), tt follows that Fx(u-a) = 
1-Fy(p+a) or, equivalently, Fy(+a)=1-Fx(u-a). 

(c) Fy (mu) =1- Fx (1) = 0.5. 


.P(u-o< X <pt+o) = O(1) - ®(-1) # 0.6827. (There is probability 0.6827 that X 
takes on values within 1 standard deviation of the mean.) 

.P(u-o< X <pt+o) = &(2) - ®(-2) x 0.9545. (There is probability 0.9545 that X 
takes on values within 2 standard deviations of the mean.) 

.P(u-o< X <pt+o) = &(3) - ®(-3) x 0.9973. (There is probability 0.9973 that X 
takes on values within 3 standard deviations of the mean.) 


. The PDF @ has two points of inflexion, namely at to. (The points of inflexion are 
one standard deviation away from the mean.) 
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——_ P(u— 30 $< X <p +30) 0.9973 ——> 


fx(a) 


Inflection points 


\ 


P(X < wp — 1.50) 








Pou-o<X <u 


= 0.6827 


(u-20<X <put+2o) 


1 0.399 
ov 21 0 








P(X > wt 1.50) 


a) 


0.9545 +N] . 
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Example 1520. Let G ~ N(-1,0.1), H ~ N(1,2), and J ~ N(2,3). We'll find P(G < 2) 
using our TI84. The first few steps are similar to before: 


. Press the blue button and then QUAN (which corresponds to the DISTR button). 
This brings up the DISTR menu. 

. Press (2) to select the “normalcdf” option. 

. Enter the lower bound -10” by pressing [(-)|, the blue button, EE (which corre- 
sponds to the g button), and then (9) (9). (Don’t press yet!) 

. Enter the upper bound 2 by pressing g and (2). (Don’t press yet!!). 


Step 1. Step 2. Step 3. Step 4. 


Previously, we didn’t bother telling the TI84 our mean yp and standard deviation co. 
And so by default, if we pressed (2MIN38) at this point, the TI84 simply assumed that 
we wanted the SNRV Z ~ N(0,1). Now we’ll tell the TI84 what p and o are: 


5. First enter the mean p= —1. Press g (U). 


6. Now enter the standard deviation o = V0.1 (and not the variance). Press g \/ (0) 
e p. Finally, press (NUH. The TI84 says that P(G < 2) #1. 


Step 5. Step 6. 


Finding P(H <2), P(I <2), P(-1<G<1), P(-1< HAH <1), and P(-1<J <1) is similar: 


PCH <2) ond PU <2) P(-1<G<1) P(-1<H<1) P(-1<IJ<1) 


Since J has mean ps = 2, we should have exactly P (J <2) = 0.5. So here the TI84 has 
actually made a small error in reporting instead that P (J < 2) » 0.5000000005. 
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Example 1521. We now redo the previous two examples, but use Z-tables: 





=I 
- 04868] = © (9.4868) » 1, 


x 0071] = ® (0.7071) * 0.7601, 


att - 2-2-0) -0(0)=05, 


-1-(-1) 7 1-0), 
Tei <Z< Toi x 0306) 
= © (6.3246) - ®(0)  1- ®(0) = 0.5. 


P(-l<A<1)=P (-111 wi 


= ©(0) - ®(-1.4142) x 0.5 - [1 - 6(1.4142)] 
= 6(1.4142) - 0.5 x 0.9213 - 0.5 = 0.4213, 


i cs 
Pil <i <ijeP (-1.7321 x Se es -0.5774] 
V3 ve 
= 6(-0.5774) - ®(-1.7321) = 1 - 6(0.5774) — [1 - ®(1.7321)] 


w 0.9584 — 0.7182 = 0.2402. 


Exercise 501. Let X ~ N(2.14,5) and Y ~ N(-0.33,2). Using both the Z-tables 
and your graphing calculator, find the following: (a) P(X 21) and P(Y 21). (b) 
P(-2< X <-1.5) and P(-2< Y <-1.5). (Answer on p. 1977.) 
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127.2. Sum of Independent Normal Random Variables 


Theorem 48. Jf X and Y are independent normal random variables, then X +Y is also 


a normal random variable. Moreover, X —Y is also a normal random variable. 








Proof. Omitted. 











We already knew from before that E[X +Y]=E[X]+E[Y]. Moreover, if X and Y are 
independent, then Var|X + Y] = Var[X]+ Var[Y]. Thus, the above theorem implies: 


Corollary 50. Let X ~ N(px, 0%) and Y ~ N(py, 07) be independent and a,b « R 
be constants. Then X +Y ~ N(ux + py, 0% + oF) and more generally, aX + bY ~ 
N(apx + buy, a?o% + Doe 


Moreover, X-Y  ~ N(x — pty 0% + oy) and more generally, aX -bY ~ 
N(apx — by, eae + aes 





Examples: 


1275, Contents www.EconsPhDTutor.com 


Example 1522. The weight (in kg) of a sumo wrestler is modelled by X ~ N (200,50). 
Assume that the weight of each sumo wrestler is independent of the weight of any other 
sumo wrestler. 


We randomly choose two sumo wrestlers. 

(a) What is the probability that their total weight is greater than 405 kg? 

(b) What is the probability that one is more than 10% heavier than that the other? 

(a) Let X; ~ N (200,50) and X> ~ N (200,50) be the weight of the first and second sumo 

wrestler. Then X, + X2~ N (400,100). Thus, 

405 — 400 
Vv 100 


(b) Our goal is to find p = P(X; > 1.1X_) + P(X2>1.1X1). This is the probability that 
the first sumo wrestler is more than 10% heavier than the second, plus the probability 
that the second is more than 10% heavier than the first. Of course, by symmetry, these 
two probabilities are equal. Thus, p = 2 x P (X; > 1.1X2). Now, 


P (Xi + Xe> 405) =P(Z> 


=P(Z>0.5)=1-6(0.5) x 1- 0.6915 = 0.3085. 


P (Xy > 1.1X2) =, (Xx, ry 1.1X5 > 0) ; 


But X1- 1.1X2 ~ N(200- 1.1 - 200, 50+ 1.1?- 50) = N (-20, 110.5). Thus, 


0 - (-20) 
Pi Sid Pt Bie SOV PI Ze 
ee eS ae ( ) 


x P(Z > 1.9026) = 1 - © (1.9026) » 1 - 0.9714 = 0.0286. 





Altogether then, p = 2P (X, > 1.1X2) = 2 x 0.0286 = 0.0572. 
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Example 1523. The weight (in kg) of a caught fish is modelled by X ~ N(1,0.4). The 
weight (in kg) of a caught shrimp is modelled by Y ~ N(0.1,0.1). Assume that the 
weights of any caught fish and shrimp are independent. 


(a) What is the probability that the total weight of 4 caught fish and 50 caught shrimp 
is greater than 10 kg? 


(b) What is the probability that a caught fish weighs more than 9 times as much as a 
caught shrimp? 


(a) Let S be the total weight of 4 caught fish and 50 caught shrimp. Note, importantly, 
that it would be wrong to write S =4X + 50Y, because 4X + 50Y would be 4 times the 
weight of a single caught fish, plus 50 times the weight of a single caught shrimp. 


In contrast, we want Z to be the sum of the weights of 4 independent fish and 50 inde- 
pendent shrimp. Thus, we should instead write S =X, + X9+X3+X4+Y,+Yo+---+Y¥59, 
where 


e X,~N(1,0.4), Xo ~N(1,0.4), X3 ~ N(1,0.4), and X, ~ N(1,0.4) are the weights of 
each caught fish. 

¢ Y; ~ N(0.1,0.1), Yo ~ N(0.1,0.1), ..., and Ys ~ N(0.1,0.1) are the weights of each 
caught shrimp. 

Now, S~N(4x14+50x0.1,4 x 0.4+50x0.1) = N(9,6.6). 


(Note by the way that in contrast, 4X + 50Y ~ N (9,4 x 0.4450? x 0.1) = N (9, 256.4), 
which has a rather different variance!) 


Thus, P (S > 10) » 0.3485 (calculator). 


(b) P(X > 9Y) = P(X -9Y > 0). But X-9Y ~N(1-9x 0.1,0.4+9? x 0.1) =N(0.1,8.5). 
Thus, P (X - 9Y > 0) * 0.5137 (calculator). 


Exercise 502. (Answer on p. 1978.) Water and electricity usage are billed, respectively, 
at $2 per 1,000 litres (1) and $0.30 per kilowatt-hour (kWh). Assume that each month, 
the amount of water used by Ahmad (and his family) at their HDB flat is normally 
distributed with mean 25,000 1 and variance 64,000,000 1°. Similarly, the amount of 
electricity they use is normally distributed with mean 200 kWh and variance 10,000 
kWh’. 

Assume that monthly water usage and electricity usage are independent. 


(a) Find the probability that their total water and electricity utility bill in any given 
month exceeds $100. 


(b) Find the probability that their total water and electricity utility bill in any given year 
exceeds $1,000. 


Suppose instead that electricity usage is billed at $2 per kWh. 


(c) Then what is the maximum value of x, in order for the probability that the total 
utility bill in a given month exceeds $100 is 0.1 or less? 
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127.3. The Central Limit Theorem and The Normal 
Approximation 


Suppose we have n independent random variables, each identically distributed with mean 
uw €R and variance o? € R. Then informally, the Central Limit Theorem (CLT) says that if 
n is “large enough”, then their sum (which is also a random variable) has the approximate 
distribution N (np, no”). Formally: 


Theorem 49. (The Central Limit Theorem.) Let X), X2,...,Xy be random vari- 
ables. Suppose (i) they are independent; and (ti) they are identically distributed, with 
mean € R and variance o7 € R. 


nm 
Then the sum ys = X,+ Xo+---+Xp converges in distribution to N(np, no’). 
i=l 

















Proof. The proof is a little advanced and thus entirely omitted from this book. 


What does it mean for one random variable to “converge in distribution” to another? This 
is a little beyond the scope of the A-Levels, but informally, this means that as n > oo, 
nm 


the random variable a X; becomes “ever more” like the random variable with distribution 
i=l 
N (np, no’). 


One big use of the CLT is this: 


If n is “large enough”, then the sum of n independent, 
identically distributed random variables can be 
approximated by a normal distribution. 


How large is “large enough”? The most common rule-of-thumb is that n > 30 is “large 
enough”, so that’s what we'll use in this book, even though this is somewhat arbitrary. 


Indeed, if the original distribution from which the random variables are drawn are not 
“nice enough”, then n>30 may not be “large enough”. (Informally, a distribution is 
“nice enough” if it is—among other things—fairly symmetric, fairly unimodal, and not 
too skewed.) 








You can safely assume that all distributions you’ll ever encounter in the A-Levels are “nice 
enough”, so that the n > 30 rule-of-thumb works. But whenever you use the CLT normal 
approximation, you should be clear to state that you assume the distribution is “nice 
enough”. 
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Example 1524. Let X be the random variable that is the sum of 100 rolls of a fair 
die. From our earlier work, we know that each die roll has mean 3.5 and variance 35/12. 
Problem: Find P(X > 360) and P(X > 360). 


The CLT says that since n = 100 > 30 is large enough and the distribution is “nice 
enough” (we are assuming this), the random variable X can be approximated by the 
normal random variable Y ~ N (100 x 3.5, 100 x 35/12) = N (350, 3500/12). 


Now, in using Y as an approximation for X, we might be tempted to simply write 


P(X > 360)» P(Y >360) and P(X >360)*« P(Y > 360). 


Note however that X is a discrete random variable, so that P(X > 360) # P(X > 360). 
More specifically, 


P(X > 360) = P(X = 360) + P(X > 360). 


In contrast, Y is a continuous random variable, so that P(Y > 360) = P(Y > 360). Hence, 
if we simply use the approximations P(X > 360) » P(Y > 360) and P(X > 360) » P(Y > 
360), then implicitly we’d be saying that P(X = 360) = 0, which is blatantly false. 


To correct for this, we perform the so-called continuity correction. This says that we'll 
instead use the approximations 


P(X > 360) « P(Y > 359.5) and P(X > 360) * P(Y > 360.5). 


Thus, P(X > 360) « P(Y > 359.5) x 0.2890 (calculator) and P(X > 360) « P(Y > 360.5) x 
0.2693. 


Continuity Correction. If X is a discrete random variable that is to be approximated 
by a continuous random variable Y, then 


* P(X k)xP(Y 2k-05), 
os P(Mah) oP (Yh 0lo) 
ee Glee ra (05) . 
P(X <hr <k 015). 





Note that if the random variable to be approximated is itself continuous, then there is no 
need to perform the continuity correction. This is illustrated in Exercise 504 below. 


Exercise 503. Let X be the random variable that is the sum of 30 rolls of a fair die. 
Find P(100 < X < 110). (Answer on p. 1979.) 


Exercise 504. The weight of each Coco-Pop is independently and identically distributed 


with mean 0.1 g and variance 0.004 g?. A box of Coco-Pops has exactly 5,000 Coco-Pops. 
It is labelled as having a net weight of 500 g. Find the probability that that the actual 
net weight of the Coco-Pops in this box is less than or equal to 499 g. (Answer on p. 
1979.) 
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128. The CLT is Amazing (Optional) 


The Fundamental Theorems of Calculus and the CLT are the most profound and amazing 
results you’ll learn in H2 Maths. This chapter briefly explains why the CLT is so amazing 
and why the normal distribution is ubiquitous. 


128.1. The Normal Distribution in Nature 
The normal distribution is ubiquitous in nature. The classic example is human height.°? 


Example 1525. Below is a histogram of the heights of the 4,060 NBA players who ever 
played in an NBA game (through the end of the 2016 season). (Heights are reported in 
feet and a whole number of inches, where 1 in = 2.54 cm and 1 ft = 12 in, so that 1 ft = 
30.48 cm.) The histogram has 28 bins and (arguably) looks normal (bell-shaped). 


The width of each bin is 1 inch. For example, the red bin says 410 players have had 
reported heights of 6 ft 7 in (approx. 200 cm). 

The 
blue (rightmost) bin is also barely visible and says that only 2 players have had reported 
heights of 7 ft 7 in (approx. 231 cm). The average or mean height is approx. 6 ft 6 in 
(approx. 198 cm). 


NBA Players’ Heights, Histogram 
450 
400 
390 a 
300 
250 
200 


150 
5 ft3 in = 160 em 
0 


ae ns — 
50 


() 


Mean Height ~ 198 em 6 ft 7 in ~ 200 em 


7 ft 7 in = 231 em 








°8°Data: Excel spreadsheet. Source: Basketball-Reference.com (retrieved June 15th, 2016). Caveats: (1) 
For some reason, out of the 4060 players in that database at the time of retrieval, there was exactly 
one player (George Karl) whose height was not listed. WIKIPEDIA lists George Karl’s height as 6 ft 
2 in, so that is what I have used for his height. (2) By NBA, I actually mean the BAA (1946-1949), 
the NBA (1949-present), and the ABA (1967-1976), combined. (3) As is well-known among basketball 
fans, the listed heights of NBA players are not accurate and can sometimes be off by as much as 2 to 3 
inches (5 to 7.5 cm). (See this recent Wall Street Journal article.) 
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Manute Bol (approx 231 cm) and Muggsy Bogues Gaal 160 
cm) were briefly on the same team. (You ghts.) 


An infamous example of the normal distribution concerns human intelligence: 
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Example 1526. The 1994 book The Bell Curve: Intelligence and Class Structure in 
American Life was named after the observation that the Intelligence Quotient (IQ) seems 
to be normally distributed. This observation was neither new nor controversial (though 
some scholars would dispute the usefulness of IQ measures). 


The black and white IQ distributions in the NLSY, Version I 


Frequency distributions for populations of equal size 


Black bes, ‘ White 
distribution sae distribution 


70 80 90 100 110 120 130 140 150 
IQ 


What made the book especially controversial were its claims that intelligence was largely 
heritable and that black Americans had lower intelligence than whites. The figure above 
is taken from p. 279 of the book. It suggests that 

e Black IQ is normally distributed, with a mean of around 80. 

e White IQ is normally distributed, with a mean of around 105. 





Another example—the Galton box: 
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Example 1527. (Galton box.) Small beans are released from the top, through a 
narrow passage. There are numerous pegs that tend to randomly divert the path of the 
beans. 


At the bottom of the box, there are many different narrow slots of equal width, into 
which the beans can drop. The beans will tend to form a bell shape at the bottom. 


Beans just released. Pegs divert beans. Beans fill slots. 


(Source: YouTube.) 





Question: 
Many things in nature seem to be normally distributed. Why? 


We will try to answer this question, but only after we’ve illustrated how the Central Limit 
Theorem works. 
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128.2. I[lustrating the Central Limit Theorem (CLT) 


Example 1528. Flip many fair coins. Model each with the Bernoulli random variables 
T1, T>, T3, ..., each with probability of success (heads) 0.5. 

Let X,, = 7, +7) +---+T, ~ B(n,0.5). 

Below are the histograms of the distributions of X1, Xo, ..., and Xg. X, has probability 


0.5 of taking on each of the values 0 and 1. X» has probability 0.5 of taking on the value 
of 1; and probability of 0.25 of taking on each of the values 0 and 2. Etc. 


Distribution of X; = T; i Distribution of 





0 











(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


On this and the next page are the histograms of the distributions of X7, Xs, X9, X10, 
Xo), X39, X49, X50, and Xjo9. Observe that as n grows, the shape of the probability 
distribution of X, looks ever more bell-shaped. This is exactly what the CLT says. 


Distribution of 
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(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


> 


Distribution of 7 Distribution of 


























10 20 30 40 50 


Distribution of 
X100 = 1, + Tz + +++ + T1000 





The CLT says the following: 


1. Draw a sufficiently large number of independent and identical random variables from 
ANY distribution.*° 
2. Add them up to get another random variable S. 


3. The probability distribution of S will look normal. 





°36T should say nearly any distribution. For the classical CLT to apply, the variance must be finite. 
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What makes the CLT particularly amazing is that it works with ANY distribution. 


To illustrate, next up is an example where the original distribution is highly skewed and 
does not look at all bell-curved. Nonetheless, the CLT still works out nicely. 


Example 1529. Flip many biased coins, each with probability 0.9 of heads. Model 
each with the Bernoulli random variables Y|, Y2, Y3, ..., each with probability of success 
(heads) 0.9. 


Let S,, = Yi + Yo+--:+Y, be the number of heads in the first n coin-flips. (By the way, 
Sn ~ B(n,0.9).) 


On this and the next page are the histograms of the distributions of 5, S2,..., and Sjo. 
5S, has probability 0.1 of taking on value 0 and 0.9 of taking on value 1. S» has probability 
0.01 of taking on the value of 0, 0.18 of taking on the value 1, and 0.81 of taking on the 
value 2. S3 has probability 0.001 of taking on the value of 0, 0.036 of taking on the value 
1, 0.486 of taking on the value 2, 0.2916 of taking on the value 3, 0.6561 of taking on the 
value 4. Etc. 


ge 


Distribution o | Distribution of 
i 8, =; L Sp =Y¥,+Y> 








r 


Distribution of 
Sy =Y¥,+Y¥.+¥3+Y%, 


Distribution of 
S3 = Y; + Y, + Y3 











(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Distribution of 
S6 =Y,+Y,+°:-+Y, 


| Distribution of 
[os = 44 gre Ye 
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It certainly does not look like the distribution S, is becoming increasingly bell-curved. 
Well, let’s see. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 
Below are the histograms of the distributions of $9, $39, S49, S59, and Si99. Remarkably 
enough, as n grows, the shape of the probability distribution of S,, looks ever more bell- 


shaped. As promised by the CLT. 


ll Distribution of 
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128.3. Why Are So Many Things Normally Distributed? 
We now return to the question posed earlier: 
Many things in nature seem to be normally distributed. Why? 
This is a deep question. The standard quick answer is this: 


If S is the sum of a very large number of independent random variables, 
then by the CLT S is (approximately) normally distributed. 


Examples to illustrate: 


Example 1530. Assume that human height is entirely determined by 1000 independent 
genes (assume all human beings have these 1000 genes). 


Assume that each of these 1000 genes is associated with an independent random variable 
X1, Xo, ..., Xio00, each identically distributed with mean j.x and variance o%. Assume 
also that human height is simply equal to the sum of these random variables. That is, a 
human being’s height is simply given by H = X1 + Xo +--++ X1000. 


Then the CLT says that since n = 1000 is “large enough”, H will be approximately 
normally distributed, with mean 1000jx and variance 1000c%. Amongst the world’s 7.4 
billion people, there will be some very short people and some very tall people, but most 
people will be near the mean height 1000ix. 


Example 1531. Ah Kow’s Mooncake Factory manufactures mooncakes. Each mooncake 
is supposed to weigh exactly 185 g, if the standard recipe is followed with absolute 
precision. 


However, the exact weight of each mooncake will usually vary, due to myriad factors, 
such as whether the baker was paying attention, how much water the baker added, how 
long the mooncake was left in the oven, the room temperature that day, etc. 


Assume there are 300 independent factors that determine the exact weight of a mooncake. 
Assume that each of these 300 factors is associated with an independent random variable 
Y, Yo, ..., Y300, each identically distributed with mean py and variance 0%. Assume also 
that the weight of a mooncake is simply given by W = Y, + Yo +--- + Y300. 


Then the CLT says that since n = 300 is “large enough”, W will be approximately normally 
distributed, with mean 300yy and variance 3000}. Amongst the millions of mooncakes 
produced, there will be some very light mooncakes and some very heavy mooncakes, but 
most mooncakes will be near the mean weight 300Uy. 
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128.4. Don’t Assume That Everything is Normal 


Mathematical modellers often assume that “everything is normal”. There are three justifi- 

cations for this: 

1. We have strong empirical evidence that many things in nature are normally distributed. 

2. We have a strong theoretical reason (the CLT) for why this might be so. 

3. The normal distribution is easy to handle (because e.g. the maths is easy, compared to 
some other distributions). 

However, many things are not normally distributed. It is thus a mistake to assume that 

“everything is normal”. 


One example of a common but non-normal distribution found in nature is the Pareto 
distribution. We'll skip the formal details. Informally, it is called the Pareto Principle or 
the 80-20 Rule and businesspersons say things like: 


“80% of your output is produced by 20% of your employees.” 
“80% of your sales come from 20% of your clients.” 


It is believed the Pareto distribution is a good description of many aspects of human 
performance (though apparently not of height or IQ). By the way, it was named after 
Vilfredo Pareto, who in 1896 found that approximately 80% of the land in Italy was owned 
by 20% of the population. 


Let’s see if the points scored in the NBA resembles the Pareto distribution.°” In particular, 
is it the case that 20% of NBA players have scored 80% of the points? 





°37Source: Basketball-Reference.com. Caveats: (1) The data were retrieved on June 15th, 2016, so the 
points scored are between 1946 and that date. (2) By NBA, I actually mean the BAA (1946-1949), the 
NBA (1949-present), and the ABA (1967-1976), combined. Data: Excel spreadsheet. 
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Example 1532. Below is a histogram of the total points scored by each of 4,060 NBA 
players. Clearly, the total points scored by each player is not normally distributed. 


The histogram has 20 bins of equal width. The leftmost bin says that 2,615 players 
scored 0 to 1919 points. The rightmost bin says that only 2 players scored 36,468 to 
38, 387 points. 


The grand total number of points ever scored in the NBA is 11,565,923. Of which, 
8,424,242 (or 72.8%) were scored by the top 20% (812). So it appears that the 80-20 
Rule is a reasonably good description of the distribution of total points scored by players! 
In contrast, the normal distribution is obviously not a good description. 


Points Scored by NBA Players, Histogram 
3000 
2500 
2000 
1500 
1000 
500 


0 





It’s fairly obvious to anyone who bothers graphing the data that “points scored in the 
NBA” is not normally distributed. There are however instances where this is less obvious. 
One is thus more likely to mistakenly assuming a normal distribution. A famous and tragic 
example of this is given by the financial markets. 
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Example 1533. The Dow Jones Industrial Average (DJIA) is one of the world’s leading 
stock market indices. It is a weighted average of the share prices of 30 of the largest US 
companies (e.g. Apple, Coca-Cola, McDonald’s). 


Trading starts in the morning and closes in the afternoon (right now, the trading hours 
are 9:30 am to 4 pm). The next graph is of the daily closing values for the past 30 years. 


DJLA (1,000s), Daily Close, 16/06/1986 - 15/06/2016 


Red vertical lines 


indicate first trading 
day of each year. 


Sorcsccom! 
El al ae sade 
pope RE ep 
[ea ee =, dane 


of 


1987 991 1995 1999 


Ae 


BRRR EEE 
Ge Pe ce 


Let q; be the % change in closing value on day 7, as compared to day i-1. For example, 
on June 14th, 2016, the DJIA closed at 17,674.82. On June 15th, 2016, it closed at 
17,640.17, 34.65 points lower than the previous day’s close. Thus, 


—34.65 
—_—_—_—___—_—_. 2 
420160615 = 17,074. ou —0.20%. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


The graph here is of g, on 36,044 consecutive trading days (over 131 years). In black 
are those days when the DJIA rose; in red are when it fell. 


Can you spot the single largest one-day fall in the DJIA? (We’ll talk about this singular 
day shortly.) 


% Daily Change in DJLA (q), 17/02/1885 - 15/06/2016 


16% 
Green vertical lines 
12% mark the first trading 
day of years ending in 0. 
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(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


The graph here is also of g, but in the form of a histogram. Each bin has width 0.1% 
(except the leftmost and rightmost bins). For example, on 2,204 days (out of 36,044), 
q € (-0.1%, 0%] (the DJIA fell by between 0.1% and 0%). 


On 78 days, g < -5% (the DJIA fell by more than 5%). On 70 days, g > 5% (the DJIA 
rose by more than 5%). 


Histogram of % Daily Change in DJLA (q), 1885-2016 


2,500 a 
Each bin is (x, x + 0.1%], except 


On 2,204 days (out of 36,044), for the left- and right-most bins. 


— 0.01 < qs 0%. 
2,000 


1,500 


1,000 


On 78 days (out of On 70 days (out of 
36,044), q < —5%. 36,044), g > 5%. 
500 


» 


0 


It seems reasonable to say that q is normally distributed (at least if we ignore the leftmost 
and rightmost bins). 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


The sample mean and standard deviation (from the 36,044 observations) are pu » 0.023% 
and o x 1.064%. So let’s suppose q were normally distributed with mean jz and variance 
2 

O: 

If so, then we’d predict (as per the properties of the normal distribution) that 

1. 0.6827 of the time, gq is within 1 standard deviation of the mean, i.e. 
(-1.04%, 1.09%). 

2. 0.954 of the time, gq is within 2 standard deviations of the mean, i.e. 
(-2.10%, 2.15%). 

As it actually turned out, 

1. 0.7965 of the time (28, 709 out of 36,044 days), q was within 1 standard deviation of 
the mean. (A bit off, but not too bad.) 

2. 0.9536 of the time (34, 373 out of 36,044 days), q was within 2 standard deviations of 
the mean. (Almost exactly correct!) 


In addition to the above “evidence”, we might make the following theoretical argument: 
Share prices are affected by a myriad random and arguably independent factors. Hence, 
by the CLT, we’d expect share prices (and thus q as well) to be normally distributed. 


Unfortunately, modelling g as a normal random variable would be a disastrous mistake, 

especially when it comes to predicting rare events. If q is indeed normally distributed, 

then we’d predict that the DJIA rises or falls by more than ... 

1. ... 5% less than once every 2,000 years. 

2. ... 7% less than once every 100 million years. 

3. ... 10% less than once every 480,000, 000 billion years. (For comparison, the universe 
is estimated to be 13.8 billion years old.) 


And so, during the 131 years for which we have data, it should have been very unlikely 
that the DJIA ever rose or fell by more than 5%. 


But as it actually turned out, during these 131 years, the DJIA rose or fell by more than 


1. ... 5% 148 times. 
2. ... 7% AO times. 
3. ... 10% 10 times. 





(Data: Excel spreadsheet.) 
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129. Statistics: Introduction (Optional) 


129.1. Probability vs. Statistics 





Probability | Statistics 
Given a known model, what can we Given observed data, what can 
say about the data we'll observe? we say about the model? 


Example 1534. Let p be the probability of a coin-flip turning up heads. (p is an example 
of a parameter.) Flip a coin thrice. 


Probability question: “Suppose we know that p = 1/2. Then what can we say about 
the probability of observing three heads (i.e. P(HHH))?” (For most probabilists, this 
is a simple question with a straightforward answer: P (H HH) = 1/8.) 


Statistics question: “Suppose we observe HHH. Then what can we say about p?” 
(Different statisticians will give different answers.) 


In the real world, we will almost never know what p “truly” is. Instead, we usually only 
have some limited data observations (such as observing HHH). 


Probability is about making heroic assumptions about what p is, in order to draw infer- 
ences about what the observed data will look like. 


In contrast, statistics is about using limited, observed data to draw statistical infer- 
ences about the model and its parameters. 
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129.2. Objectivists vs Subjectivists 


Example 1535. Ann and Bob are two infinitely intelligent persons. Ann believes that 
the probability of rain tomorrow is 0.2 and Bob believes that it is 0.6. 


e Objectivist view: There is some single, “correct” probability p of rain tomorrow. 
Perhaps no one (except some Supreme Being up above) will ever know what exactly p 
is. But in any case, we can say that exactly one of the following must be true: 


. Ann is correct (and Bob is wrong); 


. Bob is correct (and Ann is wrong); or 


. Both Ann and Bob are wrong. 


Subjectivist view: A probability is not some objective, rational thing that exists 
outside the mind of any human being. There is no “correct” probability. Instead, a 
probability is merely 


the degree of belief in the occurrence of an event attributed by a given person at 
a given instant and with a given set of information. (De Finetti, infra, pp. 3-4.) 


Thus, Ann and Bob can legitimately disagree about the probability of rain tomorrow, 
without either being wrong. After all, the numbers 0.2 and 0.6 are merely their personal, 
subjective degrees of belief in the likelihood of rain tomorrow. 





Bruno de Finetti(1906—1985) was perhaps the most famous and extreme subjectivist ever. 
In the preface to his Theory of Probability (1970)?**, he wrote: 


My thesis, paradoxically, and a little provocatively, but nonetheless gen- 
uinely, is simply this: 


PROBABILITY DOES NOT EXIST 


The abandonment of superstitious beliefs about the existence of Phlogiston, 
the Cosmic Ether, Absolute Space and Time, ..., or Fairies and Witches 
was an essential step along the road to scientific thinking. Probability, too, 
if regarded as something endowed with some kind of objective existence, ts 
no less a misleading misconception, an illusory attempt to exteriorize or 
materialize our true probabilistic beliefs. 


In this textbook (and also in H2 Maths), we will be strict objectivists. The main practical 
implications of being an objectivist are illustrated in the following examples: 





38 Originally published in 1970 in Italian as Teoria delle probabilité. The link is to a recent 2017 English 
edition. 
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Example 1536. Judge Ann says the murder suspect is probably innocent. Judge Bob 
says the suspect is probably guilty. 


Objectivist interpretation: Ann and Bob cannot both be correct. The suspect is 
either innocent (with probability 1) or guilty (with probability 1). 


In fact, we can go even further and say that both Ann and Bob are talking nonsense. It is 
nonsensical to say things like the suspect is “probably” innocent (or “probably” guilty), 
because the suspect either is innocent or not. 


Subjectivist interpretation: Ann and Bob are perfectly well-entitled to their beliefs. 


Moreover, it is perfectly meaningful to say things like the suspect is “probably” innocent 
(or “probably” guilty). Ann and Bob do not know for sure whether the suspect is innocent 
or guilty. They are thereby perfectly well-entitled to speak probabilistically about the 
innocence or guilt of the suspect. 


Example 1537. We flip a coin 100 times and get 100 heads. 


Given these observed data (100 heads out of 100 flips), what can we say (what statistical 
inference can we make) about whether or not the coin is fair? 


Subjectivist answer: The coin is probably not fair. (This is perhaps the answer that 
most laypersons would give.) 


Objectivist answer: The coin either is fair (with probability 1) or isn’t fair (with 
probability 1). Subjectivist statements like the coin is “probably” not fair are nonsensical. 





Most untrained laypersons are innately subjectivist. Yet in this book (and also for the 
A-Levels), you'll be trained to think like strict objectivists. 


Note though that it is not the case that one school of thought is correct and the other 
wrong. Both the objectivist and subjectivist schools of thought have merit. The growing 
consensus amongst statisticians is to take the best of both worlds. 


Nonetheless, in this textbook, we learn only the objectivist interpretation. Not because it 
is necessarily superior, but rather because 


1. The maths is easier. 


2. Tradition: For most of the 20th century, the objectivist interpretation was favoured. 
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130. Sampling 


130.1. Population 


Definition 256. A population is any ordered set (i.e. vector) of objects we’re interested 


idle 





A population can be finite or infinite. But to keep things simple, we’ll look at examples 
where it is finite. 


Example 1538. The two candidates for the 2016 Bukit Batok SMC By-Election are Dr. 
Chee Soon Juan and PAP Guy. It is the night of the election and voting has just closed. 


Our objects-of-interest are the 23,570 valid ballots cast. (A ballot is simply a piece of 
paper on which a vote is recorded. The words ballot and vote are often used interchange- 
ably.) 


Arrange the ballots in any arbitrary order. Let v; = 1 if the first ballot is in favour of Dr. 
Chee and v,; = 0 otherwise. Similarly and more generally, for any 2 = 2,3,..., 23570, let 
vu; = 1 if the ith ballot is in favour of Dr. Chee and v, = 0 otherwise. 


Our population here is simply the ordered set P = (v1, vo, ...,V23579). So in this example, 
the population is simply an ordered set of 1s and Os. 
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130.2. Population Mean and Population Variance 


The population mean J is simply the average across all population values. The popu- 
lation variance o” is a measure of the variation across all population values. Formally:°” 


Definition 257. Given a finite population P = (v1,v2,...,Ux), the population mean ju 
and population variance o” are defined by 


Se, a er ay BC Ce es 


k k; 


Example 1300 (continued from above). Suppose that of the 23,570 votes, 9,142 
were for Dr. Chee and the remaining against. So the vector (v1, v2,...,V23570) contains 
9,142 1s and 14, 428 Os. 


Then the population mean is 


Uptgt--:+Un 9142x1+14428x0 9142 
p= —~ = = 


SS eee 
n 23970 23570 


In this particular example, the population values are binary (either 0 or 1). And so 
we have a nice alternative interpretation: the population mean is also the population 
proportion. In this case, it is the proportion of the population who voted for Dr. Chee. 
So here the proportion of votes for Dr. Chee is about 0.3879. 


The population variance is 


2 9142 \2 9142 \2 
2 (v1 — 1) + (v2 — p)° ot) 4 9142. ( a Sua) + 14428. (0 — a 
ee ee 
n 23570 


x 0.2374 


As usual, the variance tells us about the degree to which the v;’s vary. Of course, in 
this example, we already know that the v;’s can take on only two values—0 and 1. So the 
variance isn’t terribly interesting or informative in this example. In particular, it doesn’t 
tell us anything more that the population mean didn’t already tell us (indeed, it can be 
shown that in this example, o? = p:- p/”). 








°39TIn the case of an infinite population, the definitions of and o? must be adjusted slightly, but the 
intuition is the same. 
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130.3. Parameter 


Informally, a parameter is some number we're interested in and which may be calculated 
based on the population. 


Example 1300 (continued from above). A parameter we might be interested in 
is the population mean p—this is also the proportion of votes in favour of Dr. Chee. 
(Another parameter we might be interested in is the population variance 07, but let’s 
ignore that for now.) 


Voting has just closed. In a few hours’ time (after the vote-counting is done), we will 
know what exactly p is. But right now, we still don’t know what p is. 


Suppose we are impatient and want to know right away what yw might be. In other 
words, suppose we want to get an estimate of the true value of py. What are some 
possible methods of getting a quick estimate of ju? 


One possibility is to observe a random sample of 100 votes and count the proportion of 
these 100 votes that are in favour of Dr. Chee. So for example, say we do this and observe 
that 39 out of the 100 votes are for Dr. Chee. That is, we find that the observed sample 


mean (which in this context can also be called the observed sample proportion) is 
0.39. ‘Then we might conclude: 


Based on this observed random sample of 100 votes, we estimate that p is 0.39. 
The layperson might be content with this. But the statistician digs a little deeper and 
asks questions such as: 


¢ How do we know if this estimate is “good”? 


e What are the criteria to determine whether an estimate is “good”? 


We'll now try to address, if only to a limited extent, these questions. But to do so, we 
must first precisely define terms like sample and estimate. 
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130.4. Distribution of a Population 


Informally,”? the distribution of a population tells us 


1. The range of possible values taken on by the objects in the population; and 
2. The proportion of the population that takes on each possible value. 
Example 1300 (continued from above). The population is P = (v1, v2,...,V23570), 


the ordered set of 23570 ballots. Suppose that of these, 9,142 are votes for Dr. Chee 


(hence recorded as 1s) and the remaining 14,428 are for PAP Guy (hence recorded as 
Os). 


Then the distribution of the population can informally be described in words as: 


e A proportion 9142/23570 of the population are 1s, and 
¢ A proportion 14428/23570 of the population are Os. 


Example 1539. The population is P = (3,4, 7,7, 2,3). 


Then the distribution of the population can informally be described in words as: 


¢ A proportion 1/6 of the population are 2s; 
¢ A proportion 2/6 of the population are 3s; 
¢ A proportion 1/6 of the population are 4s; and 
e A proportion 2/6 of the population are 7s. 








*40Formally, we’d define the population distribution as a function. Indeed, some writers define the popu- 
lation itself as the distribution function. 
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130.5. A Random Sample 


Informally, to observe a random sample of size n, we follow this procedure: Imagine the 
23,570 ballots are in a single big bag. 


1. Randomly pull out one ballot. Record the vote (either we write x; = 1, if the vote was 
for Dr. Chee, or we write x, = 0, if it wasn’t). 

2. Put this ballot back in (this second step is why we call it sampling with replacement). 

3. Repeat the above n times in total, so as to record down the values of 71, %9, ..., Lp. 


We call (21,%2,...,%,) an observed random sample of size n. Note that this is an 
ordered set (or vector) of numbers. Formally: 


Definition 258. Let P be a population. Then the random vector (i.e. ordered set of 
random variables) (X1, X2,...,Xn) is a random sample of size n from the population P 


if 


..., Xp are independent; and 


..., Xn are identically distributed, with the same distribution as P. 





As always, we must be careful to distinguish between a function and a value taken on by 
the function. This table summarises. 








Function Value taken by the function 
f is a function f (x) is a possible value taken on by the function 
X is a random variable x is a possible observed value of the random variable 
(X,, Xo,...,X,) is arandom sample | (21, %2,...,%p) is a possible observed random sample 


An example to illustrate: 
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Example 1300 (continued from above). To repeat, the distribution of the population 
P = (v1, V2,---, 023570) can informally be described in words as: 


e 9142/23570 of the population were 1s; and 
e 14428/23570 of the population were Os. 


Let X,, X2, and X3 be independent random variables, each with the same distribution 
as the population. That is, for each 7 = 1, 2,3, 


14428 9142 


P(X;=0) =——— and P(X;=1)=——. 
23570 23570 


The ordered set (or vector) (X), X29, X3) is a random sample of size 3. 


An example of an observed random sample of size 3 might be (71, x2, 73) = (1, 1,0)— 
this would be where we randomly sample 3 ballots (with replacement) and find that the 
first two are votes for Dr. Chee but the third is not. 


Another example of an observed random sample of size 3 might be (21, x2, 73) = (0,0,0)— 
this would be where we randomly sample 3 ballots (with replacement) and find that none 
of the three are for Dr. Chee. 


As another example, (X1, X2,X3, X4, X5) is a random sample of size 5. 


An example of an observed random sample of size 5 might be (21, 22,73,24,25) = 
(0,1,0,1,0)—this would be where we randomly sample 5 ballots (with replacement) and 
find that only the second and fourth are votes for Dr. Chee. 


Another example of an observed random sample of size 5 might be (21, 22,13, 4,25) = 
(1,1,0,1,1)—this would be where we randomly sample 5 ballots (with replacement) and 
find that only the third is not a vote for Dr. Chee. 





In this textbook, we’ll be very careful to distinguish between a random sample (which is 
a vector of random variables) and an observed random sample (which is a vector of real 
numbers). 


This may be contrary to the practice of your teachers or indeed even the A-Level exams. 
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130.6. Sample Mean and Sample Variance 


Definition 259. Let (X), Xo,...,X,) be a random sample of size n. Then the corre- 
sponding sample mean X and the sample variance S? are the random variables defined 











(The List of Formulae (MF26) will contain the observed sample variance.) 


Note that strangely enough, the denominator of S? is n—1, rather than n as one might 
expect. As we’ll see later, there is a good reason for this. 


By the way, there are two other formulae for calculating the sample variance: 


Fact 244. Let S = (X1, Xo,...,Xn) be a random sample of size n. Let X be the sample 
mean and S? be the sample variance. Let a€ R be a constant. Then 


n a ‘ale n zy i4(Xi-a a 
es a fs : = and {b) S*= es) eee (Sus Z zs : 


(ure = 


n-1 n-1 





(The List of Formulae (MF26) has a but not b.) 





Proof. Optional, see p. 1738 (Appendices). 
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Once again, it is important to distinguish between 


¢ The sample mean X (a random variable) vs. the observed sample mean 7Z (a real 
number). 


¢ The sample variance S? (a random variable) vs. the observed sample variance s” 


(a real number). 


Example 1300 (continued from above). Let (X),X2,X3) be a random sample of 
size 3. The corresponding sample mean X and sample variance S? are these random 
variables: 


pit ket Xs gp Cae oerece one ce 4: 


3 3-1 
Suppose our observed random sample of size 3 is (1,0,0). Then the corresponding ob- 


served sample mean Z and observed sample variance s” are these real numbers: 


a se 
‘ n oe eee: 


pee (a - B)" + (wo - 8) + (ag - 2) . Us 
ieee | 


Let (X1, X29, X3, X14, X5) be a random sample of size 5. The corresponding sample mean 
X and sample variance S” are these random variables: 


GE G2. Ge Gee a (Gp ae eee (ee) 


5 ta: 


X= 


Suppose our observed random sample of size 5 is (0,1,0,0,1). Then the corresponding 
observed sample mean Z and observed sample variance s” are these real numbers: 


CS ase ae 


oy n 5 O 





go _ (0178) + (wa 8) + (as ~ 8)" + (a-F)" + (5-8) 


(0-})"+(0- 
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We call a random variable an estimator if it is used to generate estimates (“guesses”) 
for some parameter. Example: 


Example 1300 (continued from above). It is the night of the election and polling 
has just closed. We still do not know the true proportion p that voted for Dr. Chee. 


We decide to get a random sample of size 3: (X,X2,X3). The corresponding sample 
mean X3 = (X; + Xj +-X3)/3 shall be an estimator for y. (Informally, an estimator is 
a method for generating “guesses” for some unknown parameter, in this case ju.) 


This estimator is used to generate estimates (“guesses”) for yz. For every observed 
random sample, the estimator generates an estimate. 


Suppose our observed random sample of size 3 is (1,0,0). We calculate the corresponding 
observed sample mean to be % = 1/3. We say that z = 1/3 is an estimate for wp. 


(By the way, unless we are extremely lucky, it is highly unlikely that the true value of 
the unknown parameter p is precisely 1/3. After all, 1/3 is merely an estimate obtained 
from a single observed random sample of size 3.) 


Suppose instead that our observed random sample of size 3 were (0,1,1). Then the 
corresponding observed sample mean would be x = 2/3. We’d instead say that x = 2/3 is 
our estimate for ju. 


There is also more than one estimator we can use. For example, suppose instead that 
we decide to get a random sample of size 5: (X1, X2, X3, X4, X;). We shall instead use 
the corresponding sample mean X = (X, + Xj + X3+X4+X5)/5 as our estimator for ju. 
And so for example suppose our observed random sample of size 5 is is (0, 1,0,0,1). Then 
the corresponding observed sample mean x = 0.4 and x = 0.4 would be our estimate for 
[L. 

Now, are these estimators and estimates “good” or “reliable”? How much should we 
trust them? These are questions that we’ll address in the next section. 
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A different example: 


Example 1540. Suppose we wish to find the average height ju (in cm) of an adult male. 


As a practical matter, it would be quite difficult to locate and record the height of every 
adult male in the world. So instead, what we might do is to randomly pick 4 adult males 
and record their heights. This gives us a random sample (H,, H2, H3, Hy) of heights. The 
corresponding sample mean is the random variable H = (H, + Hy + H3+H4)/4. H shall 
serve as our estimator for ju. 


Suppose our observed random sample is (hy, h2, h3, ha) = (178, 165, 182, 175). 


Then the corresponding observed sample mean is 





hythothgt+hg 178+ 165+182+175 
n 7 4 


h = = Wee 
Thus, h = 175 serves as an estimate (or “guess”) of the true average male height pu. 


Again, are the estimator H and estimate h = 175 “good” or “reliable”? How much 
should we trust them? These are questions that we’ll address in the next section. 
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Example 1541. Let X be the random variable that is the height (in cm) of an adult 
female Singaporean. Our parameters-of-interest are the true population mean p and true 
population variance a? of X. We wish to generate estimates for ps and 0”. 


To this end, we get a random sample of size 8: (X 1, Xo,...,Xg). The corresponding 
sample mean X = (X, + X2+---+ Xg)/8 will serve as our estimator for y. And the 


ore 
corresponding sample variance S? = }*(X;-X) /(8-1) will serve as our estimator for 
cme = 


(a) Suppose our observed random sample is such that 


8 8 
ya; = 1,320) and = yz, > 218, 360. 


i=1 a 


Then the observed sample mean Z and the observed sample variance s? are 


eee 
aa 

n 8 
_ Diop = Se _ 218360 - 


= 80. 
n-1 ff 


And our estimates for and o” are, respectively, 165 cm and 80 cm?. 


(b) Suppose instead our observed random sample is such that 


8 8 
> (#:-160)=72 and > (a;- 160)" = 1,560. 


t=1 i=1 


Then the observed sample mean Z and the observed sample variance s” are 


= Rae TOO TOO) _ Ee 10) 160 = 2 5 160 = 1689, 
nr 


nr 


"| (a; — 160)? - Zevert60 1 569 — 2 
go = Deen (> 160) ss Sy 130.3. 
| ¢ 





And our estimates for 1 and o? are, respectively, 169 cm and 130.3 cm?. 
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130.7. Sample Mean and Sample Variance are Unbiased Estimators 


Earlier we asked: How do we decide if an estimator and the estimates it generates are 
“good”? How do we know whether to trust any given estimate? 


For H2 Maths, we’ll learn only about one (important) criterion for deciding whether an 
estimator is “good”. This is unbiasedness. Informally, an estimator is unbiased if on 
average, the estimator “gets it right”. Formally: 


Definition 260. Let X be a random variable and 6 € R be a parameter (i.e. just some 
real number). We say that X is an unbiased estimator for @ if 


Eix|=é. 


If x is an estimate generated by an unbiased estimator X, then we call x an unbiased 
estimate. 





The next proposition says that the sample mean X is an unbiased estimator for the 
population mean ju; and the sample variance S” is an unbiased estimator for the 
population variance o”. 


Proposition 24. Let (X1,X2,...,Xn) be a random sample of size n drawn from a dis- 
tribution with population mean p and population variance a7. Let X be the sample mean 
and S” be the sample variance. Then 


(a) E|X| =p. And 
(Oe Soe 





Proof. You are asked to prove (a) in Exercise 508. For the proof of (b), see p. 1739 
(Appendices) (optional). 














Proposition 24(b) is the reason why, strangely enough, we define the sample variance with 
n-—1 in the denominator: 

=42 =42 =\2 

gt (X,-X) + (XQ-X) +--+ (X,-X) | 


n-1 


As defined, S? is an unbiased estimator for the population variance 07. This, then, is the 
reason why we define it like this. 


Some writers call S? the unbiased sample variance, but we shall not bother doing so. We’ll 
simply call S? the sample variance. 
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Example 1538 (continued from above). (Chee Soon Juan election.) 


Suppose two observed random samples of size 3 are (21, 72,73) = (1,0,0) and (21, 72, 73) = 
(1,0,1). The corresponding observed sample means are X; = 1/3 and £2 = 2/3. These are 
two possible estimates (“guesses”) of the true sample proportion ju. 


Unless we’re extremely lucky, it’s unlikely that either of these two estimates is exactly 
correct. Nonetheless, what the above unbiasedness proposition tells us is this: 


Suppose the unknown population mean is pz = 0.39. We draw the following 10 observed 
random samples of size 3 (table below). For each sample 7, we calculate the corresponding 


observed sample mean 7Z;. 


Sample 7 


a 


8 
wo 


8 
ws 


Lj 





1 


2 
3 
4 
5 
6 
i 
8 
J 


—_ 
S| 





See) sa: (Sea Ss ea ea 


— Sea ea oS S| SS FS SoS S&S 


Coreooqo9qoereoeod e& 


2/3 
0 
2/3 
1/3 
2/3 
1/3 
0 
0 
1/3 
2/3 


Note that every estimate Z; is wrong. Indeed, since the sample mean X; can only take 
on values 0, 1/3, 2/3, or 1, the estimates can never possibly be equal to the true pz = 0.39. 


Nonetheless, what the above proposition says informally is that on average, the estimate 


gets it correct. Formally, E [x | =p=0.a9. 


For a demonstration that you can play around with, try this Google spreadsheet. 
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Exercise 507. (Answer on p. 1980.) We are interested in the weight (in kg) of Singa- 
poreans. We have an observed random sample of size 5: (32,88, 67, 75, 56). 


(a) Find unbiased estimates for the population mean jp and variance o? of the weights of 
Singaporeans. (State any assumptions you make.) 


(b) What is the average weight of a Singaporean? 


Exercise 508. Prove that E [x | =p. (This is part (a) of Proposition 24). (Answer on 
p. 1981.) 


Exercise 509. Suppose we flip a coin 10 times. The first 7 flips are heads and the next 
3 are tails. Let 1 denote heads and 0 denote tails. (Answer on p. 1981.) 


(a) Write down, in formal notation, our observed random sample, the observed sample 
mean, and observed sample variance. 


(b) Are these observed sample mean and variance unbiased estimates for the true popu- 
lation mean and variance? 





(c) Can we conclude that this a biased coin (i.e. the true population mean is not 0.5)? 
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130.8. The Sample Mean is a Random Variable 


This section is just to repeat, stress, and emphasise that the sample mean X is itself a 
random variable. This is an important point. 


Indeed, the sample mean_X is both (i) a random variable; and (ii) an estimator. In 
contrast, an observed sample mean Z is both (i) a real number; and (ii) an estimate. 


We’ve showed that E [x | =p. This equation can be interpreted in two equivalent ways: 


e The expected value of the sample mean equals the population mean 1. 
e The sample mean is an unbiased estimator for the population mean ju. 
We now give the variance of the sample mean. It turns out to be equal to the population 


variance o”, divided by the sample size n. 


= o2 
Fact 245. Var|[X | = = 











Proof. You are asked to prove this fact in Exercise 510 . 





= 1 
Exercise 510. Prove Fact 245. (Hint: Note that X = —(X,+ X9+---+X,) and X1, Xo, 
n 
..., Xn are independent.) (Answer on p. 1981.) 
Exercise 511. For each of the following terms, give a formal definition and an intuitive 
explanation. (State whether each term is a random variable or a real number.) For 


simplicity, you may assume that the finite population is given by P = (1,%9,...,2x). 
(Answer on p. 1982.) 


) The population mean. 
b) The population variance. 
c) The sample mean. 


d) The sample variance. 


f) The variance of the sample mean. 
g) The mean of the sample variance. 


h) The observed sample mean. 





( 
( 
( 
( 
(ce) The mean of the sample mean. 
( 
( 
( 
( 


i) The observed sample variance. 
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130.9. The Distribution of the Sample Mean 


Fact 246. Let X,Xo,...,Xn~ N(n, o”) be independent random variables. Then 


X= Sey (u,Z). 
n n 





Proof. Corollary 50 tells us that the sum of normal random variables is itself a normal 
random variable. So X, + X9+---+ X,, is a normal random variable. 


Fact 242 tells us that a linear transformation of a normal random variable is itself a normal 
random variable. So X;, = (X, + X9+-:-+X,,)/n is a normal random variable. 


In the previous sections, we already showed that X,, has mean p and variance o?/n. 














_ 2 

Altogether then, X, ~ N (1. =). 

n 

Now, suppose instead X1, X9,...,X, are not normally distributed. Surprisingly, a similar 

result still holds, thanks to the CLT. Informally, draw X1, Xo,...,X» from any distribution. 

Then thanks to the CLT, it will still be the case that—provided n is “large enough”—X,, 
is (approximately) normally distributed. Formally: 


Fact 247. Let X1,Xo,...,Xn be independent random variables, each identically dis- 
tributed with mean fr € R and variance o2 €R. Let 


oe iL 1m AO) ar +h 
n 


S o2 
Then lin x, ~ n(n, =). 
n— co n 





Proof. The CLT says that if n is “large enough”, then X, +X )+---+X~, is well-approximated 
by the normal distribution N (np, no’). 


And so it follows from Fact 242 (a linear transformation of a normal random variable is 
itself a normal random variable) that X = (X,+ X2+-:-+X,,) /n is well-approximated by 














2 
the normal distribution N (1. =). 
n 


In the next chapter, we’ll make greater use of the two results given in this section. 
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130.10. Non-Random Samples 
Some examples to illustrate the concept of a non-random sample: 


Example 1542. Suppose we’re interested in the average height of a Singaporean. The 
only way to know this for sure is to survey every single Singaporean. This, however, is 
not practical. 


Instead, we have only the resources to survey 100 individuals. We decide to go to a 
basketball court and measure the heights of 100 people there. We thereby gather an 
observed sample of size 100: (21,%2,...,100). We find that the average individual’s 
height is = 7 2,;/100 = 179 cm. 


Is x = 179 cm an unbiased estimate of the average Singaporean’s height? Intuitively, we 
know that the answer is obviously no. 


The reason is that our observed sample of size 100 was non-random. We picked a basket- 
ball court, where the individuals are overwhelmingly (i) male; and (ii) taller than average. 
Our estimate x = 179 cm is thus probably biased upwards. 


Example 1543. Suppose we’re interested in what the average Singaporean family spends 
on food each month. The only way to know this for sure is to survey every single family 
in Singapore. This, however, is not practical. 


Instead, we have only the resources to survey 100 families. We decide to go to Sixth 
Avenue and randomly ask 100 families living there what they reckon they spend on food 
each month. We thereby gather an observed sample of size 100: (21,22,...,2199). We 
find that the average family spends Z = )° z;/100 = $2, 700 on food each month. 


Is z = $2,700 an unbiased estimate of the average monthly spending on food by a Singa- 
porean family? Intuitively, we know that the answer is obviously no. 


The reason is that our observed sample of size 100 was non-random. We picked an 
unusually affluent neighbourhood. Our estimate % = $2,700 is thus probably biased 
upwards. 
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131. Null Hypothesis Significance Testing (NHST) 


Here’s a quick sketch of how Null Hypothesis Significance Testing (NHST) works: 


Example 1544. A piece of equipment has probability 6 of breaking down. We have 
many pieces of the same type of equipment. Assume the rates of breakdown across the 
pieces of equipment are identical and independent. 

1. Write down a null hypothesis Ho. In this case, it might be “Ho: 6 = 0.6”. 

2. Write down an alternative hypothesis H4. In this case, it might be “Hy: 0 < 0.6”. 


(This is a one-tailed test—to be explained shortly.) 


3. Observe arandom sample. For example, we might have an observed random sample 
of size 5, where only the fourth piece of equipment breaks down. And so we’d write 
(ie U2,U3,04, 5) Fy (0, 0, 0, I, 0). 


. Write down a test statistic. In this case, an obvious test statistic is the sample 
number of failures T = X, + Xo + X3 + X4+ X5. Our observed test statistic is thus 
t=%,+%.+%3+%4+%5 =0+0+04+1+0=1. 


. Now ask, how likely is it that—if Hp were true—our test statistic would have been “at 
least as extreme as” that actually observed? That is, what is the probability 


P (Observe data as extreme as that observed| Ho)? 


The above probability is called the p-value of the observed sample. 


In this case, the p-value is the probability of observing a random sample where 1 or fewer 
pieces of equipment broke down, assuming Hp: 6 =0.6 were true. That is, 


p=P(T<t=1|Ay). 


Now, remember that 7’ is a random variable. In fact, it’s a binomial random variable. 
Assuming Ho to be true, we have T ~ B(n,@) = B(5,0.6). Thus, 


p=P(T<1|Ho) = P(T=0|Ho) +P(T = 1|Ho) = ( : Joo. + ( ° Josto = 0.08704. 


This says that 7f Hp were true, then the probability of observing a test statistic as extreme 
as the one we actually observed is only 0.08704. We might interpret this relatively small 
p-value as casting doubt on or providing evidence against Ho. 
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Here is the full list of the ingredients that go into NHST. 


Null Hypothesis Significance Testing (NHST) 


. Null hypothesis Hp (e.g. “this equipment has probability 0.6 of breaking down”). 


. Alternative hypothesis Hy, (e.g. “this equipment has probability less than 0.6 of 
breaking down”). The test is either one-tailed or two-tailed, depending on Hy. 


3. A random sample of size n: (X1, X2,...,Xn). 


4. A test statistic T (which simply maps each observed random sample to a real num- 
ber.) 


. The p-value of the observed sample. This is the probability that—assuming Hp were 
true—T' takes on values that are at least “as extreme as” the actual observed test 
statistic ft. 


. The significance level a. This is a pre-selected threshold, usually chosen to be some 
small value. The conventional significance levels are a = 0.1, a = 0.05, or a = 0.01. 


We then conclude qualitatively that 


e A small p-value casts doubt on or provides evidence against Ho. 


e A large p-value fails to cast doubt on or provide evidence against Hp. 


In particular, if p< a, then we say that we reject Ho at the significance level a. And 
if p >a, then we say that we fail to reject Hp at the significance level a. 





Note importantly that to reject Ho (at some significance level a) does NOT mean that Ho 
is false and Hy is true. Similarly, failure to reject Hyp does NOT mean that Hp is true and 
fH, is false. More on this below. 


Another example of NHST, now slightly more formally and carefully presented. 
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Example 1300. (Dr. Chee election example.) Our parameter of interest is ju, the 
proportion of votes for Dr. Chee. We guess that Dr. Chee won only 30% of the votes. 
We might thus write down two competing hypotheses: 


Hy c= 0.8, 
Aas ti oUs3. 


We call Hp the null hypothesis and Hy, the alternative hypothesis. 


We pre-select a = 0.05 as our significance level. This is the arbitrary threshold at which 
we'll say we reject (or fail to reject) Ho. 


We gather a random sample of 100 votes: (X1, X2,..., X190). Our test statistic is the 
number of votes in favour of Dr. Chee, given by 


Ji =. ¢ + Xo +-+++ X09. 
Suppose that in our observed random sample (21,2%2,...,2199), we find that 39 are in 


favour of Dr. Chee. Our observed test statistic is thus t = 39. 


We now ask: What is the probability that—assuming Ho were true—T takes on values 
that are at least “as extreme as” the actual observed test statistic t? That is, what is the 
p-value of the observed sample? 


Now, assuming Hp were true, 7’ is a binomial random variable with parameters 100 and 
0.3. That is, f ~B @,p)= B (100,03). So, 


p=P(T > 39|Hy) = P(T = 39|Ho) + P(T = 40|Ho) +--- + P (T= 100 Ho) 


i O35 07 > mp 03 20:7) Ses oul 0.31.79 ~ 0.03398. 
39 40 100 


The small p-value casts doubt on or provides evidence against Ho. 


And since p * 0.03398 < a = 0.05, we can also say that we reject Hp at the a = 0.05 
significance level. 
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Let @ be the parameter we’re interested in. Under the objectivist interpretation, the value 
of 6 may be unknown, but it is fixed. This has two consequences: 


1. We never speak probabilistically about #, because @ is a fixed number. For example, we 
never say “@ is probably less than 0.6” or “@ has probability 0.8 of being between 0.4 
and 0.7”. Such statements are nonsensical. 


2. The null hypothesis, which is always written as an equality (e.g. “Hp : 6 = 0.6”), is 
almost certainly false. After all, @ can (usually) take on a continuum of values. So do 
NOT interpret “we fail to reject Hj)” to mean “Hp is true”. This is because Ho 
is almost certainly false. 


When performing NHST, we will assiduously avoid saying things like “Hp is true”, “Ho is 
false”, “H 4 is true”, or “Hy is false”. Instead, we will stick strictly to saying either “we 
reject Hop at the significance level a” or “we fail to reject Hop at the significance level a”. 
Each of these two statements has a very precise meaning. The first says that p< a. The 
second says that p >a. Nothing more and nothing less. 
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131.1. One-Tailed vs Two-Tailed Tests 


In the previous section, all the NHST we did were one-tailed tests.™! For example, in 
the NHST done for Dr. Chee, we had 


Ag t= Osa; 

Axi y>0.3. 
This was a one-tailed test because the alternative hypothesis H,4 was that js was to the 
right of 0.3. 


If instead we changed the alternative hypothesis to: 


Hig =a; 

Hatred. 
Then this would be called a two-tailed test, because the alternative hypothesis Hy is that 
jt is either to the left or to the right of 0.3. 


We now repeat the examples done in the previous section, but with H, tweaked so that we 
instead have two-tailed tests. The difference is that the p-value is calculated differently. 





>41By the way, the more common convention is to say “one-tailed” and “two-tailed” tests, rather than 
“one-tail” and “two-tail” tests, as is the norm in Singapore (similar to those “Close for break” signs you 
sometimes see). But after some consultation with my grammatical experts, I have been told that both 
are equally correct. 
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Example 1544 (equipment breakdown). 


Everything is as before, except that we now change the alternative hypothesis: 


He 6 206. 
Had 40.6. 


Say we observe the same random sample as before: (21, %2, 73, 24, 5) = (0,0,0,1,0). 


Again our test statistic is the sample number of failures T =X, + X9+ X3+X4+X5. And 
so again our observed test statistic is f= 2%, +%9+%3+%4+%5;=0+0+0+1+0=1. 


The difference now is how the p-value (of the observed sample) is calculated. In words, 
the p-value gives the likelihood that our test statistic is “at least as extreme as” that 
actually observed—assuming Ho were true. 


Previously, under a one-tailed test, we interpreted “our test statistic is at least as extreme 
as that actually observed” to mean the event 7’ < t= 1. 


Now that we’re doing a two-tailed test, we’ll instead interpret the same phrase to mean 
both the event T < t = 1 and the event that T is as far away on the other side of 


BIT |Ho| = 3. The second event is, specifically, JT > 5. Altogether then, the p-value is 
given by 


(7 <1,f> 5| Ho ) 


(@ s 0| Ho) +P (7 = 1 Ho) +P (7 = 5| Ho) 


p=P 
= |e 
5 On 45 5 ee 5 Pe 
= ; 0.6°0.4° + ; 0.6°0.47* + 0.6°0.4* = 0.1648. 


Since p = 0.1648 > a = 0.1, we say that we fail to reject Hp at the a = 0.1 significance 
level. 


Observe that previously, under the one-tailed test, we could reject Ho at the a = 0.1 
significance level, because there p = 0.08704. Now, in contrast, under the two-tailed test, 
we fail to reject Ho at the same significance level. 





In general, all else equal, the p-value for an observed random sample is greater under a 
two-tailed test than under a one-tailed test. ‘Thus, under a two-tailed test, we are less 
likely to reject Ao. 
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Example 1300 (Dr. Chee election). We change the alternative hypothesis: 


dij — ho 
At U3, 


Say we observe the same random sample as before: (21, 22,...,2199), in which 39 votes 
were in favour of Dr. Chee. So again our observed test statistic is t = 71+%9+---+2X 199 = 39. 


The difference now is how the p-value (of the observed sample) is calculated. In words, 
the p-value gives the likelihood that our test statistic is “at least as extreme as” that 
actually observed—assuming Ho were true. 


Previously, under a one-tailed test, we interpreted “our test statistic is at least as extreme 
as that actually observed” to mean the event T' >t = 39. 


Now that we’re doing a two-tailed test, we’ll instead interpret the same phrase to mean 
both the event 7’ > t = 39 and the event that T’ is as far away on the other side of 


E [P| Ho] = 30. The second event is, specifically, T < 21. Altogether then, the p-value is 
given by 


p=P(T<21,T > 39|Ho) = 1-P(22<T < 38|Ho) 


=1-[P(T = 22|Ho) +P (T = 23|Ho) +---+P(T = 38|Ao)| 


t= oul 0370.7" 4 ull 02330 Se pull 0.3°°0.7°7 | ~ 0.06281. 
22 23 38 


Since p = 0.06281 > a@ = 0.05, we say that we fail to reject Hp at the a = 0.05 


significance level. 


Again observe that previously, under the one-tailed test, we could reject Ho at the a = 0.05 
significance level, because there p = 0.03398. Now, in contrast, under the two-tailed test, 
we fail to reject Ho at the same significance level. 
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131.2. The Abuse of NHST (Optional) 


NHST is popular because it gives a simplistic, formulaic cookbook procedure. Moreover, 
its conclusion appears to be binary: either we reject Ho or we fail to reject Ho. 


However, NHST is widely misunderstood, misinterpreted, and misused even within scientific 
communities. It has long been heavily criticised. In March 2016, the American Statistical 
Association even issued an official policy statement on how NHST should be used! 


Here I discuss only the most important, commonly made error. 


We may write the p-value as 
p=P(D|Ab), 


where D stands for the observed data and Hp stands for the null hypothesis. The p-value 
answers the following question:—assuming Ho were true, what’s the probability that we’d 
get data “at least as extreme” as those actually observed (D)? 


Say we get a p-value of 0.03. We should then say simply that 


e The small p-value casts doubt on or provides evidence against Ho. 


e If the pre-selected significance level was a = 0.05, then we may say that we reject Ho 
at the 5% significance level. 


However, instead of merely saying the above, some researchers may instead conclude that 
Ho is true with probability 0.03. 


Do you see the error here? The researcher has gone from the finding that p = P (D|Ho) = 0.03 
to the conclusion that P (Ho|D) = 0.03. This is precisely the Conditional Probability Fallacy 
(CPF), which we discussed at length in subsection 116.1. 


The error is the same as leaping from “A lottery ticket buyer who doesn’t cheat has a small 
probability q of winning” to “Jane bought a lottery ticket and won. Therefore, there is only 
probability q that she didn’t cheat.” 


The p-value is NOT the probability that Hp is true.°” Instead, it is the probability that— 
assuming Ho were true—we would have gotten data “at least as extreme” as those actually 
observed. This is an important difference. But it is also a subtle one, which is why even 
researchers get confused. 





°42Tndeed, under the objectivist view, such a statement is nonsensical anyway, because Hp is either true 
or not true; it makes no sense to talk probabilistically about whether Ho is true. 
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131.3. Common Misinterpretations of the Margin of Error 
(Optional) 


The sampling error or margin of error is often misinterpreted by laypersons (and 
journalists). 


Example 1545. On the night of the 2016 Bukit Batok SMC By-Election, the Elections 
Department announced™? that based on a sample count of 900 ballots, 

¢ Dr. Chee had won 39% of the votes. 

¢ These sample counts have a confidence level of 95%, with a +4% margin of error. 
What does the above gobbledygook mean? Let ys be the true proportion of votes won by 
Dr. Chee. Let X be the sample proportion and x be the observed sample proportion. 


It’s clear enough what the 39% means—they randomly counted 900 ballots and found 
(after accounting for any spoilt votes) that % = 39% were in favour of Dr. Chee. 


What’s less clear is what the 95% confidence level and +4% margin of error mean. 


Here are three possible interpretations of what is meant. Only one is correct. 


1. “With probability 0.95, ju € (% - 0.04, Z + 0.04) = (0.35, 0.43).” 


2. “With probability 0.95, X € (%-0.04,Z + 0.04) = (0.35, 0.43).” 


Equivalently, suppose we repeatedly observe many random samples of size 900. Then we 
should find that in 0.95 of these observed random samples, the observed sample mean is 
between 0.35 and 0.43. 


3. “With probability 0.95, X ¢ (w— 0.04, w+ 0.04).” 
We have no idea what y is. All we can say is that with probability 0.95, the sample mean 
X of votes for Dr. Chee is between ps - 0.04 and p+ 0.04. 


Equivalently, suppose we repeatedly observe many random samples of size 900. Then we 
should find that in 0.95 of these observed random samples, the observed sample mean is 
between jz — 0.04 and p+ 0.04. 


Take a moment to understand what each of the above interpretations say. Then decide 
which you think is the correct interpretation, before turning to the next page. 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Interpretation #1—“with probability 0.95, yw € (x - 0.04, % + 0.04) = (0.35, 0.43)”—is per- 
haps the one most commonly made by laypersons.°“ It makes two errors: 


1. It is nonsensical to speak probabilistically about the proportion p of votes 
won by Dr. Chee. pu is some fixed number. So either p is in the interval (0.35, 0.43), 
or it isn’t. It makes no sense to speak probabilistically about whether p is in that 
interval. 


. The margin of error is applicable to the true proportion j and not to the 
observed sample proportion x = 0.39. 


Some “authorities” often attempt” to correct Interpretation #1 by offering Interpre- 
tation #2—“with probability 0.95, X ¢ (z-0.04,%+0.04) = (0.35,0.43)”. However, 


Interpretation #2 is still wrong, because it still makes the second of the above two errors. 


Unfortunately, the correct interpretation is also the one that says the least. It is Inter- 
pretation #3—“with probability 0.95, X € (w- 0.04, ~+0.04)”. 


This interpretation says merely that if we were somehow able to repeatedly observe 
random samples of size 900, then we’d find that 0.95 of the corresponding observed 
sample means will be in (j1- 0.04, w+ 0.04). Which isn’t saying much, because first of all, 
we have only one observed random sample; we do not get to repeatedly observe random 
samples. Secondly, this still doesn’t tell us much about jz, which is what we’re really 
interested in. 


The correct interpretation (Interpretation #3) is the least interesting interpretation. Per- 
haps this explains why journalists often prefer to give an incorrect interpretation. 





See section 147.8 (Appendices) for a discussion of where the Elections Department’s +4% 
margin of error comes from. 
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Journalists often try to explain what the confidence level and margin of error mean—they 
almost always get it wrong. 


Example 1546. On the night of the 2016 Bukit Batok SMC By-Election, a website 
called Mothership.sg wrote: 


“Based on the sample count of 100 votes,>*® it was revealed at 9.26pm that the SDP 
Sec-Gen received 39 percent of votes. In other words, Chee would score 35 per cent in 
the worst case scenario and 43 per cent in the best case scenario.” 


This is the most absurd misinterpretation of the margin of error I have ever seen.?*” 


Let’s see what the correct worst- and best-case scenarios are. 


Suppose that in the observed random sample of 900 votes, exactly 39% or 0.39 x 900 = 351 
were votes for Dr. Chee and the remaining 549 were for PAP Guy. Then, 


e Worst-case scenario: The observed random sample of 900 votes happened to contain 
exactly all of the votes in favour of Dr. Chee. That is, Dr. Chee won only 351 votes 
and PAP Guy won the remaining 23, 570-351 = 23,219 votes. So the correct worst-case 
scenario is that Dr. Chee won » 1.5% of the votes. 


Best-case scenario: The observed random sample of 900 votes happened to contain 
exactly all of the votes in favour of PAP Guy. That is, PAP Guy won only 549 votes 
and Dr. Chee won the remaining 23570 — 549 = 23,021 votes. So the correct best-case 
scenario is that Dr. Chee won x 97.7% of the votes. 


These worst- and best-case scenarios are admittedly unlikely. Nonetheless, they are 
possible scenarios all the same. The journalist’s purported worst- and best-case scenarios 
are completely wrong. 








46By the way, even this basic fact was wrong. The sample count was not 100 votes. Instead, it was 900 
votes, consisting of 100 votes from each of 9 polling stations. 
Moreover, the Mothership.sg journalist failed to report the confidence level of 95%, either because he 
didn’t know what it meant or because he didn’t think it important. But it is important. It is pointless 
to inform the reader about the margin of error without also specifying the confidence level. 

°47Vou can find several misinterpretations of the margin of error collected in this academic paper. None is 
as absurdly bad as the error committed here. 
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131.4. Critical Region and Critical Value 


Informally, the critical region is the set of values of the observed test statistic t for which 
we would reject the null hypothesis. ‘The critical region is thus sometimes also called the 
rejection region. 

And the critical value(s) is (are) the exact value(s) of the observed test statistic t at 
which we are just able to reject the null hypothesis. 


Example 1300. (Dr. Chee election.) Say that as before, we have a one-tailed test 
where the two competing hypotheses are 


ey — ra 
Ane i> 3: 


Say that as before, we choose a = 0.05 as our significance level. 


Say that as before, in our observed random sample of 100 votes, 39 are in favour of Dr. 
Chee, so that our observed test statistic is t = 39. 


We calculated that the corresponding p-value is 0.03398 and so we were able to reject Ho 
at the a = 0.05 significance level. 


We now calculate the critical region and the critical value. We can calculate that 
if t = 38, then the corresponding p-value is x 0.053 (you should verify this for yourself). 
And so we would be unable to reject Hp. 


We thus conclude that the critical value is 39, because this is the value of t at which we 
are just able to reject Ho. 


And the critical region is the set {39, 40,41,...,100}. These are the values at which we’d 
be able to reject Ho at the a = 0.05 significance level. 
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Same example as above, but now two-tailed: 


Example 1300. (Dr. Chee election.) 


Say that as before, we have a two-tailed test where the two competing hypotheses are 


igo — ea, 
Bs: 


The significance level is again aw = 0.05. Again, the observed random sample of 100 votes 
contains 39 in favour of Dr. Chee, so that our observed test statistic is t = 39. 


We calculated that the corresponding p-value is 0.06281 and so we failed to reject Ho at 
the a = 0.05 significance level. 


We calculate that if t = 40, then the corresponding p-value is » 0.03745 (you should verify 
this for yourself). Thus, the critical values are 20 and 40, because these are the values of 
t at which we are just able to reject Ap. 


The critical region is the set {0,1,...,20,40,41,...,100}. These are the values at which 
we'd be able to reject Ho at the a = 0.05 significance level. 


Exercise 514. (Answer on p. 1984.) We flip a coin 20 times. What are the critical 
region and critical value(s) in 


(a) A test, at the 5% significance level, of whether the coin is biased towards heads. 
(b) A test, at the 5% significance level, of whether the coin is biased. 
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131.5. Testing of a Population Mean 
(Small Sample, Normal Distribution, 7? Known) 


Example 1547. The weight (in mg) of a grain of sand is X ~ N(y,9). Our unknown 
parameter of interest is the true population mean p (i.e. the true average weight of a 
grain of sand). Our “guess” is that 4 = 5. We thus write down two competing hypotheses: 


Ay: w=5, 
levee Sor 


(Note that this is a two-sided test.) 


We take a random sample of size 4—(X1, Xo, X3,X1). Our test statistic is the sample 
mean X = (Xx, a Xo + X3 + X4) /4. 


Our observed random sample is (11, 22,%3,%4) = (3,9,11,7). That is, we randomly pick 
four grains of sand that happen to have weights 3, 9, 11, and 7 mg. Then the observed 
test statistic is 


Ob Jaf > 
ri = 





7.0. 


pe 


The p-value is the probability that the test statistic X takes on values “at least as extreme 
as” our observed test statistic Z = 7.5, assuming Hp : 4 = 5 were true. Note that if Ho 
were true, then X ~N (nu, o”/n) =N(5,9/4). Thus, the p-value is given by 


) =: (x > 7.5|Ho) +P (x < 2.5|Ho) 


® 0.04779 + 0.04779 = 0.09558. 


Thus, we reject Hp at the a = 0.1 significance level. However, we would fail to reject Ho 
at the a = 0.05 significance level. 
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The table below summarises the tests to use for the population mean, in different circum- 
stances. In this section, we learnt how to handle the first case (any sample size, normal 
distribution, 0? known). The following sections will deal with the other three cases. 


























Sample size | Distribution a” o”? known 
xs 
Any Normal Known | Z-test: ~ N(0, 1). 
o/ Ji 
L A K Z-test: zie ~ N(0, 1) 
arge ny nown -tes ,1). 
o| Fi 
L A Unk Z-test: uae N(O, 1) 
arge ny nknown -tes a 
s/ = 
Small Normal Unknown ere N(0, 1) 548 
J = 
Small Non-normal Either Not in A-Levels. 


Exercise 515. The Singapore daily high temperature (in °C) can be modelled by 
X ~ N(y,8). Our unknown parameter of interest is the true population mean su (i.e. 
the true average daily high temperature). Your friend guesses that ps = 34. You gather 


the following data on daily high temperatures, of 10 randomly chosen days in 2015: 
(35, 35, 31, 32, 33, 34, 31, 34, 35,34). Test your friend’s hypothesis, at the a = 0.05 signifi- 
cance level. (Be sure to write down your null and alternative hypotheses.) (Answer on 


p. 1985.) 
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131.6. Testing of a Population Mean 
(Large Sample, Any Distribution, c? Known) 


We'll recycle the same example from the previous section. Before, we knew that X was 
normally distributed. Now the big difference is that we have absolutely no idea what 
distribution XY comes from! 


To compensate, we require also that our random sample is “large enough”, so that the 
CLT-approximation can be used. 


Example 1548. The weight (in mg) of a grain of sand is X ~ (w,9). (This says simply 
that X is distributed with mean pw and variance 9.) Our unknown parameter of interest 
is the true population mean yu (i.e. the true average weight of a grain of sand). Again, 
we “guess” that = 5. Again, we write down: 


Ay: w=5, 
HAs iso, 


(Note that this is, again, a two-sided test.) 


This time, we'll take a random sample of size 100—(X1, X2,...,X100). Again, our test 
statistic is the sample mean X = (X1 + Xp +--+ + Xyq9) /100. 


Recall the magic of the CLT. Even if we have absolutely no idea what distribution X 
is drawn from, then provided n is sufficiently large, X is normally distributed. So here, 
since the sample is large (n = 100 > 20), by the CLT, we know that X has, approximately, 
the normal distribution N (u, o |n). So, if Hp were true, then we have, approximately, 
X ~N(p,0?/n) = N (5, 9/100). 

Say the observed test statistic we get is 


1+ La+°++++ 71900 


5.0. 
100 





‘Ga= 





(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Again, the p-value is the probability that our test statistic X takes on values “at least as 
extreme as” our observed test statistic Z = 5.6, assuming Ho: w= 5 were true. ‘Thus, the 
p-value is given by 


p=P(X256,X <4.4|H))=P(X 2 5.6|Ho) +P(X <4.4|Ho) 


6-H) pf, 44-4) _p(7, 56-5) p[,- 44-5 
ie) P(2< ae] p(z> Ses) (2s 4t-5 | 


=P{4 22)4P(Z <2) = 00450, 
Thus, we reject Ho at the a = 0.05 significance level. 


Exercise 516. The Singapore daily high temperature (in °C) can be modelled by X ~ 
(u,8). Our unknown parameter of interest is the true population mean yp (i.e. the true 
average daily high temperature). Your friend guesses that jz = 34. You gather the data 
on daily high temperatures, of 100 randomly chosen days in 2015 and find the observed 
sample average temperature to be 33.4 °C. Test your friend’s hypothesis, at the a = 0.05 
significance level. (Be sure to write down your null and alternative hypotheses. Also, 
clearly state where you use the CLT.) (Answer on p. 1985.) 
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131.7. Testing of a Population Mean 
(Large Sample, Any Distribution, c? Unknown) 


We'll recycle the same example from the previous section. Again, we have absolutely no 
idea what distribution X comes from. And again, the random sample is large enough, so 
that the CLT can be used. 


But now, o? is unknown. This turns out to be no big deal. We can simply replace o 
with the observed unbiased sample variance s”, and do the same thing as before. 


2 


Example 1549. The weight (in mg) of a grain of sand is X ~ (us, a (This says simply 
that X is distributed with mean p and variance o?.) Our unknown parameter of interest 
is the true population mean yu (i.e. the true average weight of a grain of sand). Again, 
we “guess” that = 5. Again, we write down 


Ay: w=5, 
HAs s. 


(Note that this is, again, a two-sided test.) 


Again, we take a random sample of size 100—(X1, X2,..., Xio0). Again, our test statistic 
is the sample mean X = (Xq + X29 +--+ + X49) /100. 


Again, since the sample is large (n = 100 > 20), by the CLT, that X has, approximately, 
the normal distribution N (u, G |n). So, if Hp were true, then we have, approximately, 


X~N (u,07/n) =N (5, 07/100). Since the sample variance S” is an unbiased estimator 


for o”, it is plausible that we also have, approximately, X ~ N (u, o*/n) = N (5, s*/100), 
where s” is the observed sample variance. 


Say the observed sample mean and observed sample variance we get are 


100 = 2 
_ GA te BD a + £100 See wana Pe: - a (ay i) _ 
100 n-1 








8 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 


Again, the p-value is the probability that our test statistic X takes on values “at least as 
extreme as” our observed test statistic Z = 5.6, assuming Ho: w= 5 were true. ‘Thus, the 
p-value is given by 


p=P(X256,X <4.4|H))=P(X > 5.6|Ho) +P(X <4.4|My) 


(6105 Ny (aed 2) TY fe te 9) 2a WY a 
: P(22 wd P(2: Th) P(z2 585) (2s 42-5] 


«P(Z > 2.1213) + P(Z < -2.1213) » 0.03389. 
Thus, we reject Hop at the a = 0.05 significance level. 


Exercise 517. The Singapore daily high temperature (in °C) can be modelled by X ~ 
(nu, cae Our unknown parameter of interest is the true population mean p (i.e. the true 
average daily high temperature). Your friend guesses that j= 34. You gather the data 
on daily high temperatures, of 100 randomly chosen days in 2015. Your observed sample 
mean temperature is 33.4 °C and your observed sample variance is 11.2 °C”. Test your 
friend’s hypothesis, at the a = 0.05 significance level. (Be sure to write down your null 
and alternative hypotheses. Also, clearly state where you use the CLT.) (Answer on p. 
1986.) 
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131.8. Formulation of Hypotheses 


Example 1550. We flip a coin 100 times. We get 100 heads. What can we say about 
the coin? 


This is an open-ended question, to which there can be many different answers. Here’s 
the answer we’re taught to give for H2 Maths: 


Let pz be the probability that a coin-flip is heads. We formulate a pair of competing 
hypotheses: 


Ey Ul, 
Hat jie... 


Our test statistic T is the number of heads (out of 100 coin-flips). Our observed test 
statistic t is 100. The corresponding p-value (note that this is a two-tailed test) is 


P(T > 100,T < 0|Ho) = P(T =0|Ho) + P(T = 100|Hp) 


= O50 aut 0.50.5" & 1.578 x 10. 
0 100 


The tiny p-value may be interpreted as casting on or providing evidence against Ho. 


We note also that we can easily reject Hp at any of the conventional significance levels 
(eo =U) a= 005, or a= 041). 


Exercise 518. (Answer on p. 1986.) We observe the weights (in kg) of a random sample 
of 50 Singaporeans: (21, %2,...,250). We observe that }) x;/50 = 68 and )) 77/50 = 5000. 


A friend claims that the average American is heavier than the average Singaporean. It is 
known that the average American weighs 75 kg. Is your friend correct? If you make any 
assumptions or approximations, make clear exactly where you do so. (Hint: Use Fact 
244(a)). 
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132. Correlation and Linear Regression 


132.1. Bivariate Data and Scatter Diagrams 
In this chapter, we’ll be interested in the relationship between two sets of data. 


Example 1551. We measure the heights and weights of 10 adult male Singaporeans. 
Their heights (in cm) and weights (in kg) are given in this table: 


a a eee ee 
h; (em) | 182 165 173 155 178 174 169 160 150 190 
wi (kg)| 81 70 71 53 72 75 69 60 44 80 





We call (h;,w;) observation 7. So for example, observation 5 is (178, 72) and observation 
9 is (150, 44). 


We can plot a scatter diagram of these 10 persons’ weights (vertical axis) against their 
heights (horizontal). 


90 7 Weight (kg) 
80 


Height (em) 


145 165 175 185 195 


The black dotted line is called a line of best fit. Shortly (section 132.4), we’ll learn 
how to construct this line of best fit. 


The more closely the data points in the above scatter diagram lie to a straight line, the 
more strongly linearly correlated are weight and height. So here with these particular 
data, the linear correlation between weight and height seems strong. In the next section, 
we'll learn about the product moment correlation coefficient, which is a way to 
precisely quantify the degree to which two sets of data are linearly correlated. 


Because the line of best fit is upward-sloping, we can also say that the linear correlation 
is positive. 
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Example 1552. We have data from the Clementi weather station for the daily high 
temperature (in °C) and daily rainfall (in mm) on 361 days in 2015. (Strangely, data 
were missing for four days, namely Feb 10-13.) 


(| ee ae ee ee 
TCO Macaca ar” 30.2 
poanm) | 02902) 0 ao 12.4 





We can again plot a scatter diagram of rainfall against temperature. 


Rainfall Gnm) 


30 Temperature (degrees Celsius) 35 


Again, the black dotted line is a line of best fit. The data points do not seem close to 
this line. Thus, it seems that the linear correlation between temperature and rainfall is 
weak. 


The line of best fit is downward-sloping and so we say that the linear correlation is 
negative. 


Exercise 519. (Answer on p. 1987.) The table below shows the prices charged (p) and 
the number of haircuts (q) given by 5 different barbers, during June 2016. 


Draw a scatter diagram with price on the horizontal axis. Plot also what you think looks 
like a line of best fit. 
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132.2. Product Moment Correlation Coefficient (PMCC) 


In the previous section, we used a scatter diagram to determine if there was a plausible 
linear relationship between two sets of data. This, though, was a very crude method. 


A more precise measure of the degree to which two sets of data are linearly correlated is 
called the product moment correlation coefficient (PMCC). Formally: 


Definition 261. Let (71, 22,...,2%n) and (41, y2,---;Yn) be two ordered sets of real num- 
bers. The product moment correlation coefficient (PMCC) is the following real number: 


Di (i ~ T) (Yi - 9) 


See ee 





—— 





Properties of the PMCC. 
1. -1<r<1. (Surprisingly, this can be proven using vectors: Fact 302 (Appendices).) 


2. We say the linear correlation is positive if r > 0 and negative if r < 0. 


3. If r = 1, the linear correlation is positive and perfect. 


J r=1. 
Datapoints lie exactly ona 
upward-slopingline. 89 5 —_=§_------ 77 
a---° 





4. If r = -1, the linear correlation is negative and perfect. 


r=-1. 
C-e__ Datapoints lie exactly ona 
Re eae 00-6 downward-sloping line. 





. we 3 Datapoints lie close to 
e a upward-sloping line. 





1340, Contents www.EconsPhDTutor.com 


6. If r is close to —1, the linear correlation is very strong. 


Ds rz -0.941. 
ia ee Datapoints lie close toa 





& 
6 e e 
Dee ee eee Me secs ae 
es r= 0.080. 
The data are positively and very ° 
i 6 XxX 





® X 





The data are negatively an 
very weakly linearly corfelated. 


8. r is merely a measure of linear correlation and nothing else. Two variables may be very 
closely related but not linearly correlated. For example, data generated by the quadratic 
model y; = x? may have a very low r. 
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Example 1551 (continued from above). This is the height and weight example 
revisited. For convenience, we reproduce the data and scatter diagram: 


Mita Se ee ee ee nl 
182 165 173 155 178 174 169 160 150 190 
70 71 53 72 75 69 60 44 80 





Height (em) 


175 185 195 


_ 182+ 165+ 173+ 155+ 178 +174 + 169 + 160 + 150 + 190 
‘ 10 





= 169.6, 


pe aU ee Ue ee 
si 10 ial 


5 (hy — R) (ww; —) = (182 - A) (81-1) +--+ (190 — h) (80 - @) = 1237 


t=1 


y\(h (182 - 169. 6) +-+++ (190 - 169. 6)° x 37.180640, 


= ll 


tb)’ = \/ (81 - 67.5)" + +--- + (80 — 67.5)” x 35.418922, 


coe 
Dict (hj = h) Die (Wy - w)” 


As expected, r > 0 (the linear correlation is positive or, equivalently, the line of best fit 
is upward-sloping). Moreover, r is close to 1 (the linear correlation is very strong). 
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Example 1552 (continued from above). This is the temperature and rainfall example 
revisited. For convenience, we reproduce the data and scatter diagram: 


pein 6 esol 
HCO) || 27a 2205 aie 30.2 
Pom 10s 02a 0 a0 12.4 





We can again plot a scatter diagram of rainfall against temperature. 


Rainfall Gnm) 


O95 30 Temperature (degrees Celsius) 35 


— 27.34 29.54 31.1432 +--+ +30.2 
7 361 


Ye (ti -t) (wi - B) 
yee poe 


(002707 0ros+ Da 
7 361 > 


~ 31.5, 5.0. 





: (273,=31.5) (= 5.0)4-2-4 (202— 31-5) 1245.0) 
AQT 381) fee ( 802315) (5.0) 2 Ce 50) 


a —0.1623. 





As expected, r < 0 (the linear correlation is negative or, equivalently, the line of best fit 
is downward-sloping). Moreover, r is fairly close to 0 (the linear correlation is weak). 
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132.3. Correlation Does Not Imply Causation (Optional) 


Correlation does not imply causation. This saying has now become a cliché. Doesn’t make 
it any less true. 


Below is an amusing but spurious correlation (source): 


US spending on science, space, and technology 
correlates with 


Suicides by hanging, strangulation and suffocation 





1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 
$30 billion 10000 suicides 
o 
I 
By o 
9 $25 billion 8000 suicides 3 
s S 
3° 3 
oD Q 
i=] Qn 
3 EB. 
a sae me ie} 
© $20 billion 6000 suicides 5 
a ® 
n n 
n 
~Y 
$15 billion 4000 suicides 
1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 


-® Hanging suicides-e US spending on science 


The PMCC is r x 0.99789126. So the two sets of data are almost perfectly linearly corre- 
lated. But of course, this doesn’t mean that spending on science causes suicides or that 
suicides cause spending on science. More likely, the correlation is simply spurious. 


A comic from xkcd: 
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132.4. Linear Regression 


Example 506 (continued from above). We suspect that the heights and weights of 
adult male Singaporeans are linearly correlated. We thus write down this linear model: 


w=atbdh. 


Recall the quote: “All models are wrong, but some are useful.” The model w =a + bh is 
unlikely to be exactly correct. But hopefully it will be useful. 


We treat a and b as unknown parameters (do you expect b to be positive or negative?). 
Our goal is to try to get estimates for a and b, from an observed random sample of height 
and weight data. 


We recycle the data from earlier. These, along with the scatter diagram, are reproduced 
for convenience. 


Se eee eee 
h; (cm) | 182 165 173 155 178 174 169 160 150 190 
w; (kg) | 81 70 71 53 72 75 69 60 44 80 





Weight (kg) 


Height (em) 


175 185 195 


The basic idea of linear regression is this: Find the line that “best fits” the given data. 
Drawn in the figure above are three plausible candidates for the “line of best fit”. But 
there can only be one line of best fit. Which is it? 


At the end of the day, we’ll choose black dotted line as “the” line of best fit. But why? 
This will be answered in the next section. 
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Example 1552 (continued from above). We suspect that daily rainfall and daily high 
temperatures for 2015 were linearly correlated. We thus write down this linear model: 


p=at bt. 


Again, our goal is to get estimates for the unknown parameters a and 6 (do you expect 
b to be positive or negative’). 


We gather the following data (recycled from before): 


el ae | 
COM Pre woe eleey 30.2 
pi(mm)| 0 02 0 0 12.4 





We can again plot a scatter diagram of rainfall against temperature. 


80 F Rainfall (nm) 


30 Temperature (degrees Celsius) 35 


Again, drawn in the figure above are several plausible candidates for the “line of best fit”. 
It turns out that the black dotted line will be “the” line of best fit. 
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132.5. Ordinary Least Squares (OLS) 


There are different methods for determining “the” line of best fit. Each method will give a 
different line of best fit. 


The method we’ll learn in H2 Maths is the most basic and most standard method. It is 
called the method of ordinary least squares (OLS). 


Let’s assume there is some true linear model, which may be written as y = a+b. As always, 
we stick to the objectivist interpretation. The parameters a and b have some true, fixed 
values. However, they are unknown (and may forever be unknown). 


Nonetheless, we’ll try to do our best and get estimates for a and b. These estimates will be 
denoted @ and b. And our line of best fit will then be y = @+ bx. 


How do we find this line of best fit? Intuitively, this will be the line to which the data 
points are “as close as possible”. But there are many ways to define the term “as close 
as possible”. For example, we could try to minimise the sum of the distances between the 
points and the line. But we shall not do this. 


Instead, we’ll use the method of OLS: 


1. Measure the vertical distance of each data point (x;, y;) from the line. This is called the 
residual and is denoted %@;. 


2. Our goal is to find the line y = @+ bx that minimises \> a?—this quantity is called the 
Sum of Squared Residuals (SSR). 


Example: 
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Example 1551 (height and weight example revisited). Our candidate line of best 
fit is w = @+bh = 65+ 0h = 65. This is a horizontal line, which simply “predicts” that 
everyone’s weight is always 65 kg, regardless of their height. (This is a somewhat silly 
candidate line of best fit. Not surprisingly, this is not the actual line of best fit.) 


Weight (kg) 


w=4@+bh=65+ 0h 


Height (em) 


165 175 


1 2 3 4 Oo 10 
160 150 190 

w; (kg) 60 44 80 

i; (kg) 65 65 65 

lg = Ww, — Wy (ks) -5 -21 15 








The second last row of the above table gives, for each person with height h;, the cor- 
responding predicted weight #; (as per our candidate line of best fit). The residual @; 
(last row) is then defined as the vertical distance between the data point and the weight 
predicted by the candidate line of best fit. 


10 
The SSR is }* &? = 167+ 5° +6? + (-12)?+ 7° + 10° +4? + (-5)? + (-21)? + 15? = 1317. 
i=l 
Can we do better than this? That is, can we find another candidate line of best fit whose 
SSR is smaller than 1317? 
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The following fact gives two formulae for 6, the gradient of the line of best fit. Formula. (i) 
is printed in the List of Formulae (MF26) you get during exams, but formula (ii) is not. 


Fact 248. Let (21,22,...,%n) and (y1,Y2,---,Yn) be two ordered sets of data. The OLS 
regression line of y onx is y-y=b(x-2Z), where 


oe ene OT) 
i) 6-3 — 
. Te 


ig) 
ii) b= — >.——_. 
(ii) yr Sng 


Moreover, the regression line can also be written in the form y = 4+ bx, where b is as 
given above and @= y — bz. 





Proof. We want to find @ and 6 such that the line y = 4+ bx has the smallest SSR possible. 


The residual @; is defined as the vertical distance between (x;,y;) and the line y = @+ ba. 
That is, 


i= yi-y = yi (4+ bx:). 


Thus, the SSR is ein = » Ei - (a + ini). 


We wish to minimise the SSR, by choosing appropriate values of @ and b. This involves the 
following pair of first order conditions:>”” 


<a =o, = Lat=0. 


The remainder of the proof simply involves taking derivatives and doing the algebra, and 
is continued on p. 1744 (Appendices). 




















*49There’s a bit of hand-waving here. 
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Example 1551 (height and weight example revisited). We already calculated 
h=169.6, w=67.5, > (hi- h) = 1382.4,  (h;-h) (w;-w) = 1237. 
i=l ll 


So, b= ry (hi - h) og) agers 
ya (h; = h) 1382.4 





x 0.8948. 


Thus, the regression line is w — 67.5 = 0.8948 (h - 169.6) or w = 4+ bh = -84.26 + 0.8948h. 


Weight (kg) 


OLS line of best fit 
w =@+4+bh = —84.26 + 0.8948h 


Height (em) 





182 165 173 155 178 174 #+169 160 150 = 190 
8l 70 71 53 72 #75 69 60 44 ~ 80 
78.6 63.4 70.5 54.4 75.0 71.4 67.0 58.9 50.0 85.8 
24 66 ~ 00  =—14 =30° 36° 20 1.1 —6.0" =5:8 





10 
The SSR for the actual line of best fit is }° @? = 2.47 +---+ (-5.8)? 147.6. This is much 


i=1 
better than the SSR of 1317 that we found for the previous candidate line of best fit, 
which was simply a horizontal line. 
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132.6. TI84 to Calculate the PMCC and the OLS Estimates 


Example 1553. We'll find the PMCC and the regression line for these data: 





. Press to turn on your calculator. 


. Press the blue button and then CATALOG (which corresponds to the (0) button). 
This brings up the CATALOG menu. 
. Using the down arrow key S scroll down until the cursor is on DiagnosticOn. 


. Press (NER once. And press (NBR a second time. The TI84 now says “DONE”, 
telling you that the Diagnostic option has been turned on. 


The above steps need only be performed once. Unless of course you’ve just reset your 
calculator (as is required before each exam). In which case you have to go through the 
above steps again. 


Step 1. Step 2. Step 3. Step 4. 


. Press SABNY to bring up the STAT menu. 
. Press (1) to select the “1:Edit” option. 


7. The T184 now prompts you to enter data under the column titled “L1”. This is where 
you should enter the data for x, using the numeric pad and the UINSIs) key as is 
appropriate. (I omit from this step the exact buttons you should press. ) 


. After entering the last entry, press the right arrow key })| to go to column L2. So enter 
the data for y, again using the numeric pad and the (20098) key as is appropriate. 


Step 5. Step 6. Step 7. Step 8. 


(Example continues on the next page ...) 
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(... Example continued from the previous page.) 
9. Now press to again bring up the STAT menu. 
10. Press the right arrow key })] to go to the CALC submenu. 
11. Press (4) to select the “4:LinReg(ax+b)” option. 
12. To tell the TI84 to go ahead and do the calculations, simply press (DNPH. 


The TI84 tells you that the PMCC is r = -.8147656398. The equation of the regression 
line of y on x is y = ax +b = —.859375x + 11.75625. 


(Be careful to note that the TI84 uses the symbol “a” for the coefficient for 7, whereas 
in the List of Formulae (MF 26), they use b instead. Don’t get these mixed up!) 


Step 9. Step 10. Step 11. Step 12. 


Exercise 522. Using your TI84, find the PMCC between q and p, and also find the 
regression line of g on p (see data below). Verify that your answer for this exercise is the 
same as those in the last two exercises. (Answer on p. 1989.) 
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132.7. Interpolation and Extrapolation 


Given any value of x, we call the corresponding % = b(ax —~£)+y the fitted value or the 
predicted value. One use of the regression line is that it can help us predict (or “guess”) 
the value of y, even for x for which we have no data. 


Example 1551 (height and weight example revisited). Say we want to guess 
the weight of an adult male Singaporean who is 185 cm tall. Using our regression line, 
we predict that his weight is Wp-ig5 = 0.8948 x 185 — 84.26 » 81.3 kg. This is called 
interpolation, because we are predicting the weight of a person whose height is between 
two of our observations. 


Say instead we want to guess the weight of an adult male Singaporean who is 210 cm tall. 
Using our regression line, we predict that his weight is Wp-219 = 0.8948 x 210—-84.26 = 103.6 
kg. This is called extrapolation, because we are predicting the weight of a person whose 
height is beyond on our rightmost observation. 


il Zz 3 a 5 6 7 8 9 10 
182 165 173 5d 178 174 169 160 150 190 185 © 210 
8l 70 71 53 72 #7 69 #60 44 80 - - 
78.6 63.4 70.5 54.4 75.0 71.4 67.0 58.9 50.0 85.8 81.3 103.6 








OLS line of best fit 
w = 4+ bh = —84.26 + 0.8948h_-~ 


Height|(em) 
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For the A-Level exams, you are supposed to mindlessly and formulaically say that “Extrap- 
olation is less reliable than interpolation”, because 


The former predicts what’s beyond the known observations; the 
latter predicts what’s between two known observations. 


This, though, is not a very satisfying explanation for why extrapolation is “less reliable” 
than interpolation. It merely leads to another question: “Why should a prediction be more 
reliable if done between two known observations, than if done to the right of the right-most 
observation (or to the left of the left-most observation)?” 


We won’t give an adequate answer to this latter question. Instead, we'll simply give a 
bunch of examples to illustrate the dangers of extrapolation: 


Example 1554. A man on a diet weighs 115 kg in Week #1. Here’s a chart of his weight 
loss. 


The OLS line of best fit suggests that he has been losing about 0.5 kg a week. 


He forgot to record his weight on Week #6. By interpolation, we “predict” that his 
weight that week was 112.5 kg. This is probably a reliable guess. 


By extrapolation, we predict that his weight on Week #£201 will be 15 kg. This guess is 
obviously absurd. It requires that he keeps losing 0.5 kg a week for nearly 4 years. 
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Example 1555. A growing boy is 160 cm tall in Month #1. Here’s a chart of his growth. 


170 + Height (cm) 
169 
168 
167 
166 
165 
164 
163 
162 
161 


160 
10 


The OLS line of best fit suggests that he has been growing by about 1 cm a month. 


He forgot to record his height in Month #6. By interpolation, we “predict” that his 
height that month was 165 cm. This is probably a reliable guess. 


By extrapolation, we predict that his height in Month #101 will be 260 cm. This guess is 
obviously absurd. It requires that he keep growing by 1 cm a month for the 8-plus years. 
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Here are three colourful examples of the dangers of extrapolation from other contexts. 


Example 1556. Russell’s Chicken (Problems of Philosophy, 1912, Google Books link): 


The man who has fed the chicken every day throughout its life at last wrings its neck 
instead, showing that more refined views as to the uniformity of nature would have been 
useful to the chicken. .... The mere fact that something has happened a certain number 
of times causes animals and men to expect that it will happen again. Thus our instincts 
certainly cause us to believe the sun will rise to-morrow, but we may be in no better a 
position than the chicken which unexpectedly has its neck wrung. 


Example 1557. The Fermat numbers are 


ae 
= aes, 
(eee ail = lig 


Pe 2 57 
F, = 22° +1 = 65537. 


Remarkably, the first five Fermat numbers are all prime. This observation led Fermat to 
conjecture (guess) in the 17th century that all Fermat numbers are prime. This was an 
act of extrapolation. 


Unfortunately, Fermat’s act of extrapolation was wrong. About a century later, Euler 
5 

showed that F; = 2? + 1 = 4294967297 = 641 x 6700417 is composite (not prime). 

Today, the Fermat numbers F;, Fg, ..., f32 are all known to be composite. Indeed, 

it was shown in 1964 that F32 is composite. Over half a century later, it is not yet 


known if F33 = 2" +1 is prime or composite. F 3 is an unimaginably huge number, with 
2,585, 827,973 digits. 
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Example 1558. On Ah Beng’s first day at school, he learns in Chinese class that the 
Chinese character for the number 1 is written as a single horizontal stroke. 


On his second day at school, he learns that the Chinese character for the number 2 is 
written as two horizontal strokes. 


On his third day at school, he learns that the Chinese character for the number 3 is 
written as three horizontal strokes. 


The Chinese The Chinese The Chinese 
character for 1 character for 2 character for 3 


After his third day at school, Ah Beng decides he'll skip at least the next few Chinese 
classes, because he thinks he knows how to write the Chinese characters for the numbers 4 
and above. 4 simply consists of four horizontal strokes; 5 simply consists of five horizontal 
strokes; etc. Unfortunately, Ah Beng’s act of extrapolation is wrong. 


The characters for the numbers 4 through 10 look instead like this: 


ABN GIN 
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On the other hand, here are two historical examples of extrapolation that, to everyone’s 
surprise, have held up remarkably well (at least to date). 


Example 1559. Moore’s Law. In 1965, Gordon Moore observed that the number of 
components that could be crammed onto each integrated circuit doubled every year. He 
predicted that this rate of progress would continue at least through 1975. 


In 1975, he adjusted his prediction to a more modest rate of doubling every two years. 
Thus far, this latter prediction has held up remarkably well. The following from Nature: 


MOORE’S LORE 


For the past five decades, the number of transistors per microprocessor 
chip — a rough measure of processing power — has doubled about every 
two years, in step with Moore’s law (top). Chips also increased their ‘clock 
speed’, or rate of executing instructions, until 2004, when speeds were 
capped to limit heat. As computers increase in power and shrink in size, a 
new class of machines has emerged roughly every ten years (bottom). 





1019 


OT al 


10% = 


LOS: 


GS 
LOW = 


ee © 


Transistors per chip 


@ 
Clock speeds (MHz) 
1.G:4 TE OR 1 ck: nee Speed Se See oe) ee eee ee Oe ye a poe ey a ea eee eet 
1960 1974 1988 2002 2016 


Unfortunately, as stated in the same Nature article, it “has become increasingly obvious 
to everyone involved” that “Moore’s law ... is nearing its end”. 
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Example 1560. Augustine’s Law. In 1983, Norman Augustine observed that the cost 
of a tactical aircraft grows four-fold every ten years. (Google Books.) 
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This is considerably quicker than the rate at which the annual US defense budget and 
US Gross National Product (GNP) grows. Extrapolating, he concluded: 
e In 2054, the entire annual US defense budget will be spent on a single aircraft. 


e Early in the 22nd century, the entire US GNP will be spent on a single aircraft. 


$ 


ONE 
QUINTILLION = 


1361, Contengwe aad 
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(... Example continued from the previous page.) 
These seemingly absurd conclusions were written at least partly in jest. 


Except so far they have been right on track. In a 2010 Economist article, Augustine was 
quoted as saying, “We are right on target. Unfortunately nothing has changed.” That 
article also presented an updated version of Augustine’s Law. 


The latest F-35 fighter program is estimated to cost the US Department of Defense 
US$1.124 trillion. To be fair, that estimate is the cost of the entire program over its 
projected 60-year lifespan (through 2070)—this includes R&D, the purchase of over 2, 000 
F-35s, and operating costs. But still, US$1.124 trillion is a mind-blowing figure.®” 


Augustine’s, not Moore’s law 2 


US combat aircraft O Estimated 


rary 
a) 
2) 


7 
7 
F-22 Raptor 70 


70 
F-14 Tomcat 15 Fale) 35 aSF 
. F-18 Hornet 


- pp 


P-51 


Mustang F-16 Falcon 


Ee 


7 


mae P-40 Kittyhawk 
2 Wright Model A 


E 
D 
aS} 
ww 
7 
S 
w 
= 
3 
e 
<x 


Sources: Norman R. 
Augustine; John D. Christie 


Exercise 523. Using the data below, “predict” how many haircuts were sold in June 
2016 by (a) a barber who charged $7 per haircut; and (b) a barber who charged $200 per 
haircut. Which prediction is an act of interpolation and which is an act of extrapolation? 
Which prediction do you think is more reliable?(Answer on p. 1989.) 
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132.8. Transformations to Achieve Linearity 


Two variables may have a relationship, but not a linear one. Here we consider cases where 
the relationship is quadratic, reciprocal, or logarithmic. 


Example 1561. Quadratic. Consider the following data. There is a very strong, but 
not perfect degree of linear correlation between x and y (r * 0.950). The observations are 
very close to, but are not exactly on the OLS line of best fit. 


40 xy 
35 
30 
Za 


2 3 4 
6.7 5.3 143 25.4 38.4 


Perhaps we can do better by transforming the data. We’ll do a quadratic transformation: 
let z; = 27. Then we have the following data. 


40 xy 
35 
30 
25 


4 9 16 
6.7 5.3 14.3 25.4 38.4 


The degree of linear correlation between z and y is near perfect (r # 0.995). The obser- 
vations also lie closer to the line of best fit than before. 
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Example 1562. Reciprocal. Consider the following data. There seems to be a moder- 
ate degree of linear correlation between x and y (r * -0.603). The observations are fairly 
close to the OLS line of best fit. 


Perhaps we can do better by transforming the data. We’ll do a reciprocal transformation: 
let z; = 1/x;. Then we have the following data and scatter diagram. 


1/3 1/5 1/7 
49 55 5.5 52, 


The degree of linear correlation between z and y is much stronger (r * 0.899). The 
observations also lie closer to the line of best fit. 
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Example 1563. Logarithmic. Consider the following data. There seems to be a fairly 
strong degree of linear correlation between x and y (r » 0.873). The observations are 
fairly close to, but are not exactly on the OLS line of best fit. 


TS Oy 


Perhaps we can do better by transforming the data. We'll do a reciprocal transformation: 
let z; = Ina;. Then we have the following data and scatter diagram. 


7.5 KV 


48 65 7.0 74 T.3 


The degree of linear correlation between z and y is much stronger (r * 0.978). The 
observations also lie closer to the line of best fit. 
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132.9. The Higher the PMCC, the Better the Model? 


There are no routine statistical questions, only questionable statistical rou- 
tines. 


— J.M. Hammersley””! 


It’s much more interesting to live not knowing than to have answers which 
might be wrong. 


— Richard Feynman (1981). 


502 


The A-Level examiners?” want you to say, mindlessly and formulaically, that 


All else equal, a model with a higher PMCC 
is better than a model with a lower PMCC. 


Regurgitating the above sentence will earn you your full mark. But in fact, without the 
“all else equal” clause, it is nonsense. And since it is almost never true that “all else is 
equal”, it is almost always nonsense. 


In every introductory course or text on statistics, one is told that the PMCC is merely 
a relatively unimportant consideration, in deciding between models. Yet somehow, the 
A-Level examiners seem to consider the PMCC an all-important consideration. 


Here’s a quick example to illustrate. 


Example 1564. (From the 2015 exam—see Exercise 793 below.) In an experiment the 
following information was gathered about air pressure P, measured in inches of mercury, 
at different heights above seA-Level h, measured in feet. 


h | 2000 5000 10000 15000 20000 25000 30000 35000 40000 45000 
P | 27.8 249 20.6 169 138 111 889 704 552 4.28 





The exam first asks us to find the PMCCs between (a) h and P; (b) Inh and P; and (c) V/A 
and P. The answers are (a) rg * —0.980731; (b) r, » —0.974800; and (c) r, » —0.998638. 


The A-Level exam then says, “Using the most appropriate case ..., find the equation 
which best models air pressure at different heights.” The “correct” answer is that (c) 


P =a+bVh is the “most appropriate” model, simply because the PMCC there is the 
largest. 





(Example continues on the next page ...) 





*52See 9740 N2015/II/10(iii), N2014/II/8(b)(ii), N2012/II/8(v), N2011/II/8(iii), N2010/II/10(iii), and 
N2008/II/8(i). These are given in this textbook as Exercises 793, 799, 814, 820, 830, and 842. 


1367, Contents www.EconsPhDTutor.com 


(... Example continued from the previous page.) 


But this is utter nonsense. One does not conclude that one model is “more appropri- 
ate” than another simply because its PMCC is 0.018 larger. Small measurement errors 
or plain bad luck could easily explain these tiny differences in PMCCs. 


Moreover, even if one model has r = 0.9 and another has r = 0.4, it does not automatically 
follow that the first model is “more appropriate” than the second. In deciding which 
statistical model to use, there are very many considerations, of which the PMCC is a 
relatively unimportant one. 


In my view, the correct answer should have been this: 
We have far too little information to make any conclusions. 


Sadly, in the Singapore education system, what I consider to be the correct answer would 
not have gotten you any marks. Instead, one is taught that there must always be one 
single, simplistic, formulaic, definitive, “correct” answer. This is a convenient substitute 
for thinking. 


As it turns out, the “most correct” linear model—based on the actual barometric formula 
(see subsection 147.10 (Appendices) )—is actually the following: 


L 
InP=a+bInj1+—h}. 
vere Gy) 


The constants L = —0.0065 kelvin per metre (Km‘') and T = 288.15 kelvin (K) are, 
respectively, the standard temperature lapse rate (up to 11,000 m above sea level) and 
the standard temperature (at sea level). 


The PMCC for the above model is rg * 0.999998, which is “better” than the cases exam- 
ined above. (See this Google spreadsheet for the data and calculations. ) 


But again, the PMCC is merely one relatively unimportant consideration. Our 
conclusion that this last model is superior to the model P =a+ b/h is based not on the 
fact that rq is 0.001 larger than r-. 


Instead, we are confident in this model because it was derived from physical theories. In 
contrast, the model P = a+bVh (or indeed any of the other models suggested above) 
is completely arbitrary and has no theoretical justification. Hence, even if the model 
P=a+bVh had a PMCC of 1, we’d still prefer this last model. 
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Part VII. 
Ten- Year Series 





Answers for Part VI (Probability and Statistics) 2016-19 questions will be 
written “soon”. © 
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As stated on p. 4 of your H2 Maths syllabus, your A-Level exam will consist of two papers 
of 3 hours each. 
e Paper 1 [total 100 points]: Pure Mathematics, 3 hours, 10-12 questions. 
¢ Paper 2 [total 100 points]: 3 hours 
— Section A [total 40 points]: Pure Mathematics, 4-5 questions. 
— Section B [total 60 points]: Probability and Statistics, 6-8 questions. 


Each paper contains 100 points. So, you have an average of 1.8min = 108s to spend on 
each point and each point is 0.5% of the maximum score of 200. 


For more practice, try the TYS questions for H1 Maths (in my H1 Maths 
Textbook). They’re very similar! 


This part lists all the questions from the 2006-2017 A-Level exams, sorted into the six 
different parts and in reverse chronological order. 


In the older exams, they had the habit of not distinctly numbering different parts within 
the same question as parts (i), (ii), etc. So I have sometimes taken the liberty of adding or 
modifying such numbers. 
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Many questions are out-of- 
syllabus and can be skipped. 
(They’re printed in grey).5 





»°8Happily, the present 9758 syllabus (first examined in 2017) is considerably lighter than the previous 
9740 syllabus (last examined in 2017), which was in turn lighter than the previous 9233 syllabus (last 
examined in 2008). Thus, many past-year questions printed here are no longer in the current 9758 
syllabus and you can skip them. Answers have been provided anyway and you’re perfectly welcome to 
try them. 
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The following appears on the cover page of each of your 9758 A-Level exam papers. 


READ THESE INSTRUCTIONS FIRST 

Write your Centre number, index number and name on the work you hand in. 
Write in dark blue or black pen on both sides of the paper. 

You may use an HB pencil for any diagrams or graphs. 


Do not use staples, paper clips, glue or correction fluid. 


Answer all the questions. 


Give non-exact numerical answers correct to 3 significant figures, or 1 decimal place in the 
case of angles in degrees, unless a different level of accuracy is specified in the question. 


You are expected to use an approved graphing calculator. 


Unsupported answers from a graphing calculator are allowed unless a question specifically 
states otherwise. 


Where unsupported answers from a graphing calculator are not allowed in a question, 
you are required to present the mathematical steps using mathematical notations and not 
calculator commands. 


You are reminded of the need for clear presentation in your answers. 


At the end of the examination, fasten all your work securely together. 


The number of marks is given in brackets |] at the end of each question or part question. 
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133. Past-Year Questions for Part I. Functions and Graphs 


Exercise 525. (9758 N2019/I/3.) (Answer on p. 1991.) 
A function is defined as f (x) = 2x° - 6a? + 6x - 12. 


(i) Show that f(a) can be written in the form p{(a+q) +r}, where p, g and r are 


constants to be found. [2] 

(ii) Hence, or otherwise, describe a sequence of transformations that transform the graph 
of y = x® onto the graph of y = f (x). [3] 
Exercise 526. (9758 N2019/I/4.) (Answer on p. 1991.) 
(i) Sketch the graph of y = |2” - 10], giving the exact values of any points where the curve 
meets the axes. [3] 

(ii) Without using a calculator, and showing all your working, find the exact interval, or 
intervals, for which |2” - 10| < 6. Give your answer in its simplest form. [3] 
Exercise 527. (9758 N2019/I/5.) (Answer on p. 1992.) 


The functions f and g are defined by 


f(z) =e*-4, ceR, 
g(x)=x+2, «eR. 








(i) Find f~' (x) and state its domain. [3] 

(ii) Find the exact solution of fg (x) =5, giving your answer in its simplest form. [3] 

Exercise 528. (9758 N2019/I/7.) (Answer on p. 1992.) 

A curve C' has equation y = xe”. 

(i) Find the equations of the tangents to C at the points where x = 1 and x = -1. [6] 

(ii) Find the acute angle between these tangents. [2] 

Exercise 529. (9758 N2019/II/2.) (Answer on p. 1992.) 

= 

(i) Sketch the graph of y = —“"*___ Give the equations of the asymptotes and the 
327 + 5a - 8 

coordinates of the point(s) where the curve crosses either axis. [4] 

(ii) Solve the i lit pre i (1] 
ii) Solve the inequality —.———— > 0. 

TY 302 + Ba — 8 = 

nis ; f= 2 oe 

(iii) Hence solve the inequality 0. [1] 


—— 
3072+ 5x -8 
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Exercise 530. (9758 N2018/I/4.) (Answer on p. 1993.) 


(i) Find the exact roots of the equation Pig + 3x - 2| =2-2. [4] 


(ii) On the same axes, sketch the curves with equations y = Pe +32 - 2| and y=2-2. 


Hence solve exactly the inequality 


Qn? +34 -2|<2--. [4] 


Exercise 531. (9758 N2018/I/5.) (Answer on p. 1993.) 
Functions f and g are defined by 





Fe ose for s€R, c #-b, a #-1, 
x+b 
g:Le"£ for ce R. 
It is given that ff = 4g. 
Find the value of 6. 
Find f(z) in terms of x and a. [5] 


Remark 178. ‘This question contains an error: 


Observe that f f (—b) is undefined, while g(-b) = -b. So, ff #g. 
Below is how I would’ve written this question (leaving the last two sentences unchanged): 


Let a,b ¢R with a+—1. Define f :R\ {-b} > R by f ( )= =. 


It is given that f f(x) =a for all ze R\ {-5}. 
Find the value of b. 


Find f(z) in terms of x and a. 


(The function g in the original erroneous question was superfluous. ) 


The answer I provide is to the above rewritten question. 





Exercise 532. (9758 N2017/I/2.) (Answer on p. 1994.) 





(i) On the same axes, sketch the graphs of y = and y = b|xz — al, where a and b are 
r-a 


positive constants. [2] 





(ii) Hence, or otherwise, solve the inequality < b|x - al. [4] 


w—-a 
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Exercise 533. (9758 N2017/I1/4.) (Answer on p. 1995.) 





Ar +9 
A curve C’ has equation y = sale ; 
C+ 
(i) Show that the gradient of C' is negative for all points on C. [3] 





b 
(ii) By expressing the equation of C in the form y = a+ re where a and b are constants, 
a 


write down the equations of the asymptotes of C. [3] 


(iii) Describe a pair of transformations which transforms the graph of C on to the graph 


1 
of y= -—. [2] 
x 


Exercise 534. (9758 N2017/I/5.) (Answer on p. 1995.) 


When the polynomial x + ax? + br +c is divided by (2-1), (2-2) and (2-3), the 
remainders are 8, 12 and 25 respectively. 


(i) Find the values of a, b and c. [4] 


A curve has equation y = f (x), where f (x) = 2? + ax? + bx +, with the values of a, b and 
c found in part (i). 


(ii) Show that the gradient of the curve is always positive. Hence explain why the equation 


f(z) =0 has only one real root and find this root. [3] 
(iii) Find the x-coordinates of the points where the tangent to the curve is parallel to the 
line y = 2x -3. [3] 
Exercise 535. (9758 N2017/II/1.) (Answer on p. 1996.) 

A curve C’ has parametric equations 

3 
a = 2t. 
ae y 

(i) The line y = 2x cuts C at the points A and B. Find the exact length of AB. [3] 


3 
(ii) The tangent at the point P (= 2») on C’ meets the x-axis at D and the y-axis at LE. 
Dp 


The point F' is the midpoint of DE. Find a cartesian equation of the curve traced by 
F as p varies. [5] 
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Exercise 536. (9758 N2017/II/3.) (Answer on p. 1996.) 


(a) The curve y = f (x) cuts the axes at (a,0) and (0,0). It is given that f~' (a) exists. 
State, if it is possible to do so, the coordinates of the points where the following curves 
cut the axes. 


(i) y= f (22). 
(ii) y= f(v-1). 
(iii) y = f (24-1). 
(iv) y= f" (2). 4] 
(b) The function g is defined by 


1 
ae ae er where x¢ R, x #a. 
=a 


(i) State the value of a and explain why this value has to be excluded from the 





domain of g. [2] 
(ii) Find g? (x) and g™! (x), giving your answers in simplified form. [4] 
(iii) Find the values of b such that g? (b) = g7! (0). [2] 
Exercise 537. (9740 N2016/I/1.) (Answer on p. 1996.) 
An? + 4a -14 
Express — ua 5 — (x +3) as a single simplified fraction. [2] 
GF — 


Hence, without using a calculator, solve the inequality 


4e? + 4a - 14 
ee 


re r+3. [3] 


Exercise 538. (9740 N2016/I/3.) (Answer on p. 1997.) 


The curve y = x* is transformed onto the curve with equation y = f(x). The turning 
point on y = x* corresponds to the point with coordinates (a,b) on y = f (x). The curve 
y = f (x) also passes through the point with coordinates (0,c). Given that f (2) has the 
form k (a -1) 44m and that a, b and c are positive constants with c > b, express k, | and 


m in terms of a, b and c. [2] 
By sketching the curve y = f (x), or otherwise, sketch the curve y = ray State, in terms 
of a, b and c, the coordinates of any points where y = 7 crosses the axes and of any 
turning points. [4] 
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Exercise 539. (9740 N2016/1/10.) (Answer on p. 1998.) 


(a) The function f is given by f:v7 1+ V2, for re R, x20. 
(i) Find f(z) and state the domain of f7!. [3] 
(ii) Show°™ that if ff (x) =a then 2° - 427+ 4x -1=0. 
(iii) Hence find the value of x for which ff (x) =x. 
(iv) Explain why this value of x satisfies the equation f (2) = f7' (x). [5] 


(b) The function g, with domain the set of non-negative integers, is given by 


1, for n= 0, 
1 
g(a) =<2 +9(5n), for n even, 
1l+g(n-1), — for n odd. 


(i) Find g (4), g(7), and g(12). [3] 
(ii) Does g have an inverse? Justify your answer. [2] 
Exercise 540. (9740 N2015/I/1.) (Answer on p. 1998.) 


A curve C’ has equation 
a 
y=, +br+e, 
i 


where a, b and care constants. It is given that C’ passes through the points with coordinates 
(1.6,-2.4) and (-0.7,3.6), and that the gradient of C is 2 at the point where zx = 1. 








(i) Find the values of a, b and c, giving your answers correct to 3 decimal places. [4] 
(ii) Find the z-coordinate of the point where C’ crosses the z-axis, giving your answer 
correct to 3 decimal places. [2] 

(iii) One asymptote of C is the line with equation x = 0. Write down the equation of the 
other asymptote of C. [1] 
Exercise 541. (9740 N2015/1/2.) (Answer on p. 1999.) 


. 
(i) Sketch the curve with equation y = 








, Stating the equations of the asymptotes. 


On the same diagram, sketch the line with equation y = x + 2. [3] 





<aut2. [3] 


ae 
(iv) Solve the inequality F 








°¢Originally, parts (a)(ii), (a) (iii), and (a)(iv) here were combined into a single part (a)(ii) . But for 
great clarity, I’ve split the original (a) (ii) into three separate parts. 
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Exercise 542. (9740 N2015/T/5.) (Answer on p. 2000.) 
(i) State a sequence of transformations that will transform the curve with equation y = x” 
on to the curve with equation y = 0.25 (x - 3)?. [2] 


A curve has equation y = f (2) where 


1, mor <9 < A, 


Y=10,.25(¢=3)",  forl<2<3, 


0, otherwise. 
(ii) Sketch the curve for -l <a <4. [3] 
(iii) On a separate diagram, sketch the curve with equation y = 1+ f (0.5x), for -l<a<4. 
[2] 
Exercise 543. (9740 N2015/II/3.) (Answer on p. 2001.) 
(a) The function f is defined by 
f Hee ied ise" “LE IN, LT > i 
(i) Show that f has an inverse. [2] 


(ii) Find f~' (x) and state the domain of f~'. [ 


a 


(b) The function g is defined by 
2 xéeR, x#l. 
—£ 


Find algebraically the range of g, giving your answer in terms of V3 as simply as 





possible. [5] 
Exercise 544. (9740 N2014/I/1.) (Answer on p. 2001.) 
The function f is defined by 

1 
y= xeR, c#1,2#0. 
Lae 
(i) Show that f? (x) = f71 (2). [4] 
(ii) Find f°? (x) in simplified form. [1] 


1378, Contents www.EconsPhDTutor.com 


Exercise 545. (9740 N2014/1/4.) (Answer on p. 2001.) 


The diagram shows the curve y = f (x). The curve crosses the z-axis at the points A, B 
and C’, and has a maximum turning point at D where it crosses the y-axis. The coordinates 
of A, B, C and D are (-a,0), (6,0), (c,0) and (0,d) respectively, where a, b, c and d are 
positive constants. 


° 2 2 . 7 > . . 
(i) Sketch the curve y* = f (x), stating, in terms of a, 6, c and d, the coordinates of any 


turning points and of the points where the curve crosses the x-axis. [4] 
(ii) What can be said about the tangents to the curve y? = f(x) at the points where it 
crosses the z-axis? [1] 
Exercise 546. (9740 N2014/II/1.) (Answer on p. 2002.) 
A curve C has parametric equations x = 3t?, y = 6t. 
(i) Find the value of t at the point on C' where the tangent has gradient 0.4. [3] 
(ii) The tangent at the point P (3p", 6p) on C' meets the y-axis at the point D. Find the 
cartesian equation of the locus of the mid-point of PD as p varies. [4] 





Exercise 547. (9740 N2013/I/2.) (Answer on p. 2002.) 
It is given that 


2 
= a xeR, cel. 
i ed | 
Without using a calculator, find the set of values that y can take. [5] 





>°Tf p = 0, then P = (0,0) and the tangent at P is vertical, so that D could be any point on the y-axis. 
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Exercise 548. (9740 N2013/1/3.) (Answer on p. 2002.) 


(i) Sketch the curve with equation 





_ eel 
a a 
stating the equations of any asymptotes and the coordinates of the points where the 
curve crosses the axes. [4] 


(ii) Solve the inequality 








1 
aah 1] 
2£-1 
Exercise 549. (9740 N2013/II/1.) (Answer on p. 2003.) 
Functions f and g are defined by 
2 
fran i geR c#1, 
l-g¢ 


Gea Loa xeR. 


(i) Explain why the composite function fg does not exist. [2] 
(ii) Find°’® an expression for gf (x). 
(iii) And hence, or otherwise, find (gf)~' (5). [4] 
Exercise 550. (9740 N2012/I/1.) (Answer on p. 2004.) 


A cinema sells tickets at three different prices, depending on the age of the customer. The 
age categories are under 16 years, between 16 and 65 years, and over 65 years. Three groups 
of people, A, B, and C,, go to the cinema on the same day. The numbers in each category 
for each group, together with the total cost of the tickets for each group, are given in the 
following table. 














Group | Under 16 years | Between 16 and 65 years | Over 65 years | Total cost 
A 9 6 4 $162.03 
B f 5 3 $128.36 
C 10 4 5 $158.50 























Write down and solve equations to find the cost of a ticket for each of the age categories. |4] 





>°°Originally, parts (ii) and (iii) here were combined into a single part (ii). But for great clarity, I’ve split 
the original (ii) into two separate parts. 
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Exercise 551. (9740 N2012/1/7.) (Answer on p. 2004.) 


A function f is said to be self-inverse if f (x) = f~'(x) for all x in the domain of f. The 
function g is defined by 





where k is a constant, k # —-1. 


(i) Show that g is self-inverse. [2] 


(ii) Given that k > 0, sketch the curve y = g(x), stating the equations of any asymptotes 
and the coordinates of any points where the curve crosses the x- and y-axes. [3] 


(iii) State the equation of one line of symmetry of the curve in part (ii), and describe 
fully a sequence of transformations which would transform the curve y = 1/x onto this 








curve. [4] 
Exercise 552. (9740 N2012/II/3.) (Answer on p. 2005.) 
It is given that f (x) = 2° +2? - 22-4. 

(i) Sketch the graph of y = f (x). [1] 
(ii) Find the integer solution of the equation f (x) = 4, and prove algebraically that there 

are no other real solutions. [3] 
(iii) State the integer solution of the equation (x + 3)? +(x+3)?-2(%+3)-4=4. [1] 
(iv) Sketch the graph of y =|f («)]. [1] 

(v) Write down two different cubic equations which between them give the roots of the 

equation | f («)| = 4. Hence find all the roots of this equation. [4] 

Exercise 553. (9740 N2011/I/1.) (Answer on p. 2007.) 


Without using a calculator, solve the inequality 


2 


r+at+l 


——____—— '#'(}; 4 
e2+n-2 4] 


Exercise 554. (9740 N2011/I/2.) (Answer on p. 2007.) 


It is given that f (x) = ax? + bx +c, where a, b, and c are constants. 


(i) Given that the curve with equation y = f(a) passes through the points with coor- 
dinates (-1.5,4.5), (2.1,3.2) and (3.4, 4.1), find the values of a, 6, and c. Give your 
answers correct to 3 decimal places. [3] 


(ii) Find the set of values of x for which f (x) is an increasing function. 
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Exercise 555. (9740 N2011/II/3.) (Answer on p. 2007.) 
The function f is defined by 


1 
fixe In(2x4+1)+3, zeR, £>->. 


(i) Find f-' (x) and write down the domain and range of f7!. [4] 


(ii) Sketch on the same diagram the graphs of y = f (x) and y = f~' (x) giving the equa- 
tions of any asymptotes and the exact coordinates of any points where the curves 
cross the x- and y-axes. [4] 


(iii) Explain why the z-coordinates of the points of intersection of the curves in part (ii) 
satisfy the equation In(27+1) = x — 3, and find the values of these x-coordinates, 
correct to 4 significant figures. [3] 


Exercise 556. (9740 N2010/I/5.) (Answer on p. 2008.) 


The curve with equation y = x is transformed by a translation of 2 units in the positive 
x-direction, followed by a stretch with scale factor 0.5 parallel to the y-axis, followed by a 
translation of 6 units in the negative y-direction. 


(i) Find the equation of the new curve in the form y = f (x) and the exact coordinates of 
the points where this curve crosses the x- and y-axes. Sketch the new curve. [5] 


(ii) On the same diagram, sketch the graph of y = f~!(x), stating the exact coordinates 
of the points where the graph crosses the x- and y-axes. [3] 
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Exercise 557. (9740 N2010/II/4.) (Answer on p. 2009.) 


The function f is defined as follows. 


1 
jf ii for xe R, x #-1, 21. 
| ae | 
(i) Sketch the graph of y = f (x). [1] 
(ii) If the domain of f is further restricted to x > k, state with a reason the least value of 
k for which the function f~! exists. [2] 


In the rest of the question, the domain of f is x ¢ R, x # -1, x #1, as originally defined. 


The function g is defined as follows. 


i 
gi: rH —, for7 eR, ee 2, 223,e24. 
Z= 3 
(iii) Show that fg (2) (x -3) (2] 
Ww = ——__——_.. 
eC cm) 
(iv) Solve the inequality fg (a) > 0. [3] 
(v) Find the range of fg. [3] 
Exercise 558. (9740 N2009/I/1.) (Answer on p. 2010.) 
(i) The first three terms of a sequence are given by wu, = 10, u2 = 6, ug = 5. Given that u, 
is a quadratic polynomial in n, find uw, in terms of n. [4] 
(ii) Find the set of values of n for which uy, is greater than 100. [2] 
Exercise 559. (9740 N2009/1/6.) (Answer on p. 2011.) 
; x-2 oe, 2 ae 
The curve C; has equation y = ——~. The curve Cy has equation — + — = 1. 
L+2 6 63 

(i) Sketch C, and C, on the same diagram, stating the exact coordinates of any points 
of intersection with the axes and the equations of any asymptotes. [4] 
(ii) Show algebraically that the - poonarneic of the points of intersection of C, and C) 
satisfy the equation 2 (x - 2)? = =(r+ 2); (6 - a), [2] 
(iii) Use your calculator to find these x-coordinates. [2] 
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Exercise 560. (9740 N2009/II/3.) (Answer on p. 2012.) 
The function f is defined by 


ax 





a 
fitr for ceR, o# 7, 


bx -a 
where a and 6 are non-zero constants. 


(i) Find f-' (x). Hence or otherwise find f? (2) and state the range of f?. [5] 


1 
(ii) The function g is defined by g : x & — for all real non-zero x. State whether the 
ae 





composite function fg exists, justifying your answer. [2] 
(ii) Solve the equation f~! (x) =z. [3] 
Exercise 561. (9740 N2008/1/9.) (Answer on p. 2012.) 
It is given that 
ax +b 
I(2)= cx +d’ 


for non-zero constants a, b, c, and d. 
(i) Given that ad - bc # 0, show by differentiation that the graph of y = f(a) has no 
turning points. [3] 
(ii) What can be said about the graph of y = f (2) when ad - bc = 0? [2] 
(iii) Deduce from part (i) that the graph of 
_ 32-7 
r+] 





Y 


has a positive gradient at all points of the graph. [1] 
(iv) On separate diagrams, draw sketches of the graphs of 
a= Ty 





including the coordinates of the points where the graphs cross the axes and the equa- 
tions of any asymptotes. [5] 
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Exercise 562. (9233 N2008/1/14.) (Answer on p. 2014.) 


Sketch, on separate diagrams, the curves 











‘ x : : ; 

(i) y=— 7 stating the equations of the asymptotes, [4] 

1 al 

ar ez ; - : 

(ii) y? =— 7 making clear the form of the curve at the origin. [3] 

aa 
(iii) Show that the x-coordinates of the points of intersection of the curves y = — i and 
y =e” satisfy the equation 2? = 1+z2e”. | [1] 
(iv) Use the iterative formula 2,4) = /1+2,e°*", together with a suitable initial value 21, 
to find the positive root of this equation correct to 2 decimal places. [2] 
Exercise 563. (9740 N2008/II/4.) (Answer on p. 2016.) 


The function f is defined by f: 2+ (x-4)?+1 for re R, x>4. 
(i) Sketch the graph of y = f (x). Your sketch should indicate the position of the graph 


in relation to the origin. [2] 
(ii) Find f-' (2), stating the domain of f~t. [3] 
(iii) On the same diagram as in part (i), sketch the graph of y = f7'(z). [1] 


(iv) Write down the equation of the line in which the graph of y = f (x) must be reflected 
in order to obtain the graph of y = f~ (x).°?" 


(v) And hence find the exact solution of the equation f (x) = f~' (2). [5] 


Exercise 564. (9740 N2007/I/1.) (Answer on p. 2018.) 


Qn? -—-x7-19 etree) 


Show that ———————- = 1 = ———_—_, 
es v2 +3242 v2 4+3x24+2 


[1] 
Hence, without using a calculator, solve the inequality 


Qn? —-x -19 


4 
v2 +3242 4 





°°7Originally, parts (iv) and (v) here were combined into a single part (iv). But for great clarity, I’ve 
split the original (iv) into two separate parts. 
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Exercise 565. (9740 N2007/I/2.) (Answer on p. 2018.) 
Functions f and g are defined by 


i 
ieee for c €R, x #3, 


git x for ce R. 


(i) Only one of the composite functions fg and gf exists. Give a definition (including 
the domain) of the composite that exists, and explain why the other composite does 








not exist. [3] 
(ii) Find f~' (x) and state the domain of f7!. [3] 
Exercise 566. (9740 N2007/I/5.) (Answer on p. 2019.) 
20+ 7 B 
Show that the equation y = = can be written as y = A + ——, where A and B are 
r+2 ee 2 
constants to be found. Hence state a sequence of transformations which transform the 
| 20+ 7 
graph of y = — to the graph of y = cairiey [4] 
x L+2 


20 +7 
Sketch the graph of y = 0" giving the equations of any asymptotes and the coordinates 


r+ 
of any points of intersection with the xz- and y-axes. [3] 
Exercise 567. (9740 N2007/II/1.) (Answer on p. 2019.) 


Four friends buys three different kinds of fruits in the market. When they get home they 
cannot remember the individual prices per kilogram, but three of them can remember the 
total amount that they each paid. The weights of fruit and the total amounts paid are 
shown in the following table. 





Suresh | Fandi | Cindy | Lee Lian 











Pineapples (kg) 1.15 | 1.20 | 2.15 1.30 
Mangoes (kg) 0.60 | 0.45 | 0.90 0.25 
Lychees (kg) 0.55 | 0.30 | 0.65 0.50 





Total amount paid in $| 8.28 | 6.84 | 13.05 























Assuming that, for each variety of fruit, the price per kilogram paid by each of the friends 
is the same, calculate the total amount that Lee Lian paid. [6] 


1386, Contents www.EconsPhDTutor.com 


Exercise 568. (9233 N2007/II/4.) (Answer on p. 2020.) 
The function f is defined by 





4 
fite ail xeR, x #3. 

z-3 
(i) State the equations of the two asymptotes of the graph of y = f (zx). [2] 
(ii) Sketch the graph of y = f (x), showing its asymptotes and stating the coordinates of 
the points of intersection with the axes. [3] 
(iii) Find an expression for f~' (a) and state the domain of f~'. [3] 
Exercise 569. (9233 N2006/I1/3.) (Answer on p. 2020.) 


Functions f and g are defined by 


fiver 52443, x>O0, 
3 
gitre-—, ZS 0: 
a 
(i) Find, in a similar form, fg, g? and 9°’. [3] 


[Note: g? denotes gg.| 


(ii) Express h in terms of one or both f and g, where 





hig 25r + 18, x > 0. [1] 
Exercise 570. (9233 N2006/II/1.) (Answer on p. 2021.) 
Solve the inequality 
=o 
<1 5 
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134. Past-Year Questions for Part II. Sequences and Series 








Exercise 571. (9758 N2019/I/6.) (Answer on p. 2022.) 
n 1 
(i) By writing ree in partial fractions, find an expression for 2, ae [4] 
= 1 
(ii) Hence find the exact value of )> Fei [2] 
r=11 
Exercise 572. (9758 N2019/I/8.) (Answer on p. 2022.) 


(i) An arithmetic series has first term a and common difference 2a where a # 0. A 
geometric series has first term a and common ratio 2. The Ath term of the geometric 
series is equal to the sum of the first 64 terms of the arithmetic series. Find the value 
of k. [3] 


(ii) A geometric series has first term f and common ratio r where f,r € R and f #0. The 
sum of the first four terms of the series is 0. Find the possible values of f and r. Find 
also, in terms of f, the possible values of the sum of the first n terms of the series. |4] 


(iii) The first term of an arithmetic series is negative. The sum of the first four terms of 
the series is 14 and the product of the first four terms of the series is 0. Find the 11th 


term of the series. [4] 
Exercise 573. (9758 N2018/1/8.) (Answer on p. 2023.) 
A sequence uj, U2,U3,... is such that uns = 2u, + An, where A is a constant and n> 1. 

(i) Given that wu, = 5 and uz = 15, find A and uz. [2] 





It is known that the nth term of this sequence is given by 


Un = a(2") + bn +¢, 


where a, b and c are constants. 


(i) Find a, b and c. [4] 
(ii) Find )°u, in terms of n. (You need not simplify your answer.) [4] 
r=l1 





558T ast examined in 2017. 
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Exercise 574. (9758 N2018/1/11.) (Answer on p. 2023.) 


Mr Wong is considering investing money in a savings plan. One plan, P, allows him to 
invest $100 into the account on the first day of every month. At the end of each month the 
total in the account is increased by a%. 


(i) It is given that a = 0.2. 
(a) Mr Wong invests $100 on 1 January 2016. Write down how much this $100 is 


worth at the end of 31 December 2016. [1] 
(b) Mr Wong invests $100 on the first day of each of the 12 months of 2016. Find the 
total amount in the account at the end of 31 December 2016. [3] 





(c) Mr Wong continues to invest $100 on the first day of each month. Find the month 
in which the total in the account will first exceed $3000. Explain whether this 
occurs on the first or last day of the month. [5] 


An alternative plan, Q, also allows him to invest $100 on the first day of every month. Each 
$100 invested earns a fixed bonus of $b at the end of every month for which it has been in 
the account. This bonus is added to the account. The accumulated bonuses themselves do 
not earn any further bonus. 


(ii) (a) Find, in terms of b, how much $100 invested on 1 January 2016 will be worth at 
the end of 31 December 2016. [1] 


(b) Mr Wong invests $100 on the first day of each of the 24 months in 2016 and 2017. 
Find the value of b such that the total value of all the investments, including 
bonuses, is worth $2800 at the end of 31 December 2017. [3] 


It is given instead that a =1 for plan P. 


(ii) Find the value of 6b for plan Q such that both plans give the same total value in the 
account at the end of the 60th month. [3] 
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Exercise 575. (9758 N2017/1/9.) (Answer on p. 2024.) 


nm 
(a) A sequence of numbers wy, ue, Ug, ... has a sum S, where S,, = > uy. It is given that 
r=1 
S, = An? + Bn, where A and B are non-zero constants. 


























(i) Find an expression for u, in terms of A, B and n. Simplify your answer. [3] 

(ii) It is also given that the tenth term is 48 and the seventeenth term is 90. Find A 

and B. [2| 

(b) Show that r? (r+ 1)’ -(r-1)’r? = kr®, where k is a constant to be determined. Use 
nm 

this result to find a simplified expression for )°r°. [4] 
r=1 

(c) D’Alembert’s ratio test states that a series of the form )\a, converges when 

r=0 
. An+1 : . An+1 . An+1 . 
lim |——] < 1, and diverges when lim > 1. When lim = 1, the test is 


Co ar 
inconclusive. Using the test, explain why the series > — converges for all real values 
r=0 T: 


of x and state the sum to infinity of this series, in terms of «. [4] 


Exercise 576. (9758 N2017/II/2.) (Answer on p. 2024.) 


An arithmetic progression has first term 3. The sum of the first 13 terms of the progression 
is 156. 


(i) Find the common difference. [2] 


A geometric progression has first term 3 and common ratio r. The sum of the first 13 terms 
of the progression is 156. 


(ii) Show that r'’ - 52r +51 =0. Show that the common ratio cannot be 1 even though 
r=1 isa root of this equation. Find the possible values of the common ratio. [4] 
(iii) It is given that the common ratio of the geometric progression is positive, and that 
the nth term of this geometric progression is more than 100 times the nth term of the 
arithmetic progression. Write down an inequality, and hence find the smallest possible 
value of n. [3] 


Exercise 577. (9740 N2016/I/4.) (Answer on p. 2025.) 


An arithmetic series has first term a and common difference d, where a and d are non-zero. 
A geometric series has first term b and common ratio r, where b and r are non-zero. It is 
given that the 4th, 9th and 12th terms of the arithmetic series are equal to the 5th, 8th 
and 15th terms of the geometric series respectively. 


(i) Show that r satisfies the equation 5r’° - 8r° +3 = 0. Given that |r| < 1, solve this 
equation, giving your answer correct to 2 decimal places. [4] 
(ii) Using this value of r, find, in terms of b and n, the sum of the terms of the geometric 
series after, but not including, the nth term, simplifying your answer. [3] 
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Exercise 578. (9740 N2016/I/6.) (Answer on p. 2024.) 


(i) Prove by the method of mathematical induction that 


sags 1) = an (n+l) (n?+n+2). [5] 


(ii) A sequence uo, U1, U2, ...is given by 
Uy = 2 and Un = Un-1 +n? +n for n> 1. 
Find wy, us, and us. [2] 
(iii) By considering 3 (u, — Ur-1), find a formula for u,, in terms of n. [3] 
f=1 
Exercise 579. (9740 N2015/1/8.) (Answer on p. 2026.) 


Two athletes are to run 20km by running 50 laps around a circular track of length 400m. 
They aim to complete the distance in between 1.5 hours and 1.75 hours inclusive. 


(i) Athlete A runs the first lap in T seconds and each subsequent lap takes 2 seconds 
longer than the previous lap. Find the set of values of T which will enable A to 
complete the distance within the required time interval. [4] 


(ii) Athlete B runs the first lap in t seconds and the time for each subsequent lap is 2% 
more than the time for the previous lap. Find the set of values of t which will enable 
B to complete the distance within the required time interval. [4] 


(iii) Assuming each athlete completes the 20 km run in exactly 1.5 hours, find the difference 
in the athletes’ times for their 50th laps, giving your answer to the nearest second. |3] 


Exercise 580. (9740 N2015/II/4.) (Answer on p. 2026.) 


(a) Prove by the method of mathematical induction that 


if 9 
1x3x642x4x74+3x5x84---+n(n+2) (n+ 5) =n (n+ 1) (3n* + 3ln +74). [6] 





2 A B 
(b) (i) Show that Ta erag cn be expressed as aa ee where A and B are 
constants to be determined. [1] 
The sum 3 oe is denoted by S 
4 Ar? + 8r +3 Eon 
(ii) Find an expression for S,, in terms of n. [4] 


(iii) Find the smallest value of n for which S;, is within 10~° of the sum to infinity. [3] 
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Exercise 581. (9740 N2014/1/6.) (Answer on p. 2027.) 
(a) A sequence pj, p2, p3, ... is given by 
p, = 1 and pray = 4p, - 7 for n> 1. 


(i) Use the method of mathematical induction to prove that 


C = 4n - 
Pn = a a [5] 
(ii) Find )’ p,. [3] 
r=l1 
(b) The sum S’, of the first n terms of a sequence wy, U2, U3, ... is given by 
if 
S, = 1-——. 
(n+1)! 
(i) Give a reason why the series }° u, converges, and write down the value of the sum 
to infinity. [2] 
(ii) Find a formula for u,, in simplified form. [2] 
Exercise 582. (9740 N2014/II/3.) (Answer on p. 2028.) 
In a training exercise, athletes run from a starting point O to and from a series of points 
A,, Az, A3,..., increasingly far away in a straight line. In the exercise, athletes start at O 


and run stage 1 from O to A; and back to O, then stage 2 from O to A» and back to O, 
and so on. 


O 4m A, 4m A, 4m ee 4m A, 4m Ap 4m Ag 4m A, 4m Ag 


(i) In Version 1 of the exercise (above), the distances between adjacent points are all 4m. 


(a) Find the distance run by an athlete who completes the first 10 stages of Version 
1 of the exercise. [2] 
(b) Write down an expression for the distance run by an athlete who completes n 
stages of Version 1. Hence find the least number of stages that the athlete needs 
to complete to run at least 5 km. [4] 


(ii) In Version 2 of the exercise (below), the distances between the points are such that 
OA, =4m, A, Ao = 4m, Ao A3 = 8mand A,Ans = 2An-1 Ayn. Write down an expression 
for the distance run by an athlete who completes n stages of Version 2. Hence find the 
distance from O, and the direction of travel, of the athlete after he has run exactly 
10km using Version 2. [5] 


O 4m A, 4m A, 8m As 16m A, 


1392, Contents www.EconsPhDTutor.com 


Exercise 583. (9740 N2013/I1/7.) (Answer on p. 2029.) 


A gardener is cutting off pieces of string from a long roll of string. The first piece he cuts 
off is 128cm long and each successive piece is 2/3 as long as the preceding piece. 


(i) The length of the nth piece of string cut off is pcm. Show that Inp = (An+ B)In2+ 
(Cn + D)I\n38, for constants A, B, C and D to be determined. [3] 


(ii) Show that the total length of string cut off can never be greater than 384 cm. [2] 


(iii) How many pieces must be cut off before the total length cut off is greater than 380 cm? 
You must show sufficient working to justify your answer. [4] 





Exercise 584. (9740 N2013/1/9.) (Answer on p. 2029.) 


(i) Prove by the method of mathematical induction that 
1 
yi r (2r? +1) =5n(n+1)(ni+n+l). [5] 
r=1 


(ii) It is given that f (r) = 2r?+3r?+r+24. Show that f (r)-f (r-1) = ar’, for a constant 


a to be determined. Hence find a formula for )° r’, fully factorizing your answer. [5] 





r=l1 
(iii) Find ~ f(r). (You should not simplify your answer.) [3] 
r=1 
Exercise 585. (9740 N2012/1/3.) (Answer on p. 2030.) 
A sequence uj, U2, U3, ... iS given by 
3 n -] . 
wr =2 and tn4 = “ - tor 21. 
(i) Find the exact values of uz and us. [2] 


(ii) It is given that u, > 1 as n > oo. Showing your working, find the exact value of /. [2] 


(iii) For this value of 1, use the method of mathematical induction to prove that 


14 /1\" 
Un = — (= +. [4] 
3 \2 
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Exercise 586. (9740 N2012/II/4.) (Answer on p. 2031.) 


On 1 January 2001 Mrs A put $100 into a bank account, and on the first day of each 
subsequent month she put in $10 more than in the previous month. Thus on 1 February 
she put $110 into the account and on 1 March she put $120 into the account, and so on. 
The account pays no interest. 


(i) On what date did the value of Mrs A’s account first become greater than $5000? [5] 


On 1 January 2001 Mr B put $100 into a savings account, and on the first day of each 
subsequent month he put another $100 into the account. The interest rate was 0.5% per 
month, so that on the last day of each month the amount in the account on that day was 
increased by 0.5%. 


(ii) Use the formula for the sum of a geometric progression to find an expression for the 
value of Mr B’s account on the last day of the nth month (where January 2001 was 
the lst month, February 2001 was the 2nd month, and so on). Hence find in which 


month the value of Mr B’s account first became greater than $5000. [5] 
(iii) Mr B wanted the value of his account to be $5000 on 2 December 2003. What interest 
rate per month, applied from January 2001, would achieve this? [3] 
Exercise 587. (9740 N2011/I/6.) (Answer on p. 2032.) 


(i) Using the formulae for sin(A +B), prove that 


sin(r +5) 6—sin(r-5) 0 = 2cosrdsin 58 [2] 
2 2 Z 
- - ; i! _ 1 
(ii) Hence find a formula for }° cosré in terms of sin (n + 5) 6 and sin 50. [3] 
r=1 


(iii) Prove by the method of mathematical induction that 


A cos 44 — cos (n + +) 6 
> sin 8 = ———__, 
oe 2 sin 50 


for all positive integers n. [6] 





559Otherwise some of the formulae that follow have 0 as denominators and are thus undefined. 
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Exercise 588. (9740 N2011/1/9.) (Answer on p. 2033.) 


(i) A company is drilling for oil. Using machine A, the depth drilled on the first day is 
256 metres. On each subsequent day, the depth drilled is 7 metres less than on the 
previous day. Drilling continues daily up to and including the day when a depth of 
less than 10 metres is drilled. What depth is drilled on the 10th day, and what is the 
total depth when drilling is completed? [6] 


(ii) Using machine B, the depth drilled on the first day is also 256 metres. On each 
subsequent day, the depth drilled in : of the depth drilled on the previous day. How 


many days does it take for the depth drilled to exceed 99% of the theoretical maximum 


total depth? [4] 
Exercise 589. (9740 N2010/I/3.) (Answer on p. 2034.) 
The sum S,, of the first n terms of a sequence uj, U2, U3, ... iS given by 

Sa nlan +c), 


where c is a constant. 








(i) Find uw, in terms of c and n. [3] 
(ii) Find a recurrence relation of the form un, = f (un). [2] 
Exercise 590. (9740 N2010/II/2.) (Answer on p. 2034.) 
(i) Prove by mathematical induction that 
n . ‘| . : 
Sir (r+ 2) ==n(n+1)(2n+7). [5] 
r=1 6 
(ii) (a) Prove by the method of differences that 
o _3 1 1 
> ae ae oy a 
= way: 4 2(n+1) 2(n+2) 





(b) Explain why iy is a convergent series, and state the value of the sum to 
T= 
infinity. [2] 
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Exercise 591. (9740 N2009/I/3.) (Answer on p. 2035.) 
(i) Show that 











1 2 1.4 
n-1 n ntl. n3-n’ 
where A is a constant to be found. [2] 
=. i 
ii) Hence find . (There is no need to express your answer as a single algebraic 
(ii) 3 
rap 7 
fraction. ) [3] 
(iii) Give a reason why the series )’ —— converges, and write down its value. [2] 
roa —7 
Exercise 592. (9740 N2009/I/5.) (Answer on p. 2035.) 
(i) Use the method of mathematical induction to prove that 
n P 1 
yor? ==n(n+1)(2n+1). [4] 
r=1 6 
2n 
(ii) Find )° r’, giving your answer in fully factorized form. [4] 
r=ntl 
Exercise 593. (9740 N2009/1/8.) (Answer on p. 2036.) 


Two musical instruments, A and B, consist of metal bars of decreasing lengths. 


(i) The first bar of instrument A has length 20cm and the lengths of the bars form a 
geometric progression. The 25th bar has length 5cm. Show that the total length of 
all the bars must be less than 357cm, no matter how many bars there are. [4] 


Instrument B consists of only 25 bars which are identical to the first 25 bars of instrument 
A. 


(ii) Find the total length, L.cm, of all the bars of instrument B and the length of the 13th 
bar. [3] 


(iii) Unfortunately the manufacturer misunderstands the instructions and constructs in- 
strument B wrongly, so that the lengths of the bars are in arithmetic progression with 
common difference d cm. If the total length of the 25 bars is still Lcm and the length 
of the 25th bar is still 5cm, find the value of d and the length of the longest bar. [4] 
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Exercise 594. (9740 N2008/1/2.) (Answer on p. 2036.) 


The nth term of a sequence is given by 
Un = n(2n+1), 


for n>1. The sum of the first n terms is denoted by S;,,. Use the method of mathematical 
induction to show that 


i 
Sane 5” (n+ 1) (4n +5) 


for all positive integers n. [5] 


Exercise 595. (9740 N2008/1/10.) (Answer on p. 2036.) 


(i) A student saves $10 on 1 January 2009. On the first day of each subsequent month 
she saves $3 more than in the previous month, so that she saves $13 on 1 February 
2009, $16 on 1 March 2009, and so on. On what date will she first have saved over 
$2000 in total? [5] 


(ii) A second student puts $10 on 1 January 2009 into a bank account which pays com- 
pound interest at a rate of 2% per month on the last day of each month. She puts a 
further $10 into the account on the first day of each subsequent month. 


(a) How much compound interest has her original $10 earned at the end of 2 years? 
[2] 
(b) How much in total is in the account at the end of 2 years? [3] 


(c) After how many complete months will the total in the account first exceed $2000? 
4] 


Exercise 596. (9233 N2008/II/2.) (Answer on p. 2037.) 
il 
An arithmetic progression and a geometric progression each have first term 5° 


1 1 
The sum of their second terms is 5 and the sum of their third terms is 3° Given that the 


geometric progression is convergent, find its sum to infinity. [6] 
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Exercise 597. (9740 N2007/1/9.) (Answer on p. 2038.) 








The diagram shows the graph of y = e” — 3x. The two roots of the equation e” — 3x = 0 are 
denoted by a and 3, where a < (3. 


(i) Find the values of a and (3, each correct to 3 decimal places. [2] 
A sequence of real numbers 71, %2, 73, ... satisfies the recurrence relation 
i 5 
Enzi = =e", tor 1, 
) 
(ii) Prove algebraically that, if the sequence converges, then it converges to either a or /. 


[2] 
(iii) Use a calculator to determine the behaviour of the sequence for each of the cases 
e730, oy = 1 a Se . [3] 


(iv) By considering 2,41; — %,, prove that 


v1 <2, 1 Oe, < Vv, 





Cia Dy 1 oe ea OF > OB, [2] 


(v) State briefly how the results in part (iv) relate to the behaviours determined in (iii). 


[2] 


Exercise 598. (9740 N2007/1/10.) (Answer on p. 2038.) 


A geometric series has common ratio r, and an arithmetic series has first term a and 
common difference d, where a and d are non-zero. The first three terms of the geometric 
series are equal to the first, fourth and sixth terms respectively of the arithmetic series. 


(i) Show that 3r?-5r+2=0. 


(ii) Deduce that the geometric series is convergent and find, in terms of a, the sum 
infinity. 


o. & 


(iii) The sum of the first n terms of the arithmetic series is denoted by S. Given that a > 
find the set of possible values of n for which S exceeds 4a. 








Mo 
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Exercise 599. (9740 N2007/II/2.) (Answer on p. 2039.) 


A sequence uj, U2, ug, ... is such that u; = 1 and 
2+ I 
Unel = Un -— 7a for all n> 1. 
n?(n+1) 
| ee 1 
(i) Use the method of mathematical induction to prove that u, = —. [4] 
ie 
~  2n+1 


(ii) Hence find ds (nel)? [2] 


(iii) Give a reason why the series in part (ii) is convergent and state the sum to infinity. 


2] 


(iv) Use your answer to part (ii) to find ) 


n=2 


Cc CESTE a 


Exercise 600. (9233 N2007/1/14.) (Answer on p. 2040.) 


Use the method of mathematical induction to prove the following result. 


COS le — COS (n a 5) oO 





\>sinra = — , where 0 < x < 271. [6] 
cm 2sin 5% 
2n 
Exercise 601. (9233 N2007/II/1.) Find $°3"*?. [3] (Answer on p. 2040.) 
r=1 
Exercise 602. (9233 N2006/I/1.) (Answer on p. 2040.) 
2 
The sum S,, of the first n terms of a geometric progression is given by S, = 6 —- a Find 
the first term and the common ratio. [4] 
Exercise 603. (9233 N2006/1/11.) (Answer on p. 2040.) 


(i) 


(ii) Deduce that 2° + 4° + 6° +---+ (2n)? = 2n?(n+1)?. [1] 


(iii) Hence or otherwise find 


» @r- as 


simplifying your answer. [4] 
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135. Past-Year Questions for Part III. Vectors 


Exercise 604. (9758 N2019/1/12.) (Answer on p. 2042.) 


A ray of light passes from air into P(2,2;4) 
a material made into a rectangular 
prism. The ray of light is sent in 
the direction (-2,-3,-6) from a light 
source at the point P with coordi- 
nates (2,2,4). The prism is placed so 
that the ray of light passes through 
the prism, entering at the point Q 
and emerging at the point FR and is 
picked up by a sensor at point S with 
coordinates (—5,-6,-7). The acute 
angle between PQ and the normal to 
the top of the prism at @ is @ and 
the acute angle between QR and the 
same normal is @ (see diagram). 


| Normal 





It is given that the top of the prism 
is a part of the plane x+y+z=1, and 
that the base of the prism is a part of 8 (<5.+6,27) 
the plane x + y+ z= 9. —- 


It is also given that the ray of light along PQ is parallel to the ray of light along RS so 
that P, Q, R and S lie in the same plane. 





(i) Find the exact coordinates of Q and R. [5] 
(ii) Find the values of cos@ and cos 3. [3] 
(iii) Find the thickness of the prism measured in the direction of the normal at Q. [3] 





Snell’s law states that sin@ = ksin 9, where k is a constant called the refractive index. 


(iv) Find k for the material of this prism. 
(v) What can be said about the value of & for a material for which 6 > 6? 


a 
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Exercise 605. (9758 N2019/II/5.) 


With reference to the origin O, the points A, B, C and D 

are such that OA = a, OB = b, OC = 2a+ 4a and OD = 

b+5a. The lines BD and AC cross at X (see diagram). 
(i) Express OX in terms of a and b. [4] 


The point Y lies on C’D and is such that the points O, X 
and Y are collinear. 


(ii) Express OY in terms of a and b and find the ratio 
OX : OY. 6] 


Exercise 606. (9758 N2018/I/6.) 


Vectors a, b and c are such that a# 0 and ax 3b=2axc. 


(i) Show that 3b - 2c = Aa, where \ is a constant. 


(Answer on p. 2042.) 


C 





(Answer on p. 2043.) 


2] 


(ii) It is now given that a and ¢ are unit vectors, that the modulus of b is 4 and that the 
angle between b and c is 60°. Using a suitable scalar product, find exactly the two 


possible values of X. 


Exercise 607. (9758 N2018/II/3.) A 
E(0,0, 10) 








B(5,4,0) 


(Exercise continues on the next page ...) 
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(Exercise continued from the previous page ...) An oblique pyramid has a plane base ABC'D 

in the shape of a parallelogram. The coordinates of A, B and C are (5,-4,1), (5,4,0) and 

(—5,4,2) respectively. The apex of the pyramid is at £'(0,0,10) (see diagram). 
(i) Find the coordinates of D. 

(ii) Find the cartesian equation of face BCE. [3 


— 
— 
a Uo. 





(iii) Find the angle between face BCE and the base of the pyramid. 
(iv) Find the shortest distance from the midpoint of edge AD to face BCE. [5 


oO 
ey 


Exercise 608. (9758 N2017/I1/6.) (Answer on p. 2044.) 
(i) Interpret geometrically the vector equation r = a+tb, where a and b are constant 
vectors and ¢ is a parameter. [2] 


(ii) Interpret geometrically the vector equation r-n = d, where n is a constant unit vector 
and d is a constant scalar, stating what d represents. [3] 


(iii) Given that b-n #0, solve the equations r = a+tb and r-n=d to find r in terms of 
a, b, n and d. Interpret the solution geometrically. [3] 











Exercise 609. (9758 N2017/1/10.) (Answer on p. 2044.) 


Electrical engineers are installing electricity cables on a building site. Points (x,y,z) are 
defined relative to a main switching site at (0,0,0), where units are metres. Cables are laid 
in straight lines and the widths of cables can be neglected. 


An existing cable C' starts at the main switching site and goes in the direction (3, 1,-2). 
A new cable is installed which passes through points P (1,2,-1) and Q (5,7,a). 


(i) Find the value of a for which C and the new cable will meet. [4] 


To ensure that the cables do not meet, the engineers use a = —3. The engineers wish to 
connect each of the points P and Q to a point R on C. 


(ii) The engineers wish to reduce the length of cable required and believe in order to do 
this that angle PRQ should be 90°. Show that this is not possible. [4] 


(iii) The engineers discover that the ground between P and R is difficult to drill through 
and now decide to make the length of PR as small as possible. Find the coordinates 
of R in this case and the exact minimum length. [5] 





5°0This is because in the context of two-dimensional space, the vector equation r-n = d describes a line. 
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Exercise 610. (9740 N2016/I/5.) (Answer on p. 2045.) 
The vectors u and v are given by u = 2i-j+2k and v = ai+bk, where a and b are constants. 
(i) Find (u+v) x (u-v) in terms of a and b. [2] 


(ii) Given that the i- and k-components of the answer to part (i) are equal, express 
(u+v) x (u-v) in terms of a only. Hence find, in an exact form, the possible values 


of a for which (u+v) x (u-v) is a unit vector. [4] 
(iii) Given instead that (u+v)-(u-v) =0, find the numerical value of |v}. [2] 
Exercise 611. (9740 N2016/1/11.) (Answer on p. 2046.) 
i! i a 
The plane p has equation r = | —-3 |+A] 2 [+p] 4 |, and the line / has equation 
2 0 —2 
a-1 —2 
r= a +t} 1 |, where a is a constant and 4, and t are parameters. 
a+1 2 


(i) In the case where a = 0, 


(a) show that / is perpendicular to p and find the values of A, . and t which give the 


coordinates of the point at which / and p intersect, [5] 

(b) find the cartesian equations of the planes such that the perpendicular distance 
from each plane to p is 12. [5] 

(ii) Find the value of a such that / and p do not meet in a unique point. [3] 
Exercise 612. (9740 N2015/I/7.) (Answer on p. 2047.) 


Referred to the origin O, points A and B have position vectors a and b respectively. Point 
C' lies on OA, between O and A, such that OC’: CA = 3:2. Point D lies on OB, between 
O and B, such that OD: DB =5:6. 


(i) Find the position vectors OC and OD, giving your answers in terms of a and b. [2] 


3 
(ii) Show that the vector equation of the line BC can be written as r = za +(1-A)b, 


where A is a parameter. Find in a similar form the vector equation of the line AD in 
terms of a parameter [U. [3] 


(iii) Find, in terms of a and b, the position vector of the point E where the lines BC and 
AD meet and find the ratio AE’: ED. [5] 
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Exercise 613. (9740 N2015/II/2.) (Answer on p. 2048.) 
The line Z has equation r = i - 2j - 4k + 4 (21+ 3j — 6k). 

(i) Find the acute angle between L and the z-axis. [2] 
The point P has position vector 2i + 5j — 6k. 


(ii) Find the points on L which are a distance of V33 from P. Hence or otherwise find 
the point on L which is closest to P. [5] 


(iii) Find a cartesian equation of the plane that includes the line LZ and the point P. [3] 


Exercise 614. (9740 N2014/1/3.) (Answer on p. 2048.) 


(i) Given that a x b =0, what can be deduced about the vectors a and b? [2] 

(ii) Find a unit vector n such that n x (i+ 2j - 2k) = 0. [2] 
(iii) Find the cosine of the acute angle between i + 2j- 2k and the z-axis. [1] 
Exercise 615. (9740 N2014/1/9.) (Answer on p. 2048.) 


Planes p and q are perpendicular. Plane p has equation x + 2y — 3z = 12. Plane q contains 
the line / with equation 





el yell 253 
2  -1 4° 
The point A on / has coordinates (1,-1,3). 
(i) Find a cartesian equation of q. [4] 
(ii) Find a vector equation of the line m where p and q meet. [4] 


(iii) B is a general point on m. Find an expression for the square of the distance AB. 
Hence, or otherwise, find the coordinates of the point on m which is nearest to A. [5] 


Exercise 616. (9740 N2013/I/1.) (Answer on p. 2048.) 


Planes p, q and r have equations x-2z = 4, 2x-2y+z=6 and 52-4y+ pz = —9 respectively, 
where jz is a constant. 


(i) Given that p = 3, find the coordinates of the point of intersection of p,qandr. [2] 
(ii) Given instead that ys = 0, describe the relationship between p, q and r. [3] 
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Exercise 617. (9740 N2013/1/6.) A 


-- 
- 
Oe eee ee 
-_-- 
-_ 
-— 


C 





O M 


The origin O and the points A, B and C lie in the same plane, where OA = a, OB =b and 
OC = c (see diagram). 

(i) Explain why c can be expressed as c = Aa + pub, for constants and pu. [1] 
The point N is on AC such that AN: NC =3:4. 

(ii) Write down the position vector of N in terms of a and c. [1] 
(iii) It is given that the area of triangle ONC is equal to the area of triangle OMC, where 


M is the mid-point of OB. By finding the areas of these triangles in terms of a and 
b, find A in terms of ys in the case where A and w are both positive. [5] 


Exercise 618. (9740 N2013/II/4.) (Answer on p. 2049.) 


The planes p; and pz have equations r-(2,-2,1) = 1 and r-(-6,3,2) = -1 respectively, and 
meet in the line J. 


(i) Find the acute angle between p; and py. [3] 
(ii) Find a vector equation for 1. [4] 
(iii) The point A (4, 3,c) is equidistant from the planes p,and p. Calculate the two possible 
values of c. [6] 
Exercise 619. (9740 N2012/I/5.) (Answer on p. 2050.) 


Referred to the origin O, the points A and B have position vectors a and b such that 
a=i-j+k and b=i+2j. The point C has position vector c given by c = Aa+ pub, where 
A and p are positive constants. 


(i) Given that the area of triangle OAC is V 126, find p. [4] 
(ii) Given instead that = 4 and that OC = 5V3, find the possible coordinates of C. [4] 
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Exercise 620. (9740 N2012/1/9.) (Answer on p. 2050.) 
(i) Find a vector equation of the line through the points A and B with position vectors 
7i+ 8j+ 9k and -i- 8j+k respectively. [3] 


(ii) The perpendicular to this line from the point C with position vector i+8j+3k meets 
the line at the point N. Find the position vector of N and the ratio AN: NB. _ [5] 


(iii) Find a cartesian equation of the line which is a reflection of the line AC in the line 
7m 








Exercise 621. (9740 N2011/I/7.) A. (Answer on p. 2051.) 


O B 
Q 
Referred to the origin O, the points A and B are such that OA =aand OB =b. The point 
P on OA is such that OP: PA=1:2, and the point Q on OB is such that OQ: QB = 3: 2. 
The mid-point of PQ is M (see diagram). 


ee 

(i) Find OM in terms of a and b and show that the area of triangle OM P can be written 
as k|a x b|, where & is a constant to be found. [6] 

(ii) The vectors a and b are now given by a = 2pi — 6pj + 3pk and b =i+j-2k, where p is 
a positive constant. Given that a is a unit vector, 


(a) find the exact value of p, [2] 
(b) give a geometrical interpretation of |a- bl, [1] 
(c) evaluate a xb. [2] 
Exercise 622. (9740 N2011/I/11.) (Answer on p. 2051.) 


The plane p passes through the points with coordinates (4,-1,-3), (—2,-5, 2) and (4, -3, -2). 
(i) Find a cartesian equation of p. [4] 


The lines /; and Jy have, respectively, equations 


—] —2 3 2 —] —3 
a _y72_ Zt Saaie _y-l_2 | 


1 1 sy) k; 














iS) 
| 
aw 


where k is a constant. It is given that /; and [> intersect. 


(ii) Find the value of k. [4] 
(iii) Show that llies in p and find the coordinates of the point at which ly intersects p. [4] 
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CO 


(iv) Find the acute angle between /y and p. [ 


Exercise 623. (9740 N2010/I/1.) (Answer on p. 2052.) 


The position vectors a and b are given by 
a = 2pi+ 3pj + 6pk and b =i- 2j + 2k, 


where p > 0. It is given that |al = |b]. 

















(i) Find the exact value of p. [2] 
(ii) Show that (a+b)-(a-b) =0. [3] 
Exercise 624. (9740 N2010/1/10.) (Answer on p. 2052.) 
The line / and plane p have, respectively, equations 
— 10 1 3 
ad My Ae cd and x -2y-3z=0. 
—3 6 9 
(i) Show that / is perpendicular to p. [2] 
(ii) Find the coordinates of the point of intersection of | and p. [4] 
(iii) Show that the point A with coordinates (—2, 23,33) lies on J. Find the coordinates of 
the point B which is the mirror image of A in p. [3] 
(iv) Find the area of triangle OAB, where O is the origin, giving your answer to the nearest 
whole number. 3] 
Exercise 625. (9740 N2009/1/10.) (Answer on p. 2052.) 


The planes p; and po have equations r- (2,1,3) = 1 and r- (-1,2,1) = 2 respectively, and 
meet in a line J. 

(i) Find the acute angle between p; and py. [3] 
(ii) Find a vector equation of 1. [4] 


(iii) The plane p3 has equation 22 + y+3z-1+k(-a%+2y+z-2)=0. Explain why / lies 
in p3 for any constant k. Hence, or otherwise, find a cartesian equation of the plane 
in which both / and the point (2,3, 4) lie. [5] 


Exercise 626. (9740 N2009/II/2.) (Answer on p. 2053.) 


Relative to the origin O, two points A and B have position vectors given by a = 14i+14j+14k 
and b = 11i- 13j + 2k respectively. 


(i) The point P divides the line AB in the ratio 2:1. Find the coordinates of P. [2] 
(ii) Show that AB and OP are perpendicular. [2] 


—> 
(iii) The vector c is a unit vector in the direction of OP. Write c as a column vector, and 
give the geometrical meaning of |a- c]. [2] 

= 
(iv) Find a x p, where p is the vector OP, and give the geometrical meaning of |a x pl. 
Hence write down the area of the triangle OAP. 4 
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Exercise 627. (9740 N2008/I/3.) (Answer on p. 2053.) 


Points O, A, B are such that OA = i+ 4j-3k and OB = 5i —j, and the point P is such that 
OAPB is a parallelogram. 


(i) Find OP. (1] 
(ii) Find the size of angle AOB. [3] 
(iii) Find the exact area of the parallelogram OAPB. [2] 
Exercise 628. (9740 N2008/1/11.) (Answer on p. 2053.) 


The equations of three planes p;, po, p3 are 


20 - oy + 3zZ = 
Sf + 2-92 =—d, 
5x2 + Ay+17z = p, 
respectively, where \ and pare constants. When A = —20.9 and p = 16.6, find the coordinates 
of the point at which these planes meet. [2] 
The planes p; and po intersect in a line J. 
(i) Find a vector equation of 1. 4 
(ii) Given that all three planes meet in the line J, find \ and wp. 3] 


(iii) Given instead that the three planes have no points in common, what can be said about 
the values of A and pu? 2| 








(iv) Find the cartesian equation of the plane which contains / and the point (1,-1,3). [4 


Exercise 629. (9233 N2008/I/11.) (Answer on p. 2054.) 


The cartesian equations of two lines are 


ze yt2 2-5 z-1l yt3 z-4 


and 








i Z -l = —3 1 
(i) Show that the lines intersect and state the point of intersection. [5] 
(ii) Find the acute angle between the lines. [4] 


Exercise 630. (9740 N2007/I/6.) (Answer on p. 2054.) 
Referred to the origin O, the position vectors of the points A and B are i-j+2k and 
21+ 4j + k respectively. 

(i) Show that OA is perpendicular to OB. 


(ii) Find the position vector of the point M on the line segment AB such that AM: MB 
12. [ 
(iii) The point C has position vector —4i+2j+2k. Use a vector product to find the exa 
area of triangle OAC. 


~ 


oO 
BG ww, 
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Exercise 631. (9740 N2007/1/8.) (Answer on p. 2054.) 


The line / passes through the points A and B with coordinates (1,2,4) and (-2,3,1) 
respectively. The plane p has equation 3x - y+2z=17. Find 


(i) the coordinates of the point of intersection of | and p, [5] 
(ii) the acute angle between / and p, [3] 
(iii) the perpendicular distance from A to p. [3] 
Exercise 632. (9233 N2007/I1/7.) (Answer on p. 2054.) 


The point P is the foot of the perpendicular from the point A (1,3,-2) to the line given by 


a+3 y-8 z-3 
2 2 Se 


Find the coordinates of P, and hence find the length of AP. [7] 





Exercise 633. (9233 N2007/II/2.) (Answer on p. 2054.) 


Referred to an origin O the position vectors of two points A and B are 4i+j+3k and i-3j+4k 
—> —_ —_ —> 
respectively. Two other points, C' and D, are given by OC = 0.250A and OD =0.75OB. 


(i) Find a vector equation for the line AD. [2] 
(ii) Find the position vector of the point of intersection of AD and BC. [5] 
Exercise 634. (9233 N2006/1/14.) (Answer on p. 2055.) 


The points A, B, C and D have position vectors i- 2j+5k, i+3j, 10i+j+2k and -2i+4j+5k 
respectively, with respect to an origin O. The point P on AB is such that AP: PB =: 1-2 


and the point Q on C’'D is such that CQ: QD = w:1-p. Find OP and OQ in terms of A 
and ju respectively. [3] 


Given that PQ is perpendicular to both AB and CD, 


(i) show that PQ =i+2j+2k, (7] 
(ii) Find the area of triangle ABQ. [2] 
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136. Past-Year Questions for Part IV. Complex Numbers 








Exercise 635. (9758 N2019/I/1.) (Answer on p. 2056.) 
The function f is defined by f (z) = az? +bz* +cz+d, where a, b, c and d are real numbers. 
Given that 2+i and —3 are roots of f (z) =0, find b, c and d in terms of a. [4] 
Exercise 636. (9758 N2019/I/9.) (Answer on p. 2056.) 
(i) The complex number w can be expressed as cos@ + isin 0. 
il 
(a) Show that w + — is a real number. [2] 
w 
-1 1 
(b) Show that i | can be expressed as ktan 5% where k is a complex number to 
w 
be found. [4] 
(ii) The complex number z has modulus 1. Find the modulus of the complex number 
2= di 
= 5] 
1+ 3iz 
Exercise 637. (9758 N2018/II/2.) (Answer on p. 2057.) 


(i) One of the roots of the equation Ag’ — 20x? + sx? — 56a +t =0, where s and t are real, 
is 2- 3i. Find the other roots of the equation and the values of s and tf. [5] 


(ii) The complex number w is such that w? = 27. 


(a) Given that one possible value of w is 3, use a non-calculator method to find 
the other possible values of w. Give your answers in the form a+ib, where a and 
b are exact values. [3] 
(b) Write these values of w in modulus-argument form and represent them on an 
Argand diagram. [2] 








(c) Find the sum and the product of all the possible values of w, simplifying your 
answers. [2] 


Exercise 638. (9758 N2017/I/8.) (Answer on p. 2058.) 


Do not use a calculator in answering this question. 


(a) Find the roots of the equation z?(1-i) - 2z + (5+5i) = 0, giving your answers in 
cartesian form a + ib. [3] 


(b) (i) Given that w =1-i, find w?, w* and w* in cartesian form. Given also that 
w* + pw + 39w? + qw + 58 = 0, 


where p and q are real, find p and q. [4] 


(ii) Using the values of p and q in part (b)(i), express w* + pw? + 39w? + qw + 58 as the 
product of two quadratic factors. [3] 
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Exercise 639. (9740 N2016/1/7.) (Answer on p. 2058.) 


Do not use a calculator in answering this question. 


(a) Verify that -1+ 5i is a root of the equation w? + (-1-8i) w+ (-17+ 7i) = 0. Hence, or 
otherwise, find the second root of the equation in cartesian form, p+ ig, showing your 
working. [5] 

(b) The equation 2° - 52*+16z+k =0, where k is a real constant, has a root z = 1 + ai, 
where a is a positive real constant. Find the values of a and k, showing your working. 


5] 


Exercise 640. (9740 N2016/II/4.) (Answer on p. 2059.) 


(a) Two loci in the Argand diagram are given by the equations 
jz-3-iJ=1 and arg z= a, where tana = 0.4. 


The complex numbers z,and 22, where |z;| < |z2|, correspond to the points of intersection 
of these loci. 


(i) Draw an Argand diagram to show both loci, and mark the points represented by 
Zz, and Zo. 2 


(ii) Find the two values of z which represent points on |z - 3 - i] = 1 such that |z - z 


Z— 2). 4 


(b) (i) The complex number 2 - 2i is denoted by w. By writing w in polar form re’, 
where r >0 and -71< 6 <7, find exactly all the cube roots of w in polar form. 3] 








shy sae ses 1 
(ii) Find the smallest positive whole number value of n such that arg (w*w”) = aT 3 


Exercise 641. (9740 N2015/1/9.) (Answer on p. 2059.) 


(a) The complex number w is such that w = a+ib, where a and b are non-zero real numbers. 


2 
w 

The complex conjugate of w is denoted by w*. Given that — is purely imaginary, find 
w 


the possible values of w in terms of a. [5] 


. 5 ‘a 
(b) The complex number z is such that z° = -32i. 


(i) Find the modulus and argument of each of the possible values of z. [4] 
(ii) Two of these values are z; and z:, where 0.57 < arg z, < m and -7 < arg 2% < 
—0.57. Find the exact value of arg (2; — z2) in terms of 7 and show that |z; - z: 


2| 
| 


I 


Asin (0.270). A| 
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Exercise 642. (9740 N2014/I/5.) (Answer on p. 2060.) 
It is given that z= 1+ 2i. 


1 
(i) Without using a calculator, find the values of z? and —z in cartesian form x + iy, 
Zz 





showing your working. [4] 
(ii) The real numbers p and q are such that pz? + 5 is real. Find, in terms of p, the value 

of q and the value of pz? + [3] 
Exercise 643. (9740 N2014/II/4.) (Answer on p. 2060.) 
(a) The complex number z satisfies |z + 5 — i] = 4 

(i) On an Argand diagram show the locus of z. [2] 


(ii) The complex number 2 also satisfies |z — 6i| = |z + 10 + 4i]. Find exactly the possible 
values of z, giving your answers in the form x + iy. [4] 


(b) It is given that w= V3 -i. 


(i) Without nsine a calculator, find an exact expression for w®. Give your answer in 


the form re”, where r > 0 and 0< @< 2r. [3] 

(ii) Without using a calculator, find the three smallest positive whole number values 

of n for which is a real number. [4] 

Exercise 644. (9740 N2013/I1/4.) (Answer on p. 2061.) 
The complex number w is given by 1 + 2i. 

(i) Find w? in the form x + iy, showing your working. [2] 

(ii) Given that w is a root of the equation az? + 52*+17z+b = 0, find the values of the 

real numbers a and b. [3] 

(iii) Using these values of a and 8, find all the roots of this equation in exact form. [3] 

Exercise 645. (9740 N2013/I1/8.) (Answer on p. 2062.) 


The complex number z is given by z = re’, where r > 0 and 0< 6 < 0.57. 


(i) Given that w = (1 - iv3) z, find |w| in terms of r and argw in terms of 0. [2] 


(ii) Given that r has a fixed value, draw an Argand diagram to show the locus of z as 4 
varies. On the same diagram, show the corresponding locus of w. You should identify 
the modulus and argument of the endpoints of each locus. [4] 


710 
got find @. [3] 





(iii) Given that arg 
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Exercise 646. (9740 N2012/I/6.) (Answer on p. 2063.) 
Do not use a calculator in answering this question. 
The complex number z is given by z = 1+ ic, where c is a non-zero real number. 

(i) Find 2° in the form z + iy. [2] 
(ii) Given that 2° is real, find the possible values of z. [2] 


(iii) For the value of z found in part (ii) for which c < 0, find the smallest positive integer n 
such that |z"| > 1000. State the modulus and argument of z” when n takes this value. 
4] 


Exercise 647. (9740 N2012/II/2.) (Answer on p. 2063.) 
The complex number z satisfies the equation |z — (7 - 3i)| = 4. 

(i) Sketch an Argand diagram to illustrate this equation. 2] 
(ii) Given that |z| is as small as possible, 


(a) find the exact value of |z], 2] 





(b) hence find an exact expression for z, in the form x + iy. 2 


(iii) It is given instead that -7 < arg z < 7 and that |arg z| is as large as possible. Find the 
value of arg z in radians, correct to 4 significant figures. 3 








Exercise 648. (9740 N2011/I/10.) (Answer on p. 2064.) 

Do not use a graphic calculator in answering this question. 
(i) The roots of the equation z? = -8i are 2, and z. Find z, and zg in cartesian form 
x +iy, showing your working. [4] 


(ii) Hence, or otherwise, find in cartesian form the roots w; and wy of the equation we + 
4w + (4+ 2i) = 0. [3] 
(iii) Using a single Argand diagram, sketch the loci 








(a) lz | =|z— zl, 1 
(b) |z -— wy] = |z- wil, 1] 
(iv) Give a reason why there are no points which lie on both of these loci. 1] 
Exercise 649. (9740 N2011/II/1.) (Answer on p. 2065.) 


The complex number z satisfies |z — 2 - 5i] < 3. 
(i) On an Argand diagram, sketch the region in which the point representing z can lie.[3] 
(ii) Find exactly the maximum and minimum possible values of |z]. [2] 


(iii) It is given that 0 < argz < —. With this extra information, find the maximum value of 


|z -6-i|. Label the point(s) that correspond to this maximum value on your diagram 
with the letter P. [3] 
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Exercise 650. (9740 N2010/I/8.) (Answer on p. 2066.) 
The complex numbers z, and zy are given by 1+iV3 and -1-i respectively. 


(i) Express each of z; and z in polar form r(cos@+isin@), where r >0 and -1<0@< 7. 





Give r and @ in exact form. [2] 
(ii) Find the complex conjugate of *! in exact polar form. [3] 
£2 
(iii) On a single Argand diagram, sketch the loci 
(a) |z-z| = 2, 
Tt 
(b) arg (z- 22) = 1 [4] 
(iv) Find where the locus |z — z;| = 2 meets the positive real axis. [2] 
Exercise 651. (9740 N2010/II/1.) (Answer on p. 2067.) 
(i) Solve the equation x? - 6x + 34 = 0. [2] 
(ii) One root of the equation ¢'+42°+27+ax+b=0, where a and bare real, is x = -2 +i. 
Find the values of a and 6 and the other roots. [5] 
Exercise 652. (9740 N2009/1/9.) (Answer on p. 2068.) 


(i) Solve the equation 


z°—-(1+i) =0, 
giving the roots in the form re’, where r > 0 and -71?<a< 7. [5] 
(ii) Show the roots on an Argand diagram. [2] 


7 ; 
(iii) The roots represented by z, and zg are such that 0 < arg z, < arg 2 < Explain why 


the locus of all points z such that |z—- z,| = |z— z2| passes through the origin. Draw 
this locus on your Argand diagram and find its exact cartesian equation. [5] 
Exercise 653. (9740 N2008/1/8.) (Answer on p. 2069.) 


A graphic calculator is not to be used in answering this question. 


(i) It is given that z; = 1+ V/3i. Find the value of z?, showing clearly how you obtain your 


answer. [3] 
(ii) Given that 1+ V3i is a root of the equation 22° + az? + bz +4 = 0, find the values of 
the real numbers a and b. [4] 


(iii) For these values a and 6, solve the equation in part (ii), and show all the roots on an 
Argand diagram. [4] 
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Exercise 654. (9740 N2008/II/3.) (Answer on p. 2070.) 


(a) The complex number w has modulus r and argument 6, where 0 < @ < a and w* 


denotes the conjugate of w. State the modulus and argument of p, where p = = [2] 
w 








Given that p° is real and positive, find the possible values of 6@. [2] 

(b) The complex number z satisfies the relations |z| < 6 and |z| = |z - 8 - Gil. 
(i) Illustrate both of these relations on a single Argand diagram. 3 
(ii) Find the greatest and least possible values of arg z, giving your answers in radians 
correct to 3 decimal places. 4) 
Exercise 655. (9233 N2008/1/9.) (Answer on p. 2071.) 


In an Argand diagram, the point P represents the complex number z. Clearly labelling 
any relevant points, draw three separate diagrams to show the locus of P in each of the 
following cases. 

(i) |z + 2i] = 2. 

(ii) |z -2-i]=|z-i]. 


Nh bv 


(iit) = < arg(z4+1-3i) <=. (4) 
6 3 
Exercise 656. (9233 N2008/II/3.) (Answer on p. 2073.) 
(i) Verify that w = 1 -i satisfies the equation w? = -2i and write down the other root of 
this equation. [3] 
(ii) Use the quadratic formula to solve the equation z? — (3 + 5i) z- 4 (1 -2i) =0. [4] 
Exercise 657. (9740 N2007/I/3.) (Answer on p. 2073.) 
(a) Sketch, on an Argand diagram, the locus of points representing the complex number z 
such that |z + 2- 3i| = V/13. [3] 
(b) The complex number w is such that ww*+2w = 3+4i, where w” is the complex conjugate 
of w. Find w in the form a+ ib, where a and 0 are real. [4] 
Exercise 658. (9740 N2007/I/7.) (Answer on p. 2074.) 


The polynomial P (z) has real coefficients. The equation P(z) = 0 has a root re’, where 
r>Oand0<0<7. 


(i) Write down a second root in terms of r and 0, and hence show that a quadratic factor 


of P(z) is 27 -2rzcosO +r’. [3 
(ii) Solve the equation z° = —-64, expressing the solutions in the form re'’, where r > 0 and 
—M<O <7. A| 


(iii) Hence, or otherwise, express 2° +64 as the product of three quadratic factors with real 
coefficients, giving each factor in non-trigonometrical form. 3 
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Exercise 659. (9233 N2007/1/9.) (Answer on p. 2075.) 


(i) The equation az* + bz? +cz?+dz+e=0 has a root z = ki, where k is real and non-zero. 
Given that the coefficients a, b, c, d and e are real, show that ad? + b’e = bed. [5] 


(ii) Verify that this condition is satisfied for the equation 24 + 32° +1327 +27z +36 =0 and 
hence find two roots of this equation which are of the form z = ki, where k is real. [3] 


Exercise 660. (9233 N2007/II/5.) (Answer on p. 2075.) 


(i) Illustrate, on an Argand diagram, the locus of a point P representing the complex 


ae 
number z, where arg (z - 2i) = 3" [3] 
(ii) Illustrate, using the same Argand diagram, the locus of a point Q representing the 
complex number z, where |z - 4| = |z + 2|. [2] 


T 
(iii) Hence find the exact value of z such that arg (z—- 2i) = e and |z - 4| = |z+2|, giving 


your answer in the form a + ib. [3] 
(iv) Show that, in this case, zz* = 8 + 4V3. [2] 
Exercise 661. (9233 N2006/I/5.) (Answer on p. 2076.) 


= 3. 





The complex number z satisfies |z + 4 - 4i 


(i) Describe, with the aid of a sketch, the locus of the point which represents z in an 


Argand diagram. [3] 

(ii) Find the least possible value of |z - ij. [2] 
Exercise 662. (9233 N2006/1/6.) (Answer on p. 2077.) 
(i) Show that the equation z* - 2z* + 6z? -8z+8=0 has a root of the form ki where k is 
real. [3] 

(ii) Hence solve the equation z4 — 22° + 6z7 -8z+8=0. [3] 
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137. Past-Year Questions for Part V. Calculus 


Exercise 663. (9758 N2019/I/2.) (Answer on p. 2079.) 


3 


The curve C’ has equation y = 2° +2” -1. 


(i) C crosses the z-axis at the point with coordinates (a,0). Find the value of a correct 
to 3 decimal places. [1] 
(ii) You are given that b> a. 


The region P is bounded by C, the x-axis and the lines x = -1 and x = 0. The region 
Q is bounded by C, the line x = 6 and the part of the x-axis between x =a and x = b. 
Given that the area of Q is 2 times the area of P, find the value of b correct to 3 
decimal places. [4] 


Exercise 664. (9758 N2019/1/10.) (Answer on p. 2080.) 


A curve C’ has parametric equations 


x = a(2cos6@ — cos 26), 
y =a(2sin6 - sin 20), 


for 0< 6 < 27. 
(i) Sketch C and state the Cartesian equation of its line of symmetry. [2] 
(ii) Find the values of 6 at the points where C' meets the x-axis. [2] 
(iii) Show that the area enclosed by the z-axis, and the part of C above the z-axis, is given 
by 
02 
f ar (4 sin? 6 — 6 sin @ sin 26 + 2 sin” 20) dé, 
where 6; and 65 should be stated. [3] 
(iv) Hence find, in terms of a, the exact total area enclosed by C. [5] 


A cardioid is a curve traced by a fixed point on the perimeter of a circle of 
radius 7 which is rolling around another circle of radius 7. It is parameterized 
by the equations 


x = r(2cost — cos 2t) 


y =r(2sint — sin 2t), 


where 0 < ¢ < 27. What is the area inside the cardioid? 








6! Archived on Wayback Machine since at least 2017-09-19. 
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Exercise 665. (9758 N2019/I/11.) (Answer on p. 2082.) 


Scientists are investigating how the temperature of water changes in various environments. 


(i) The scientists begin by investigating how hot water cools. 


The water is heated in a container and then placed in a room which is kept at a 
constant temperature of 16°C. The temperature of the water ¢ minutes after it is 
placed in the room @°C. This temperature decreases at a rate proportional to the 
difference between the temperature of the water and the temperature of the room. 
The temperature of the water falls from a value of 80°C to 32°C in the first 30 
minutes. 


(a) Write down a differential equation for this situation. Solve this differential equa- 
tion to get @ as an exact function of t. [6] 
(b) Find the temperature of the water 45 minutes after it is placed in the room. [1] 

(ii) The scientists then model the thickness of ice on a pond. 


In winter the surface of the water in the pond freezes. Once the thickness of the ice 
reaches 3cm, it is safe to skate on the ice. The thickness of the ice is T.cm, t minutes 
after the water starts to freeze. The freezing of the water is modelled by a differential 
equation in which the rate of change of the thickness of the ice is inversely proportional 
to its thickness. It is given that T = 0 when t = 0. After 60 minutes, the ice is 1cm 
thick. 


Find the time from when freezing commences until the ice is first safe to skate on.|6] 


Exercise 666. (9758 N2019/II/1.) (Answer on p. 2082.) 
You are given that I = [ # (1 _ x)? da 


(i) Use integration by parts to find an expression for J. [2] 
(ii) Use the substitution u? = 1-2 to find another expression for I. [2] 


(iii) Show algebraically that your answers to parts (i) and (ii) differ by a constant. [2] 


Exercise 667. (9758 N2019/II/3.) (Answer on p. 2083.) 


A solid cylinder has radius rcm, height hcm and total surface area 900cm?. Find the exact 
value of the maximum possible volume of the cylinder. Find also the ratio r:h that gives 
this maximum volume. [7] 
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Exercise 668. (9758 N2019/II/4.) (Answer on p. 2083.) 


(i) Given that f (x) = sec 2x, find f’(«) and f” (x). Hence, or otherwise, find the Maclau- 


rin series for f (x), up to and including the term in 2’. [5] 


0.02 
(ii) Use your series from part (i) to estimate [ sec 2x dx, correct to 5 decimal places. 
0 








[2] 

0.02 ba 
(iii) Use your calculator to find [ sec 2x dx, correct to 5 decimal places. {1 

0 
(iv) Comparing your answers to parts (ii) and (iii), and with reference to the value of z, 
comment on the accuracy of your approximations. [2] 
(v) Explain why a Maclaurin series for g(x) = cosec2x cannot be found. [1] 
Exercise 669. (9758 N2018/I/1.) (Answer on p. 2083.) 

ge cee Ing dy . 
(i) Given that y = —, find — in terms of z. [2] 

us dx 
Il 
(ii) Hence, or otherwise, find the exact value of f _ dz, showing your working. [4] 
i, 2 

Exercise 670. (9758 N2018/I/2.) (Answer on p. 2084.) 


Do not use a calculator in answering this question. 


3 
A curve has equation y = — and a line has equation y + 2x = 7. The curve and the line 


x 
intersect at the points A and B. 


(i) Find the z-coordinates of A and B. [2] 

(ii) Find the exact volume generated when the area bounded by the curve and the line is 

rotated about the x-axis through 360°. [4] 

Exercise 671. (9758 N2018/I/3.) (Answer on p. 2084.) 
dy 


(i) It is given that oe 2y-6. Using the substitution y = ux’, show that the differential 
"3 


d 
equation can be transformed to — = f (x), where the function f (x) is to be found. 
[ 
3] 


d 
(ii) Hence, given that y = 2 when x = 1, solve the differential equation oS = 27 = 6. 10 
£ 


find y in terms of x. [4] 
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Exercise 672. (9758 N2018/I/7.) (Answer on p. 2085.) 





2 _ Aq? 1 
A curve C’ has equation 7 a ==, 
Lp 2 
dy _ 2a-y? 
i) Show that — = ————_.. es) 
Syne dx 2xy+l6y 8 


The points P and Q on C each have x-coordinate 1. The tangents to C at P and Q meet 
at the point N. 


(ii) Find the exact coordinates of N. [6] 


Exercise 673. (9758 N2018/1/9.) (Answer on p. 2085.) 


A curve C’ has parametric equations 
z= 20-sin20, y=2sin76, 
for 0<@0<7. 
(i) Show that 2 = cot é. [4] 
(ii) The normal to the curve at the point where 6 = a meets the x-axis at the point A. 
Show that the z-coordinate of A is ka, where k is a constant to be found. [4] 


(iii) Do not use a calculator in answering this part. 


The distance between two points along a curve is the arc-length. Scientists and engi- 
neers need to use arc-length in applications such as finding the work done in moving 
an object along the path described by a curve or the length of cabling used on a 
suspension bridge. 


The arc-length between two points on C, where @ = 6 and @ = ¥, is given by the 
formula 


Find the total length of C. [5] 
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Exercise 674. (9758 N2018/1/10.) (Answer on p. 2086.) 


An electrical circuit comprises a power source of V volts in series with a resistance of 
R ohms, a capacitance of C’ farads and an inductance of L henries. The current in the 
circuit, t seconds after turning on the power, is J amps and the charge on the capacitor is q 
coulombs. The circuit can be used by scientists to investigate resonance, to model heavily 
damped motion and to tune into radio stations on a stereo tuner. It is given that R, C and 
L are constants, and that J =0 when ¢ = 0. 


di d 
A differential equation for the circuit is LD— + RI + a V, where I = ele 

dt C dt 
(i) Show that, under certain conditions on V which should be stated, 


oT 7% F 
[i a, 2 
a a Ge 2) 


It is now given that the differential equation in part (i) holds for the rest of the question. 


(i) Given that J = Ate™2r is a solution of the differential equation, where A is a positive 








AL ae 
constant, show that C' = —. 5 
R? - 
(ii) In a particular circuit, R = 4, L = 3 and C = 0.75. Find the maximum value of J in 
terms of A, showing that this value is a maximum. [4] 
(iii) Sketch the graph of J against t. 


Exercise 675. (9758 N2018/II/1.) (Answer on p. 2087.) 
The curve y = f (x) passes through the point (0,69) and has gradient given by 


ii i 
ot. (2y-15) 


dx \3 
(i) Find f (2). [4] 
(ii) Find the coordinates of the point on the curve where the gradient is 4. [2] 
Exercise 676. (9758 N2018/II/4.) (Answer on p. 2087.) 


In this question you may use expansions from the List of Formulae (MF 26). 
(i) Find the Maclaurin expansion of In(cos2z) in ascending powers of x, up to and 


including the term in x°. State any value(s) of x in the domain 0 < x < —7 for which 


the expansion is not valid. [6] 


(ii) Use your expansion from part (i) and integration to find an approximate expression 


2 oP 2 
for [ ao dx. Hence find an approximate value for f ao dz, giving 
ae 0 iz 
your answer to 4 decimal places. [3] 


0-5 In (cos 2) re 


2 ) 
x 
giving your answer to 4 decimal places. [1] 


(iii) Use your graphing calculator to find a second approximate value for 7: 
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Exercise 677. (9758 N2017/I/1.) (Answer on p. 2088.) 


Using standard series from the List of Formulae (MF26), expand e*“In(1+az) as far as 
the term in x?, where a is a non-zero constant. Hence find the value of a for which there is 
no term in 2. [4] 


Exercise 678. (9758 N2017/I/3.) (Answer on p. 2088.) 
Do not use a calculator in answering this question. 


A curve C has equation y? — 2xy + 5x2” - 10 = 0. 


(i) Find the exact x-coordinates of the stationary points of C. [4] 

(ii) For the stationary point with x > 0, determine whether it is a maximum or minimum. 
3] 

Exercise 679. (9758 N2017/I/7.) (Answer on p. 2089.) 


It is given that f (x) =sin2mz +sin2nz, where m and n are positive integers and m # n. 


(i) Find [ sin 2mz sin 2naz da. [3] 


(ii) Find f[ “G@y 5) 


Exercise 680. (9758 N2017/1/11.) (Answer on p. 2089.) 


Sir Isaac Newton was a famous scientist renowned for his work on the laws of motion. One 
law states that, for an object falling vertically in a vacuum, the rate of change of velocity, 
ums‘, with respect to time, t seconds, is a constant, c. 


(i) (a) Write down a differential equation relating v, t and c. [1] 


(b) Initially the velocity of the object is 4ms~' and, after a further 2.58, the velocity 
of the object is 29ms~!. Find v in terms of t and state the value of c. [3] 


For an object falling vertically through the atmosphere, the rate of change of velocity is 
less than that for an object falling in a vacuum. The new rate of change of v is modelled 
as the difference between the value of c found in part (i)(b) and an amount proportional 
to the velocity v, with a constant of proportionality k. 


(ii) Given that in this case the initial velocity is zero, find v in terms of t and k. [5] 


For an object falling through the atmosphere, the ‘terminal velocity’ is the value approached 
by the velocity after a long time. 


(iii) A falling object has initial velocity zero and terminal velocity 40ms~!. Find how long 
it takes the object to reach 90% of its terminal velocity. [4] 
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Exercise 681. (9758 N2017/II/4.) (Answer on p. 2090.) 


(a) A flat novelty plate for serving food on is made in the shape of the region enclosed by 
the curve y = x? — 6x +5 and the line 2y = x — 1. Find the area of the plate. [4] 


(b) A curved container has a flat circular top. The shape of the container is formed by 





rotating the part of the curve x = where a is a constant greater than 1, between 


2°? 


il 
the points (0,0) and (—. 1) through 27t radians about the y-axis. 
a — 


(i) Find the volume of the container, giving your answer as a single fraction in terms 
of a and 7. [4] 


(ii) Another curved container with a flat circular top is formed in the same way from 


VY 


b-y? 





i] 
the curve x = and the points (0,0) and (—. 1) It has a volume that is 


bel 
four times as great as the container in part (i). Find an expression for b in terms 


of a. [3] 


Exercise 682. (9740 N2016/I/2.) (Answer on p. 2091.) 


2°°°* at the points where x = 0 


(i) Use your calculator to find the gradient of the curve y = 
1 
and x = 57 [2] 


1 
(ii) Find the equations of the tangents to this curve at the points where x = 0 and x = a" 


and find the coordinates of the point where these tangents meet. [3] 


Exercise 683. (9740 N2016/1/8.) (Answer on p. 2091.) 
It is given that y = f (x), where f (x) = tan (ax +b) for constants a and b. 


(i) Show that f’ (x) =a+ay’. Use this result to find f” (x) and f’ (x) in terms of a and 


y. 5] 
(ii) In the case where b = an, use your results from part (i) to find the Maclaurin series 
for f (x) in terms of a, up to and including the term in 2°. [3] 
(iii) Find the first two non-zero terms in the Maclaurin series for tan 2a. [3] 
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Exercise 684. (9740 N2016/1/9.) (Answer on p. 2092.) 


A stone is held on the surface of a pond and released. The stone falls vertically through the 
water and the distance, x metres, that the stone has fallen in time t seconds is measured. 


d 
It is given that x =0 and — = 0 when ¢ = 0. 


(i) The motion of the stone is modelled by the differential equation 


er dg 
— +2— =10. 
d@ “dt 
d 
(a) By substituting y = me show that the differential equation can be written as 
dy 
—=10-2y. 1 
a y [1] 
(b) Find y in terms of t and hence find x in terms of t. [6] 
(ii) A second model for the motion of the stone is suggested, given by the differential 
equation 
d’x sac ll 
de = 10-5sin a 
Find x in terms of t for this model. [3] 
(iii) The pond is 5 metres deep. For each of these models, find the time the stone takes to 
reach the bottom of the pond, giving your answers correct to 2 decimal places. [2] 
Exercise 685. (9740 N2016/II/1.) 0.1m? per minute 
————> 


dl 


Water is poured at a rate of 0.1m° per minute into 
a container in the form of an open cone. The semi- 
vertical angle of the cone is a, where tana = 0.5. 
At time t minutes after the start, the radius of the 
water surface is rm (see diagram). Find the rate 
of increase of the depth of water when the volume 
of water in the container is 3m?. [7] 


[The volume of a cone of base radius r and height 


1 
h is given by V = gm h.] (Answer on p. 2093.) 
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Exercise 686. (9740 N2016/II/2.) (Answer on p. 2093.) 


(a) (i) Find [ x” cosnx dx, where n is a positive integer. [3] 
27 TT 
(ii) Hence find i x? cosnx dx, giving your answers in the form a—, where the 
™ 
possible values of a are to be determined. [2] 
tr 


(b) The region bounded by the curve y = 7 the x-axis and the lines x = 0 and x = 2 


2 ) 
—2£ 
is rotated through 27 radians about the x-axis. Use the substitution u = 9-2? to find 


the exact volume of the solid obtained, simplifying your answer. [5] 


Exercise 687. (9740 N2016/II/3.) (Answer on p. 2094.) 
A curve D has parametric equations 
£=1— cost, y =1-cost, for 0 <¢ < 27. 


(i) Sketch the graph of D. Give in exact form the coordinates of the points where D 
meets the x-axis, and also give in exact form the coordinates of the maximum point 
on the curve. [4] 


(ii) Find, in terms of a, the area under D for 0 <t <a, where a is a positive constant less 
than 27. [3] 


il 
The normal to D at the point where ¢ = me cuts the x-axis at & and the y-axis at F’. 


(iii) Find the exact area of triangle OFF’, where O is the origin. [4] 


Exercise 688. (9740 N2015/T/3.) (Answer on p. 2095.) 


(i) Given that f is a continuous function, explain, with the aid of a sketch, why the value 


of 

wn fo )os(2)>~-rE) 
is f £@ da. [2] 
i! (ee) 


(ii) Hence evaluate lim — Tn 


nv 
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Exercise 689. (9740 N2015/I1/4.) (Answer on p. 2096.) 


A piece of wire of fixed length dm is cut into two parts. One part is bent into the shape 
of a rectangle with sides of length xm and ym. The other part is bent into the shape of 
a semicircle, including its diameter. The radius of the semicircle is xm. Show that the 
maximum value of the total area of the two shapes can be expressed as kd? m?, where k is 
a constant to be found. [6] 


Exercise 690. (9740 N2015/1/6.) (Answer on p. 2096.) 
(i) Write down the first three non-zero terms in the Maclaurin series for In (1 + 2x”), where 

I 
“5 <e < 5 simplifying the coefficients. [2] 


(ii) It is given that the three terms found in part (i) are equal to the first three terms in the 
series expansion of ax (1+ bx)° for small x. Find the exact values of the constants a, b 
and c and use these values to find the coefficient of x in the expansion of ax (1+ bx)’, 


giving your answer as a simplified rational number. [6] 
Exercise 691. (9740 N2015/1/10.) (Answer on p. 2097.) 
IN 
With origin O, the curves with y y =sing 


equations y = sinx and y = cosa, 


lL 

where 0 < x < 5% meet at the 

mM V/2 
point P with coordinates (z ). 
The area of the region bounded 
by the curves and the x-axis is 
A, and the area of the region 
bounded by the curves and the 
y-axis is Ay (see diagram). 











(i) Show that = a2. [4] 


1 
(ii) The region bounded by y = sina between O and P, the line y = 5V2 and the y-axis is 
rotated about the y-axis through 360°. Show that the volume of the solid formed is 


given by 
3V2 
nf (sin y) dy. [2] 
0 


b 
(iii) Show that the substitution y = sin u transforms the integral in (ii) to 7 [ u” cosu du, 


for limits a and b to be determined. Hence find the exact volume. ° [6] 
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Exercise 692. (9740 N2015/1/11.) (Answer on p. 2098.) 


A curve C’ has parametric equations 


1 
x = sin? 6, y = 3sin? 6 cos 8, for 0<@< =. 


ie) 


(i) Show that 2 = 2cot@-tané. [3] 


(ii) Show that C has a turning point when tan@ = Vk, where k is an integer to be 
determined. Find, in non-trigonometric form, the exact coordinates of the turning 
point and explain why it is a maximum. [6] 


(iii) Show that the area of the region bounded by C and the z-axis is given by 
47 
[ 9 sin* 6 cos? 6 dé. 
0 
Use your calculator to find the area, giving your answer correct to 3 decimal places. 
3] 
The line with equation y = ax, where a is a positive constant, meets C’ at the origin and at 


the point P. 


3 
(iv) Show that tan@ = — at P. Find the exact value of a such that the line passes through 
a 


the maximum point of C. [3] 


Exercise 693. (9740 N2015/II/1.) (Answer on p. 2099.) 


As a tree grows, the rate of increase of its height, hm, with respect to time, t years after 
planting, is modelled by the differential equation 


dh 1 1 
— = —/16-=h. 
dt 10 2 
The tree is planted as a seedling of negligible height, so that h = 0 when t = 0. 
(i) State the maximum height of the tree, according to this model. [1] 


(ii) Find an expression for t in terms of h, and hence find the time the tree takes to reach 
half its maximum height. [5] 


Exercise 694. (9740 N2014/T/2.) (Answer on p. 2099.) 


The curve C has equation x?y+xy?+54 = 0. Without using a calculator, find the coordinates 
of the point on C' at which the gradient is -1, showing that there is only one such point.[6] 
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Exercise 695. (9740 N2014/1/7.) (Answer on p. 2100.) 


It is given that f (x) = 2° - 324-7. The diagram shows the curve with equation y = f (x) 
and the line with equation y = —7, for x > 0. The curve crosses the positive x-axis at x =a, 
and the curve and the line meet where x = 0 and x = £. 


aN 


Y 











(i) Find the value of a, giving your answer correct to 3 decimal places, and find the exact 


value of 3. [2] 
(ii) Evaluate i, f (x) dz, giving your answer correct to 3 decimal places. [2] 
(iii) Find, in terms of V/3, the area of the finite region bounded by the curve and the line, 
for a 20, [3] 
(iv) Show that f(a) = f(-7). What can be said about the six roots of the equation 
f (x) =0? 4] 
Exercise 696. (9740 N2014/1/8.) (Answer on p. 2101.) 


It is given that f (x) = , where -3< 2 <3. 


i] 
V9 - 2? 
(i) Write down [ rR Ga got [1] 


(ii) Find the binomial expansion for f (2), up to and including the term in x°. Give the 
coefficients as exact fractions in their simplest form. [4] 


(iii) Hence, or otherwise, find the first four non-zero terms of the Maclaurin series for 


di 
sin’ 3" Give the coefficients as exact fractions in their simplest form. [4] 
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Exercise 697. (9740 N2014/1/10.) (Answer on p. 2102.) 


The mass, x grams, of a certain substance present in a chemical reaction at time t minutes 
satisfies the differential equation 





dx 9 
a k; (1 ae cae ) 
1 7 1 dz 1 
where 0< a < 5 and k is a constant. It is given that x = 5 and pra when ¢ = 0. 
il 
(i) Show that k= —=. [1] 
(ii) By first expressing 1+ - x? in completed square form, find t in terms of z. [5] 


(iii) Hence find 
(a) the exact time taken for the mass of the substance present in the chemical reaction 
to become half of its initial value, [1] 


(b) the time taken for there to be none of the substance present in the chemical 
reaction, giving your answer correct to 3 decimal places. [1] 








(iv) Express the solution of the differential equation in the form x = f (t) and sketch the 
part of the curve with this equation which is relevant in this context. [5] 


Exercise 698. (9740 N2014/1/11.) (Answer on p. 2103.) 
[It is given that the volume of a sphere of radius r 


is 3m and the volume of a circular cone with base 
i 
radius r and height h is gh] 


A toy manufacturer makes a toy which consists of a 
hemisphere of radius rcm joined to a circular cone 
of base radius rcm and height hem (see diagram). 
The manufacturer determines that the length of the 
slant edge of the cone must be 4cm and that the 
total volume of the toy, Vcm®, should be as large as 
possible. 


(i) Find a formula for V in terms of r. Given that 
r =7r 1 is the value of r which gives the maximum 
value of V, show that r; satisfies the equation 
45r4 — 768r? + 1024 = 0. [6] 





(ii) Find the two solutions to the equation in part (i) for which r > 0, giving your answers 
correct to 3 decimal places. [2] 


(iii) Show that one of the solutions found in part (ii) does not give a stationary value of 
V. Hence write down the value of r; and find the corresponding value of h. [3] 


(iv) Sketch the graph showing the volume of the toy as the radius of the hemisphere varies. 


[3] 
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Exercise 699. (9740 N2014/II/2.) (Answer on p. 2104.) 


Using partial fractions, find 


2 Ox? +24-13 
—__.——- dz. 
0 (2x-5)(a?+4+9) 


Give your answer in the form alnb+ctan-!d, where a, b, c and d are rational numbers to 
be determined. [9] 


Exercise 700. (9740 N2013/I1/5.) (Answer on p. 2104.) 
It is given that 


fe: 
1-— for -a<a<a, 


f(x)= a? 


0 for <a < 20, 


and that f (#+3a) = f (x) for all real values of x, where a is a real constant. 


(i) Sketch the graph of y = f (7) for -4a < x < 6a. [3] 


Ba 
(ii) Use the substitution x = asin to find the exact value of f f (x) dz in terms of a 
and 7t. [5] 


Exercise 701. (9740 N2013/1/10.) (Answer on p. 2105.) 


The variables x, y and z are connected by the following differential equations. 


——=3=2 A 
7 Zz (A) 
dy 
pa B 
an (B) 
eee 3 
(i) Given that z< s solve equation (A) to find z in terms of «. [4] 
(ii) Hence find y in terms of a. [2] 
(iii) Use the result in part (ii) to show that 
dy dy 
for constants a and b to be determined. [3] 


(iv) The result in part (ii) represents a family of curves. Some members of the family are 
straight lines. Write down the equations of two of these lines. On a single diagram, 
sketch one of your lines together with a non-linear member of the family of curves 
that has your line as an asymptote. [4] 
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Exercise 702. (9740 N2013/1/11.) (Answer on p. 2106.) 


A curve C’ has parametric equations 
x = 3t?, y = 20°. 


(i) Find the equation of the tangent to C at the point with parameter t. [3] 


(ii) Points P and Q on C have parameters p and q respectively. The tangent at P meets 
the tangent at Q at the point R. Show that the x-coordinate of R is p? + pq+q’, and 
find the y-coordinate of R in terms of p and q. Given that pq = -1, show that R lies 
on the curve with equation x = y?+1. [5] 


Ay 











O 


A curve L has equation x = y2+1. The diagram shows the parts of C and L for which y > 0. 
The curves C' and L touch at the point M. 


(iii) Show that 4¢° - 3t7+1=0 at M. Hence, or otherwise, find the exact coordinates of 
M. 


3] 
(iv) Find the exact value of the area of the shaded region bounded by C and L for which 
ieale [6] 
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Exercise 703. (9740 N2013/II/2.) (Answer on p. 2107.) 


Fig. 1 shows a piece of card, ABC’, in the form of an equilateral triangle of side a. A 
kite shape is cut from each corner, to give the shape shown in Fig. 2. The remaining card 
shown in Fig. 2 is folded along the dotted lines, to form the open triangular prism of height 
x shown in Fig. 3. 





2 


il 
(i) Show that the volume V of the prism is given by V = zev3 (a — 2x3) [3] 


(ii) Use differentiation to find, in terms of a, the maximum value of V, proving that it is 
a maximum. [6] 





Exercise 704. (9740 N2013/II/3.) (Answer on p. 2108.) 
(i) Given that f(x) =In(1+2sin~), find f (0), f’(0), f” (0) and f’” (0). Hence write 
down the first three non-zero terms in the Maclaurin series for f (x). [7] 


(ii) The first two non-zero terms in the Maclaurin series for f(x) are equal to the first 
two non-zero terms in the series expansion of e” sinnxz. Using appropriate expansions 
from the List of Formulae (MF 15), find the constants a and n. Hence find the third 














non-zero term of the series expansion of e“” sin na for these values of a and n. [5] 

Exercise 705. (9740 N2012/1/2.) (Answer on p. 2108.) 

) Find f 2 a [2] 

(i) Fin [ Teg: [2] 

(ii) Use the substitution u = x? to find i de. [3] 

1+-2z4 ~ 

i 2 a 

(iii) Evaluate [ (; ; i] dx, giving the answer correct to 3 decimal places. [1 
0 - 
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Exercise 706. (9740 N2012/1/4.) (Answer on p. 2109.) 
In the triangle ABC, AB = 1, angle BAC = 9 radians and angle ABC = on radians (see 


diagram). 


B 
1 
Ao C 
(i) Show that AC : [4] 
i ow tha = ———__.. 
cos 6 — sin @ 
(ii) Given that 6 is a sufficiently small angle, show that 
AC #1+a0 +07, 
for constants a and 6 to be determined. [4] 
Exercise 707. (9740 N2012/1/8.) (Answer on p. 2109.) 
The curve C’ has equation 
a-y=(rt+y)?. 
It is given that C' has only one turning point. 
dy 2 
i) Sh hat 1+ — = ————_.. 4 
eh newat * ae 22+ 2y+1 4] 
Py dy\° 
ii) H therwi how that —= =-[{1+—]. 3 
(ii) Hence, or otherwise, show tha qa ( + “| [3] 


(iii) Hence, state, with a reason, whether the turning point is a maximum or a minimum. 


2] 
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Exercise 708. (9740 N2012/1/10.) (Answer on p. 2110.) 
[It is given that a sphere of radius r has surface 


A 
area 47tr? and volume 3m 


A model of a concert hall is made up of three 
parts. The roof is modelled by the curved sur- 
face of a hemisphere of radius rcm. The walls 
are modelled by the curved surface of a cylin- 


der of radius rcm and height hcm. The floor 
is modelled by a circular disc of radius rcm. | 
The three parts are joined together as shown in h 
the diagram. The model is made of material of 
negligible thickness. | 


(i) It is given that the volume of the model is a fixed value kcm?, and the external surface 
area is a minimum. Use differentiation to find the values of r and h in terms of k. 
Simplify your answers. [7] 


(ii) It is given instead that the volume of the model is 200cm? and its external surface 
area is 180cm?. Show that there are two possible values of r. Given also that r < h, 
find the value of r and the value of h. [5] 


Exercise 709. (9740 N2012/1/11.) (Answer on p. 2111.) 


A curve C’ has parametric equations 
x=6-sind, y =1-cos98, 
where 0 < 6 < 27. 


d 1 
(i) Show that 7 = cot 5° and find the gradient of C’ at the point where 6 = 7. What can 
£ 


be said about the tangents to C as 6 > 0 and 6 > 2? [5] 
(ii) Sketch C, showing clearly the features of the curve at the points where 6 = 0, 7 and 
271. [3] 
(iii) Without using a calculator, find the exact area of the region bounded by C' and the 
x-axis. [5] 








1 
(iv) A point P on C has parameter p, where 0 < p< 5M Show that the normal to C at P 


crosses the x-axis at the point with coordinates (p,0). [3] 
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Exercise 710. (9740 N2012/II/1.) (Answer on p. 2112.) 


ae 
(a) Find the general solution of the differential equation 3 = 16-927, giving your answer 
cs 


in the form y = f (2). [3] 

d 
(b) Given that u and t are related by — = 16-9u?, and that u = 1 when t = 0, find t in 
terms of u, simplifying your answer. [5] 
Exercise 711. (9740 N2011/T/3.) (Answer on p. 2112.) 


The parametric equations of a curve are x = t?, y = 


t 
2 

(i) Find the equation of the tangent to the curve at the point (0? = simplifying your 
Pp 


answer. [2] 


(ii) Hence find the coordinates of the points Q and R where this tangent meets the z- and 


y-axes respectively. [2] 
(iii) Find a cartesian equation of theteeus-of the mid-point of QR as p varies.°° [3] 
Exercise 712. (9740 N2011/1/4.) (Answer on p. 2112.) 


(i) Use the first three non-zero terms of the Maclaurin series for cos x to find the Maclaurin 
series for g(x), where g(x) = cos°z, up to and including the term in «*. [3] 


a 
(ii) (a) Use your answer to part (i) to give an approximation for ri g(x) dz in terms of 
0 


1 
a, and evaluate this approximation in the case where a = a [3] 


Ale 


™ 
(b) Use your calculator to find an accurate value for [ g(x) dx. Why is the ap- 
0 


proximation in part (ii) (a) not very good? 


Exercise 713. (9740 N2011/I/5.) (Answer on p. 2113.) 
It is given that f (#) =2-2. 
(i) On separate diagrams, sketch the graphs of y = f (\z|) and y = |f(x)|, giving the 


coordinates of any points where the graphs meet the z- and y-axes. You should label 
the graphs clearly. [3] 


(ii) State the set of values of x for which f (|x|) =|f ()|. [1] 


1 a 
(iii) Find the exact value of the constant a for which [, 7 (e)} dae f Fide. [3] 





°°2The word locus has been removed from your H2 Maths syllabus. But here this word doesn’t really 
mean much. We’ll just delete these three words without changing the question’s meaning. 
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Exercise 714. (9740 N2011/1/8.) (Answer on p. 2114.) 


(i) Find f (100-0?) av. 


(ii) A stone is dropped from a stationary balloon. It leaves the balloon with zero speed, 
and t seconds later its speed v metres per second satisfies the differential equation 


du 
— = 10-0.1v’. 
di . 
(a) Find t in terms of v. Hence find the exact time the stone takes to reach a speed 
of 5 metres per second. [5] 


(b) Find the speed of the stone after 1 second. [3] 
(c) What happens to the speed of the stone for large values of t? [2] 


Exercise 715. (9740 N2011/II/2.) (Answer on p. 2114.) 


The diagram shows a rectangular piece 
of cardboard ABC'D of sides n metres 
and 2n metres, where n is a positive con- 
stant. A square of side x metres is re- 
moved from each corner of ABC'D. The 
remaining shape is now folded along PQ, 
QR, RS and SP to form an open rect- 
angular box of height « metres. 





(i) Show that the volume V cubic metres of the box is given by V = 2n?x -6n2? + 42° [3] 


(ii) Without using a calculator, find in surd form the value of x that gives a stationary 


value of V, and explain why there is only one answer. [6] 
Exercise 716. (9740 N2011/II/4.) (Answer on p. 2115.) 
(a) (i) Obtain a formula for 7 xe ** dx in terms of n, where n > 0. [5] 

0 
(ii) Hence evaluate i oe dm, [1] 
0 


2n 


[You may assume that ne~?” and n2e?" > 0 as n > 00] 


(a) The region bounded by the curve y = = the x-axis and the lines x = 0 and xz = 1 
is rotated through 27 radians about the x-axis. Use the substitution x = tan @ to show 
that the volume of the solid obtained is given by 167t f[ - sin? 6d6, and evaluate this 
integral exactly. [6] 
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Exercise 717. (9740 N2010/I/2.) (Answer on p. 2115.) 


(i) Find the first three terms of the Maclaurin series for e”(1+sin2z). [You may use 
standard results given in the List of Formulae (MF 15). [3] 


(ii) It is given that the first two terms of this series are equal to the first two terms in the 


n 
series expansion, in ascending powers of x, of (1 + 5") . Find n and show that the 


third terms in each of these series are equal. [3] 
Exercise 718. (9740 N2010/I/4.) (Answer on p. 2115.) 
d 
(i) Given that 2? — y?+ 2ry +4 =0, find 7 in terms of x and y. [4] 
at 

(ii) For the curve with equation «x? — y? + 2ry +4 = 0, find the coordinates of each point at 
which the tangent is parallel to the x-axis. [4] 
Exercise 719. (9740 N2010/I/6.) (Answer on p. 2116.) 


The diagram shows the curve with equation y = x? -—32+1 and the line with equation y = 1. 
The curve crosses the z-axis at x = a, x = 6 and x =7y and has turning points x = -1 and 
eI. 











(i) Find the values of 3 and ¥, giving your answers correct to 3 decimal places. [2] 
(ii) Find the area of the region bounded by the curve and the z-axis between x = 6 and 
w=. 2] 

(iii) Use a non-calculator method to find the shaded area. [4] 


(iv) Find the set of values of k for which the equation «° -3x+1 =k has three real distinct 
roots. [2] 
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Exercise 720. (9740 N2010/I/7.) (Answer on p. 2116.) 


(i) A bottle containing liquid is taken from a refrigerator and placed in a room where the 
temperature is a constant 20°C. As the liquid warms up, the rate of increase of its 
temperature 60°C after time t minutes is proportional to the temperature difference 
(20 - 0) °C. Initially the temperature of the liquid is 10°C and the rate of increase of 
the temperature is 1°C per minute. By setting up and solving a differential equation, 


show that 6 = 20 - 10e~10", (7 
(ii) Find the time it takes the liquid to reach a temperature of 15°C, and state what 
happens to @ for large values of t. Sketch a graph of 0 against t. [4] 
Exercise 721. (9740 N2010/1/9.) (Answer on p. 2117.) 


A company requires a box made of cardboard of negligible thickness to hold 300cm? of 
powder when full. The length of the box is 3xcm, the width is «cm and the height is ycm. 
The lid has depth kycm, where 0 < k <1 (see diagram). 


oe 397 


Box Lid 


(i) Use differentiation to find, in terms of k, the value of x which gives a minimum total 
external surface area of the box and the lid. [6] 





(ii) Find also the ratio of the height to the width, cs in this case, simplifying your answer. 
ca 
[2| 


(iii) Find the values between which 7 must lie. [2] 
< 





(iv) Find the value of k for which the box has square ends. [2] 
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Exercise 722. (9740 N2010/1/11.) 
A curve C’ has parametric equations 


1 
Yal= =. 


il 
Ltr, 
t t 


(Answer on p. 2118.) 


(i) The point P on the curve has parameter p. Show that the equation of the tangent at 


P is 


(p?+1)x-(p?-1)y=4p. 


4] 


(ii) The tangent at P meets the line y = x at the point A and the line y = —x at the point 
B. Show that the area of triangle OAB is independent of p, where O is the origin.|4] 


(iii) Find a cartesian equation of C’. Sketch C, giving the coordinates of any points where 
C’ crosses the x- and y-axes and the equations of any asymptotes. [4] 


Exercise 723. (9740 N2010/II/3.) 


(Answer on p. 2119.) 


d 
(i) Given that y = xV x + 2, find aa expressing your answer as a single algebraic fraction. 
x 


Hence, show that there is only one value of x for which the curve y = x/x+2 has a 


turning point, and state this value. [5] 
(ii) A curve has equation y? = 2? (x + 2). 

(a) Find exactly the possible values of the gradient at the point where x = 0. [2] 

(b) Sketch the curve y? = x? (a + 2). [2] 


(iii) On a separate diagram sketch the graph of y = f’ (x), where f (x) = 7Vx+2. State 


the equations of any asymptotes. 


Exercise 724. (9740 N2009/I/2.) 
Find the exact value of p such that 


[ : d i : d 
———-— C= ————————— a 
0 4-2 0 /1— pea? 


Exercise 725. (9740 N2009/I1/4.) 


It is given that 
7-x* for0<2<2, 


f (®)= 


2x-1 for2<2<4, 


and that f (x) = f (x +4) for all real values of x. 
(i) Evaluate f (27) + f (45). 
(ii) Sketch the graph of y = f (x) for -7< x < 10. 
3 
(iii) Find [ _f (2) de, 
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(Answer on p. 2120.) 
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Exercise 726. (9740 N2009/I/7.) (Answer on p. 2120.) 


(i) Given that f (x) =e**, find f (0), f’(0) and f” (0). Hence write down the first two 


non-zero terms in the Maclaurin series for f (2). Give the coefficients in terms of e.[5] 


(ii) Given that the first two non-zero terms in the Maclaurin series for f (az) are equal 


to the first two non-zero terms in the series expansion of a, where a and 6 are 
constants, find a and 6 in terms of e. [4] 
Exercise 727. (9740 N2009/1/11.) (Answer on p. 2121.) 
The curve C' has equation y = f (x), where f (x) = re. 
(i) Sketch the curve C. [2] 
(ii) Find the exact coordinates of the turning points on the curve. [4] 


(iii) Use the substitution u = x” to find [ f (x) da, for n > 0. Hence find the area of the 
0 
region between the curve and the positive x-axis. [4] 


2 
(iv) Find the exact value of [ LF | de: [2] 


(v) Find the volume of revolution when the region bounded by the curve, the lines x = 0, 
x = 1 and the z-axis is rotated completely about the x-axis. Give your answer correct 
to 3 significant figures. [2] 


Exercise 728. (9740 N2009/II/1.) (Answer on p. 2122.) 
The curve C’ has parametric equations 
ag = t" + At, y=Pre. 
(i) Sketch the curve for -2<t< 1. [1] 


The tangent to the curve at the point P where t = 2 is denoted by J. 


(ii) Find the cartesian equation of J. [3] 


(iii) The tangent / meets C again at the point Q. Use a non-calculator method to find the 
coordinates of Q. [4] 
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Exercise 729. (9740 N2009/II/4.) (Answer on p. 2123.) 


Two scientists are investigating the change of a certain population of size n thousand at 
time t years. 


d?n 
(i) One scientist suggests that n and t are related by the differential equation ap 10-6t. 
Find the general solution of this differential equation. Sketch three members of the 


family of solution curves, given that n = 100 when t = 0. [5] 


(ii) The other scientist suggests that n and ¢t are related by the differential equation 


d 

— =3-0.02n. Find n in terms of t, given again that n = 100 when ¢t = 0. Explain in 

simple terms what will eventually happen to the population using this model. [7] 
Exercise 730. (9740 N2008/I/1.) (Answer on p. 2123.) 


The diagram shows the curve with equation y = x”. The area of the region bounded by the 
curve, the lines x = 1, x = 2 and the z-axis is equal to the area of the region bounded by 
the curve, the lines y = a, y = 4 and the y-axis , where a < 4. Find the value of a. [4] 

















ay 
> 
Exercise 731. (9740 N2008/I/4.) (Answer on p. 2124.) 
(i) Find the general solution of the differential equation 
dy 30 
dy _ 9]. 
dx «*+1 I 


(ii) Find the particular solution of the differential equation for which y = 2 when x = 0./1] 
(iii) What can you say about the gradient of every solution curve as © > +00? [1] 


(iv) Sketch, on a single diagram, the graph of the solution found in part (ii), together with 
2 other members of the family of solution curves. [3] 
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Exercise 732. (9740 N2008/I/5.) (Answer on p. 2124.) 





wa ol 
(i) Find the exact value of f - ee dm. [3] 
(ii) Find, in terms of n and e, i x" Inxdz, where n+ -1. [4] 
1 
Exercise 733. (9740 N2008/1/6.) (Answer on p. 2125.) 


(a) In the triangle ABC, AB = 1, BC = 3 and angle ABC = 6 radian. Given that 6 is a 
sufficiently small angle, show that 


AC # V4+ 30? x a + b6", 


for constants a and b to be determined. [5] 


(b) Given that f(x) = tan (20 ~ *), find f (0), f’(0) and f” (0). Hence find the first 3 


terms in the Maclaurin series of f (x). [5] 


Exercise 734. (9740 N2008/I/7.) (Answer on p. 2125.) 


A new flower-bed is being designed for a large garden. The 
flower-bed will occupy a rectangle xm by ym together with 
a semicircle of diameter «m, as shown in the diagram. A 
low wall will be built around the flower-bed. The time 
needed to build the wall will be 3 hours per metre for the —[-.-------------------------- | 
straight parts and 9 hours per metre for the semicircular 
part. Given that a total time of 180 hours is taken to build 

















the wall, find, using differentiation, the values of 7 and y y y 

which give a flower-bed of maximum area. [10] 
L 
2 

Exercise 735. (9740 N2008/II/1.) Let f(a) =e* sina. (Answer on p. 2126.) 

(i) Sketch the graph for y = f (x) for -3< 2 <3. [2] 

(ii) Find the series expansion of f (7) in ascending powers of x, up to and including the 

term in 2°. [3] 


Denote the answer to part (ii) by g(z). 
(iii) On the same diagram as in part (i), sketch the graph of y = g(x). Label the two 


graphs clearly. [1] 
(iv) Find, for -3 < x < 3, the set of values of x for which the value of g(x) is within +0.5 
of the value of f (x). [3] 
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Exercise 736. (9740 N2008/II/2.) (Answer on p. 2126.) 


The diagram shows the curve C with equation y? = 2/1-— 2x. The region enclosed by C is 
denoted by R. 


IN 
y 








(i) Write down an integral that gives the area of R, and evaluate this integral numerically. 
3] 

(ii) The part of R above the z-axis is rotated through 27 radians about the z-axis. By 
using the substitution u = 1-2, or otherwise, find the exact value of the volume 





obtained. [3] 
(iii) Find the exact x-coordinate of the maximum point of C. [3] 
Exercise 737. (9233 N2008/I/2.) (Answer on p. 2126.) 
Find the constants a and 6 such that, when x is small, 
cos 2x 
a a+ bx? 4 
V1+2? i! 
Exercise 738. (9233 N2008/I/3.) Show that (Answer on p. 2127.) 
: 1 3 
—22 Se ~2 5 
7 xe dx ra [5] 


1443, Contents www.EconsPhDTutor.com 


Exercise 739. (9233 N2008/I/4.) (Answer on p. 2127.) 
Use the substitution t = In z to find the value of 


a 


f x (Inax) ol 





Exercise 740. (9233 N2008/I/6.) (Answer on p. 2127.) 
(i) Given that 0 <a< b, sketch the graph of y = |x -a| for -b< x <b. [3] 
b 

(ii) Find [ _ [eal de. 2] 
Exercise 741. (9233 N2008/1/8.) (Answer on p. 2127.) 

Find the exact value of a for which 

| 2v3 
ri ——.dzr = Fh dy. [5] 
a 4+x? $ 1-72 

Exercise 742. (9233 N2008/1/10.) (Answer on p. 2128.) 


d 
(i) Prove that the substitution y = xz reduces the differential equation xy = 27 +y" to 
ie 


cz—=l1. [3] 


d 
(ii) Hence find the solution of the differential equation ry = ¢?+y’ for which y = 6 
i 
when x = 2. [5] 


Exercise 743. (9233 N2008/1/13.) (Answer on p. 2128.) 


A curve is defined by the parametric equations 
x = cos*t, y = sin t, for 0<t< an. 
(i) Show that the equation of the normal to the curve at the point P (cos® t, sin® t) is 
a cost — ysint = cos*t — sin* t. [5] 
(ii) Prove the identity cos't— sin’ t = cos 2t. [2] 


(iii) The normal at P meets the z-axis at A and the y-axis at B. Show that the length 
AB can be expressed in the form k cot 2t, where k is a constant to be found. 


oO 
ye 








S 
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Exercise 744. (9233 N2008/I/14 EITHER.) (Answer on p. 2129.) 


It is required to prove the statement: 


i 1-(n+1)a"+na™! 
(1-«)? 


1 +22 + 3x? +---+na" 
(i) Use mathematical induction to prove the statement for all positive integers n. [6] 


(ii) By considering the expression obtained by integrating each term on the left hand side, 
prove the statement without using mathematical induction. [6] 


Exercise 745. (9233 N2008/II/1.) (Answer on p. 2129.) 


Use the formulae for cos(A +B) and cos(A- 8B), with A = 52 and B = z, to show that 
2sind5xsinx can be written as cos px — cosqx, where p and q are positive integers. [2] 


Hence find the exact value of 


zn 
[ sin 5x sin x da. [3] 
0 


Exercise 746. (9233 N2008/II/5.) (Answer on p. 2129.) 
(i) Show that the derivative of the function 


20 
gb? 





In(1 +2) - 


is never negative. [5] 


2 
(ii) Hence show that In(1+) > 





; when x > 0. [3] 


L+ 





Exercise 747. (9740 N2007/1/4.) (Answer on p. 2130.) 
The current J in an electric circuit at time t satisfies the differential equation 
dl 
4— =2-3]. 
dt 
(i) Find J in terms of t, given that J = 2 when t = 0. [6] 
(ii) State what happens to the current in this circuit for large values of t. [1] 
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Exercise 748. (9740 N2007/1/11.) (Answer on p. 2130.) 


A curve has parametric equations 


1 
x = cos’ t, y = sin®t, for 0< t< =7. 


ie) 


(i) Sketch the curve. [2] 


1 
(ii) The tangent to the curve at the point (cos? 6, sin? 0), where 0 < 6 < =m, meets the x- 


and y-axes at Q and R respectively. The origin is denoted by O. Show that the area 
of AOQR is 


1 
7p 8in (3 cos? 6 + 2sin? 0). [6] 


ar 
(iii) Show that the area under the curve for 0 < t < 0.571 is 2 [P costsin‘tdt, and use the 
0 


substitution sint = u to find this area. [5] 


Exercise 749. (9740 N2007/II/3.) (Answer on p. 2131.) 


(i) By successively differentiating (1 +)", find Maclaurin series for (1+2)”, up to and 
including the term in 2°. [4] 


3 
(ii) Obtain the expansion of (4-2)? (1+22”)? up to and including the term in 2°. [5] 


(iii) Find the set of values of x for which the expansion in part (ii) is valid. [2] 


Exercise 750. (9740 N2007/II/4.) (Answer on p. 2131.) 


Bay Bag 
(i) Find the exact value of [ * sin? eda. Hence find the exact value of [ * cos? x dax.[6] 
0 0 


il 
(ii) The region R is bounded by the curve y = x*sinz, the line x = a and the part of the 
1 
x-axis between 0 and 5 Find 


(a) the exact area of R, [5] 


(b) the numerical value of the volume of revolution formed when R is rotated com- 
pletely about the z-axis, giving your answer correct to 3 decimal places. [2] 


Exercise 751. (9233 N2007/I1/2.) (Answer on p. 2131.) 


5 
Find the first negative coefficient in the expansion of (4+ 3)? in a series of ascending 


A 
powers of x, where |a| < 3" Give your answer as a fraction in its lowest terms. [3] 
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Exercise 752. (9233 N2007/I/3.) (Answer on p. 2132.) 
1 1 1 
The region bounded by the curve y = —————, the z-axis and the lines x = = and x = ~—V3 


V1+ 42? 2 2 


is rotated through 4 right angles about the x-axis to form a solid of revolution of volume 
V. Find the exact value of V, giving your answer in the form k7’. [5] 


Exercise 753. (9233 N2007/1/8.) (Answer on p. 2132.) 


(i) Use the substitution t = sinu to show that 


(sin ¢) cos (sin! t) 
i (sin) | 


1-# 


dt 


simplifies to [ ucos u? du. [3] 





1 (sin t) COs [ (sin! t) | 


TF dt. (4] 


(i) Hence evaluate | 
0 


Exercise 754. (9233 N2007/1/10.) (Answer on p. 2132.) 


(i) By sketching the graphs of y = cosx and y = sinz, or otherwise, solve the inequality 


cosx >sinz 


for 0 <%s 2m. [3] 
27 
(ii) Evaluate [ |cos x — sin a] dx. [5] 
0 
Exercise 755. (9233 N2007/1/11.) (Answer on p. 2132.) 


Use partial fractions to evaluate 


2 5a +4 
= 
1 (2-5) (a? +4) 
giving your answer in the form —Ina, where a is a positive integer. [9] 
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Exercise 756. (9233 N2007/1/13.) (Answer on p. 2133.) 


d 
In this question, the result es sec x = secxtanx may be quoted without proof. 
aa 


Given that y = In (sec), show that 


dy d?y dy 
-9 3 
(i) dx? daz? dx’ 
dty - 
(ii) the value of af when x = 0 is 2. [4] 
x 


(iii) Write down the Maclaurin series for In (sec) up to and including the term in 2’. [2] 














(iv) By substituting « a show that In2 ds + e [3] 
iv — Ww x — 

‘ a. 16 1536 es 
Exercise 757. (9233 N2007/1/14.) (Answer on p. 2133.) 
A family of curves is given by x? - y? = Az, where A is an arbitrary constant. 

(i) Show that 
veer a 
dx 2xy 


A second, related family of curves is given by the differential equation 


dy 20 





dx 2+ yp? 


(ii) By substituting y = vx, where v is a function of x, show that, for the second family of 
curves, 


dv 33utv? 





“de L402" 4] 
(iii) Hence show that the second family of curves is given by 
327y + y? =C, 
where C’ is an arbitrary constant. [4] 
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Exercise 758. (9233 N2006/I1/7.) (Answer on p. 2133.) 


A hollow cone of semi-vertical angle 45° is 
held with its axis vertical and vertex down- 
wards (see diagram). At the beginning of an ———————_— 
experiment, it is filled with 390 cm? of liquid. 
The liquid runs out through a small hole at 
the vertex at a constant rate of 2cm?s. 
Find the rate at which the depth of the liq- 


uid is decreasing 3 minutes after the start of 
the experiment. [6] 





Exercise 759. (9233 N2006/1/8.) (Answer on p. 2134.) 


Find the coordinates of the points on the curve 


327 + xy + y” = 33 


at which the tangent is parallel to the z-axis. [7] 

Exercise 760. (9233 N2006/1/9.) (Answer on p. 2134.) 
d 

(i) Use the derivative of cos@ to show that 7 (sec) = sec O tan 0. [2] 


(ii) Use the substitution x = sec 6-1 to find the exact value of 





= 


: 1 
[ da. [6 
V2-1 (2 +1) Va? + 2x 


Exercise 761. (9233 N2006/1/12.) (Answer on p. 2134.) 
(i) Express 


1l+2-22? 


PO)" Gray Gea) 


in partial fractions. [4] 


oo 


(ii) Expand f (x) in ascending powers of x, up to and including the term in 2”. [5 
1 


mm 
— 


(iii) State the set of values of x for which the expansion is valid. 
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Exercise 762. (9233 N2006/1/14.) (Answer on p. 2135.) 


; : c : me 
A curve has parametric equations x = ct, y = where c is a positive constant. 


Cc G é 
Three points P (. “), Q (co. “), R (cr 4) on the curve are shown in the diagram. 
Pp qd r 














y 
Q 
> 
O ae 
R 
P 
' . . 1 
(i) Prove that the gradient of QR is -—. [2] 
qr 
(ii) Given that the line through P perpendicular to QR meets the curve at V (cv, “), find 
v 
v in terms of p, g and r. [2] 
(iii) Find the gradient of the normal at P. [3] 
1 _ 
(iv) The normal at P meets the curve again at S (cs. “). Show that s = -——. [2] 
8 p 
(v) Given that angle QPR is 90°, prove that QR is parallel to the normal at P. [3] 
Exercise 763. (9233 N2006/II/2.) (Answer on p. 2135.) 
d 32 
(i) Given that z = eee show that — = 5 [3] 
(x? + 32)? da (2 +32)? 
a1 
(ii) Find the exact value of the area of the region bounded by the curve y = ————, 
(x? + 32)? 
the x-axis and the lines x = 2 and x = 7. [3] 
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138. Past-Year Questions for Part VI. Prob. and Stats. 


(Sorry hor, I haven’t written the answers for 2016-19 questions yet.) 


Exercise 764. (9758 N2019/II/6.) (Answer on p. 1451.) 


In a certain country, there are 100 professional football clubs, arranged in 4 divisions. There 
are 22 clubs in Division One, 24 in Division Two, 26 in Division Three and 28 in Division 
Four. 


(i) Alice wishes to find out about approaches to training by clubs in Division One, so she 
sends a questionnaire to the 22 clubs in Division One. Explain whether these 22 clubs 
form a sample or a population. [1] 


(ii) Dillip wishes to investigate the facilities for supporters at the football clubs, but does 
not want to obtain the detailed information necessary from all 100 clubs. Explain how 
he should carry out his investigation, and why he should do the investigation in this 
way. [2] 

(iii) Find the number of different possible samples of 20 football clubs, with 5 clubs chosen 
from each division. [3] 


A764 (9758 N2019/II/6). 


Exercise 765. (9758 N2019/II/7.) (Answer on p. 1451.) 


A company produces drinking mugs. It is known that, on average, 8% of the mugs are 
faulty. Each day the quality manager collects 50 of the mugs at random and checks them; 
the number of faulty mugs found is the random variable F’. 


(i) State, in the context of the question, two assumptions needed to model F’ by a binomial 
distribution. [2] 


You are now given that F’ can be modelled by a binomial distribution. 


(ii) Find the probability that, on a randomly chosen day, at least 7 faulty mugs are found. 
[2] 

(iii) The number of faulty mugs produced each day is independent of other days. Find the 
probability that, in a randomly chosen working week of 5 days, at least 7 faulty mugs 
are found on no more than 2 days. [2] 


The company also makes saucers. The number of faulty saucers also follows a binomial 
distribution. The probability that a saucer is faulty is p. Faults on saucers are independent 
of faults on mugs. 


(iv) Write down an expression in terms of p for the probability that, in a random sample 
of 10 saucers, exactly 2 are faulty. [1] 


The mugs and saucers are sold in sets of 2 randomly chosen mugs and 2 randomly chosen 
saucers. The probability that a set contains at most 1 faulty item is 0.97. 


(v) Write down an equation satisfied by p. Hence find the value of p. [4] 
A765 (9758 N2019/II/7). 
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Exercise 766. (9758 N2019/II/8.) (Answer on p. 1452.) 


Gerri collects characters given away in packets of breakfast cereal. There are four different 
characters: Horse, Rider, Dog and Bird. Each character is made in four different colours: 
Orange, Yellow, Green and White. Gerri has collected 56 items; the numbers of each 
character and colour are shown in the table. 


Orange Yellow Green White 

Horse 1 1 3 4 

Rider iL 1 fi 5 

Dog 3 iG 1 6 

Bird 4 5 6 1 

(i) Gerri puts all the items in a bag and chooses one item at random. 

(a) Find the probability that this item is either a Horse or a Rider. [1] 
(b) Find the probability that this item is either a Dog or a Bird but the item is not 
White. [1] 


(ii) Gerri now puts the item back in the bag and chooses two items at random. 


(a) Find the probability that both of the items are Horses, but neither of the items 


is Orange. [1] 
(b) Find the probability that Gerri’s two items include exaclty one Dog and exactly 
one item that is Yellow. [3] 


(iii) Gerri has two favourites among the 16 possible colour/character combinations. The 


probability of choosing these two at random from the 56 items is —. Write down all 
the possibilities for Gerri’s two favourite colour/character combinations. [3] 
A766 (9758 N2019/II/8). 
Exercise 767. (9758 N2019/II/9.) (Answer on p. 1452.) 


A company produces resistors rated at 750 ohms for use in electronic circuits. The produc- 
tion manager wishes to test whether the mean resistance of these resistors is in fact 750 
ohms. He knows that the resistances are normally distributed with variance 100 ohms’. 


(i) Explain whether the manager should carry out a 1-tail test or a 2-tail test. State 
hypotheses for the test, defining any symbols you use. [2] 


The production manager takes a random sample of 8 of these resistors. He finds that the 
resistances, in ohms, are as follows. 


742 771 768 738 769 752 742 766 


(ii) Find the mean of the sample of 8 resistors. Carry out the test, at 5% level of signifi- 
cance, for the production manager. Give your conclusion in context. [5] 


The company also produces resistors rated at 1250 ohms. Nothing is known about the 
distribution of the resistances of these resistors. 


(iii) Describe how, and why, a test of the mean resistance of the 1250 ohms resistors would 
need to differ from that for the 750 ohms resistors. [2] 


A767 (9758 N2019/II/9). 
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Exercise 768. (9758 N2019/II/10.) (Answer on p. 1453.) 


Abi and Bhani find the fuel consumption for a car driven at different constant speeds. The 
table shows the fuel consumption, y kilometres per litre, for different constant speeds, x 
kilometres per hour. 


i 
(i) Abi decides to model the data using the line y = 35 - Bt 


(a) On the grid opposite [omitted from this textbook] 
e draw a scatter diagram of the data, 
¢ draw the line y = 35 - Bf [2] 
(b) For a line of best fit y = f (x), the residual for a point (a,b) plotted on the scatter 
diagram is the vertical distance between (a, f (a)) and (a,b). Mark the residual 
for each point on your diagram. [1] 
(c) Calculate the sum of the squares of the residuals for Abi’s line. [1] 
(d) Explain why, in general, the sum of the squares of the residuals rather than the 
sum of the residuals is used. [1] 


Bhani models the same data using a straight line passing through the points (40,22) and 
(55,17). The sum of the squares of the residuals for Bhani’s line is 1. 


(ii) State, with a reason, which of the two models, Abi’s or Bhani’s, gives a better fit. [1] 


(iii) State the coordinates of the point that the least squares regression line must pass 








through. {1 
(iv) Use your calculator to find the equation of the least squares regression line of y on x. 
State the value of the product moment correlation coefficient. [3] 


(v) Use the equation of the regression line to estimate the fuel consumption when the 
speed is 30 kilometres per hour. Explain whether you would expect this value to be 
reliable. [2| 


(vi) Cerie performs a similar experiment on a different car. She finds that the sum of the 
squares of the residuals for her line is 0. What can you deduce about the data points 
in Cerie’s experiment? [1] 


A768 (9758 N2019/II/10). 
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Exercise 769. (9758 N2019/II/11.) (Answer on p. 1454.) 


In this question you should state clearly all the distributions that you use, 
together with the values of the appropriate parameters. 


Arif is making models of hydrocarbon molecules. Hydrocarbons are chemical compounds 
made from carbon atoms and hydrogen atoms. Arif has a bag containing a large number 
of white balls to represent the carbon atoms, and a bag containing a large number of black 
balls to represent the hydrogen atoms. 


The masses of the white balls have the distribution N (110, 4?) and the masses of the black 
balls have the distribution N (55, - The units for mass are grams. 

(i) Find the probability that the total mass of 4 randomly chosen white balls is more 

than 425 grams. [2] 


(ii) Find the probability that the total mass of a randomly chosen white ball and a ran- 
domly chosen black ball is between 161 and 175 grams. [2] 


(iii) The probability that 2 randomly chosen white balls and 3 randomly chosen black balls 
have total mass less than M grams is 0.271. Find the value of M. [4] 








Arif also has a bag containing a large number of connecting rods to fix the balls together. 
The masses of the connecting rods, in grams, have the distribution N (20, 0.97). In order 
to make models of methane (a hydrocarbon), Arif has to drill 1 hole in each black ball, and 
4 holes in each white ball, for the connecting rods to fit in. This reduces the mass of each 
black ball by 10% and reduces the mass of each white ball by 30%. 


A methane molecule consists of 1 carbon atom and 4 hydrogen atoms. Arif makes a model 
of a methane molecule using 4 black balls, 1 white ball and 4 connecting rods (see diagram). 
The balls and connecting rods are all chosen at random. 


(iv) Find the probability that the mass of Arif’s model is more than 350 grams. [4] 
A769 (9758 N2019/II/11). 
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Exercise 770. (9758 N2018/II/5.) (Answer on p. 1455.) 


The manufacturer of a certain type of fan used for cooling electronic devices claims that 
the mean time to failure (MTTF) is 65000 hours. The quality control manager suspects 
that the MT'TF is actually less than 65 000 hours and decides to carry out a hypothesis test 
on a sample of these fans. (An accelerated testing procedure is used to find the MTTF.) 


(i) Explain why the manager should take a sample of at least 30 fans, and state how 
these fans should be chosen. [2] 


(ii) State suitable hypotheses for the test, defining any symbols that you use. [2] 


The quality control manager takes a suitable sample of 43 fans, and finds that they have 
an MTTF of 64230 hours. 


(iii) Given that the manager concludes that there is no reason to reject the null hypothesis 
at the 5% level of significance, find the range of possible values of the variance used 
in calculating the test statistic. [3] 


A770 (9758 N2018/II/5). 


Exercise 771. (9758 N2018/II/6.) (Answer on p. 1455.) 


In a computer game, a bug moves from left to right through a network 
of connected paths. The bug starts at S and, at each junction, randomly 
takes the left fork with probability p or the right fork with 
probability g, where q = 1-j. The forks taken at each 
junction are independent. The bug finishes its 
journey at one of the 9 endpoints 
labelled A-I (see diagram). 










(i) Show that the probability that the 
bug finishes its journey at D is 56p°q°. [2] 
(ii) Given that the probability that the bug finishes 
its journey at D is greater than the probability that 
the bug finishes its journey at any one of the other 


endpoints, find exactly the possible range of values of p. [4] 


In another version of the game, the probability that, at each junction, the bug takes the 
left fork is 0.9p, the probability that the bug takes the right fork is 0.9q¢ and the probability 
that the bug is swallowed up by a ‘black hole’ is 0.1. 


(iii) Find the probability that, in this version of the game, the bug reaches one of the 
endpoints A-I, without being swallowed up by a black hole. [1] 


A771 (9758 N2018/II/6). 
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Exercise 772. (9758 N2018/II/7.) (Answer on p. 1456.) 
The events A, B and C are such that P(A) = a, P(B) = b and P(C) =c. A and B are 
independent events. A and C’ are mutually exclusive events. 
(i) Find an expression for P (A’n B’) and hence prove that A’ and B’ are independent 
events. [2| 


(ii) Find an expression for P(A’nC"). Draw a Venn diagram to illustrate the case when 
A’ and C" are also mutually exclusive events. (You should not show event B on your 


diagram. ) [2] 
You are now given that A’ and C” are not mutually exclusive, P(A) = =, P(bBnC)= = 
and P(A’n B'nC’) = a 
(iii) Find exactly the maximum and minimum possible values of P(An B). [4] 
A772 (9758 N2018/II/7). 
Exercise 773. (9758 N2018/II/8.) (Answer on p. 1456.) 


A bag contains (n +5) numbered balls. Two of the balls are numbered 3, three of the balls 
are numbered 4 and n of the balls are numbered 5. Two balls are taken, at random and 
without replacement, from the bag. The random variable S is the sum of the numbers on 
the two balls taken. 





(i) Determine the probability distribution of S. [4] 
(ii) For the case where n = 1, find P (S = 10) and explain this result. [1] 
1 
(iii) Show that e(S) = ula and Var (S') = where g(n) is a quadratic 
+95 (n +5) (n+4) 

polynomial to be determined. [6] 
A773 (9758 N2018/II/8). 

Exercise 774. (9758 N2018/II/9.) (Answer on p. 1457.) 


Many electronic devices need a fan to keep them cool. In order to maximise the lifetime of 
such fans, the speed they run at is reduced when conditions allow. Running a fan at a lower 
speed reduces the power required. The following table gives details, for a particular type 
of fan, of the power required (P watts) at different fan speeds (R revolutions per minute). 





Fan speed (R) | 3600 | 4500 5400 | 6300 | 7200 | 8100 | 9000 | 9900 
Power (P) 0.22 | 0.34 | 0.52 | 0.78 | 1.06 | 1.48 | 2.04 | 2.64 






































(i) Draw a scatter diagram of these data. Use your diagram to explain whether the 
relationship between P and R is likely to be well modelled by an equation of the form 
P=aR+b, where a and 6 are constants. [2] 

(ii) By calculating the relevant product moment correlation coefficients, determine whether 
the relationship between P and R is modelled better by P = aR+b or by P = aR? +b. 
Explain how you decide which model is better, and state the equation in this case.|5] 


(iii) Use your equation to estimate the speed of the fan when the power is 0.9 watts. 
Explain whether your estimate is reliable. [2] 
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(iv) Use your equation to estimate the power used when the speed of the fan is 3300 


revolutions per minute. Explain whether your estimate is reliable. [2] 
(v) Re-write your equation from part (ii) so that it can be used when the speed of the 
fan, R, is given in revolutions per second. [1] 


A774 (9758 N2018/II/9). 
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Exercise 775. (9758 N2018/II/10.) (Answer on p. 1458.) 
In this question you should state the parameters of any distributions that you use. 
A manufacturer produces specialist light bulbs. The masses in grams of one type of light 
bulb have the normal distribution N (50, 1.5*). 
(i) Sketch the distribution for masses between 40 grams and 60 grams. [2] 
(ii) Find the probability that the mass of a randomly chosen bulb is less than 50.4 grams. 
[1] 
Each light bulb is packed into a randomly chosen box. The masses of the empty boxes have 
the distribution N (75, 2”). 


(iii) Find the probability that the total mass of 4 randomly chosen empty boxes is more 


than 297 grams. [2] 
(iv) Find the probability that the total mass of a randomly chosen light bulb and a ran- 
domly chosen box is between 124.9 and 125.7 grams. [3] 


In order to protect the bulbs in transit each bulb is surrounded by padding before being 
packed in a box. The mass of the padding is modelled as 30% of the mass of the bulb. 


(v) The probability that the total mass of a box containing a bulb and padding more than 


k grams is 0.9. Find k. [4] 
(vi) Find the probability that the total mass of 4 randomly chosen boxes, each containing 
a bulb and padding, is more than 565 grams. [3] 


A775 (9758 N2018/II/10). 
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Exercise 776. (9758 N2017/II/5.) (Answer on p. 2136.) 


A bag contains 6 red counters and 3 yellow counters. In a game, Lee removes counters 
at random from the bag, one at a time, until he has taken out 2 red counters. The total 
number of counters Lee removes from the bag is denoted by 7’. 


(i) Find P(T =t) for all possible values of t. 
(ii) Find E(7) and Var (T). [2 


oe ee 


Lee plays this game 15 times. 


3. Find the probability that Lee has to take at least 4 counters out of the bag in at least 5 
of his 15 games. [2] 


Exercise 777. (9758 N2017/II/6.) (Answer on p. 2136.) 


A children’s game is played with 20 cards, consisting of 5 sets of 4 cards. Each set consists 
of a father, mother, daughter and son from the same family. The family names are Red, 
Blue, Green, Yellow and Orange. So, for example, the Red family cards are father Red, 
mother Red, daughter Red and son Red. 


The 20 cards are arranged in a row. 
(i) In how many different ways can the 20 cards be arranged so that the 4 cards in each 
family set are next to each other? [2] 


(ii) In how many different ways can the cards be arranged so that all five father cards are 
next to each other, all four Red family cards are next to each other and all four Blue 
family cards are next to each other? [3] 


The cards are now arranged at random in a circle. 


(iii) Find the probability that no two father cards are next to each other. [4] 


Exercise 778. (9758 N2017/II/7.) (Answer on p. 2136.) 


The production manager of a food manufacturing company wishes to take a random sample 
of a certain type of biscuit bar from the thousands produced one day at his factory, for 
quality control purposes. He wishes to check that the mean mass of the bars is 32 grams, 
as stated on the packets. 


(i) State what it means for a sample to be random in this context. [1] 


The masses, x grams, of a random sample of 40 biscuit bars are summarised as follows. 


n = 40, \) (a - 32) = -7.7, \\ (a - 32)? = 11.05. 
(ii) Calculate unbiased estimates of the population mean and variance of the mass of 
biscuit bars. [2] 


(iii) Test, at the 1% level of significance, the claim that the mean mass of biscuit bars is 
32 grams. You should state your hypotheses and define any symbols you use. [5] 


(iv) Explain why there is no need for the production manager to know anything about the 
population distribution of the masses of the biscuit bars. [2] 
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Exercise 779. (9758 N2017/II/8.) (Answer on p. 2136.) 


(a) Draw separate scatter diagrams, each with 8 points, all in the first quadrant, which rep- 
resent the situation where the produce moment correlation coefficient between variables 
x and y is 
(i) =, 
(ii) 0, 
(iii) between 0.5 and 0.9. [3] 
(b) An investigation into the effect of a fertiliser on yields of corn found that the amount 


of fertiliser applied, x, resulted in the average yields of corn, y, given below, where x 
and y are measured in suitable units. 





x | 0 |} 40 | 80 | 120 | 160 | 200 
y | 70 | 104 | 118 | 119 | 126 | 129 
































(i) Draw a scatter diagram for these values. State which of the following equations, 
where a and 6 are positive constants, provides the most accurate model of the 
relationship between x and y. 


(A) y=ax?+b. (B) y= +b. 


(C) y=aln2r+b. (D) y=aVr+b. [2] 


(ii) Using the model you chose in part (i), write down the equation for the relationship 
between x and y, giving the numerical values of the coefficients. State the product 
moment correlation coefficient for this model. [3] 

(iii) Give two reasons why it would be reasonable to use your model to estimate the 
value of y when x = 189. [2] 
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Exercise 780. (9758 N2017/II/9.) (Answer on p. 2136.) 


On average 8% of a certain brand of kitchen lights are faulty. The lights are sold in boxes 
of 12. 


(i) State, in context, two assumptions needed for the number of faulty lights in a box to 
be well modelled by a binomial distribution. [2] 


Assume now that the number of faulty lights in a box has a binomial distribution. 


(ii) Find the probability that a box of 12 of these kitchen lights contains at least 1 faulty 
light. [1] 


The boxes are packed into cartons. Each carton contains 20 boxes. 


(iii) Find the probability that each box in one randomly selected carton contains at least 


one faulty light. [1] 
(iv) Find the probability that there are at least 20 faulty lights in a randomly selected 
carton. [2] 
(v) Explain why the answer to part (iv) is greater than the answer to part (iii). [1] 


The manufacturer introduces a quick test to check if lights are faulty. Lights identified 
as faulty are discarded. If a light is faulty there is a 95% chance that the quick test will 
correctly identify the light as faulty. If the light is not faulty, there is a 6% chance that the 
quick test will incorrectly identify the light as faulty. 

(vi) Find the probability that a light identified as faulty by the quick test is not faulty.[3] 


(vii) Find the probability that the quick test correctly identifies lights as faulty or not 


faulty. [1] 
(viii) Discuss briefly whether the quick test is worthwhile. [1] 
Exercise 781. (9758 N2017/II/10.) (Answer on p. 2136.) A small component 


for a machine is made from two metal spheres joined by a short metal bar. The masses in 
grams of the spheres have the distribution N (20, 0.57). 


(i) Find the probability that the mass of a randomly selected sphere is more than 20.2 
grams. 1] 


In order to protect them from rusting, the spheres are given a coating which increases the 
mass of each sphere by 10%. 


(ii) Find the probability that the mass of a coated sphere is between 21.5 and 22.45 grams. 
State the distribution you use and its parameters. [3] 


(iii) The masses of the metal bars are normally distributed such that 60% of them have a 
mass greater than 12.2 grams and 25% of them have a mass less than 12 grams. Find 
the mean and standard deviation of the masses of metal bars. [4] 


(iv) The probability that the total mass of a component, consisting of two randomly chosen 
coated spheres and one randomly chosen bar, is more than k grams is 0.75. Find k, 
stating the parameters of any distribution you use. [4] 
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Exercise 782. (9740 N2016/II/5.) (Answer on p. 2136.) 


In a game of chance, a player has to spin a fair spinner. The spinner has 7 sections and an 
arrow which has an equal chance of coming to rest over any of the 7 sections. The spinner 
has 1 section labelled R, 2 sections labelled B and 4 sections labelled Y (see diagram). 


The player then has to throw one of three fair six-sided dice, coloured red, blue or yellow. 
If the spinner comes to rest over R the red die is thrown, if the spinner comes to rest over 
B the blue die is thrown and if the spinner comes to rest over Y the yellow die is thrown. 
The yellow die has one face with + on it, the blue die has two faces with + on it and the 
red die has three faces with + on it. The player wins the game if the die thrown comes to 
rest with a face showing * uppermost. 


(i) Find the probability that a player wins a game. [2] 


(ii) Given that a player wins a game, find the probability that the spinner came to rest 
over B. [1] 


(iii) Find the probability that a player wins 3 consecutive games, each time throwing a die 
of a different colour. [2] 
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Exercise 783. (9740 N2016/II/6.) (Answer on p. 2136.) The number of employees of a 
company, classified by department and gender, is shown below. 





Production | Development | Administration | Finance 
Male 2345 1013 207 344 
Female 867 679 591 523 





























(i) The directors wish to survey a sample of 100 of the employees. This sample is to be 
a stratified sample, based on department and gender. 
(a) How many males should be in the sample? [1] 
(b) How many females from the Development department should be in the sample? 


[1] 


The Managing Director knows that, some years ago, the mean age of employees was 37 
years. He believes that the mean age of employees now is less than 37 years. 


(ii) State why the stratified sample from part (i) should not be used for a hypothesis test 
of the Managing Director’s belief. [1] 


The Company Secretary obtains a suitable sample of 80 employees in order to carry out a 
hypothesis test of the Managing Director’s belief that the mean age of the employees now 
is less than 37 years. You are given that the population variance of the ages is 140 years’. 


(iii) Write down appropriate hypotheses to test the Managing Director’s belief. You are 
given that the result of the test, using a 5% significance level, is that the Managing 
Director’s belief should be accepted. Determine the set of possible values of the mean 
age of the sample of employees. [4] 


(iv) You are given instead that the mean age of the sample of employees is 35.2 years, and 
that the result of a test at the a% significance level is that the Managing Director’s 
belief should not be accepted. Find the set of possible values of a. [3] 


Exercise 784. (9740 N2016/II/7.) (Answer on p. 2136.) 


The management board of a company consists of 6 men and 4 women. A chairperson, a 
secretary and a treasurer are chosen from the 10 members of the board. Find the number 
of ways the chairperson, the secretary and the treasurer can be chosen so that 


(i) they are all women, [1] 
(ii) at least one is a woman and at least one is a man. [3] 
The 10 members of the board sit at random around a table. Find the probability that 


(iii) the chairperson, the secretary and the treasurer sit in three adjacent places, [3] 


(iv) the chairperson, the secretary and the treasurer are all separated from each other by 
at least one other person. [3] 
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Exercise 785. (9740 N2016/II/8.) (Answer on p. 2136.) 


A website about electric motors gives information about the percentage efficiency of motors 
depending on their power, measured in horsepower. Xian has copied the following table for 
a particular type of electric motor, but he has copied one of the efficiency values wrongly. 





Power, x i 1.5 2 3 5 7.0 | 10 | 20 |) 30 | 40 | 50 
Efficiency, y | 72.5 | 82.5 | 84.0 | 87.4 | 87.5 | 88.5 | 89.5 | 90.2 | 91.0 | 91.7 | 92.4 















































(i) Plot a scatter diagram on graph paper for these values, labelling the axes, using a 
scale of 2cm to represent 10% efficiency on the y-axis and an appropriate scale for the 
x-axis. On your diagram, circle the point that Xian has copied wrongly. [2] 


For parts (ii), (iii) and (iv) of this question you should exclude the point for which Xian 
has copied the efficiency value wrongly. 


(ii) Explain from your scatter diagram why the relationship between x and y should not 
be modelled by an equation of the form y = ax + b. [1] 
(iii) Suppose that the relationship between x and y is modelled by an equation of the form 
y = — +d, where c and d are constants. State with a reason whether each of c and d 
Zz 

is positive or negative. [2] 
(iv) Find the product moment correlation coefficient and the constants c and d for the 
model in part (iii). [3] 

C 
(v) Use the model y = — +d, with the values of c and d found in part (iv), to estimate the 

z 


efficiency value (y) that Xian has copied wrongly. Give two reasons why you would 


expect this estimate to be reliable. [3] 
Exercise 786. (9740 N2016/II/9.) (Answer on p. 2136.) 
(a) The random variable X has distribution N (15,a”) and P (10 < X < 20) =0.5. Find the 

value of a. [2] 
(b) The random variable Y has distribution B(4,p) and P(Y =1)+P(Y =2) =0.5. Show 
that 4p* - 12p? + 8p = 1 and hence find the possible values of p. [4] 


(c) On a television quiz show contestants have to select the right answer from one of 
three alternatives. George decides to do this entirely by guesswork. Use a suitable 
approximation, which should be stated, to find the probability that George guesses at 
least 30 questions right out of 100. [4] 
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Exercise 787. (9740 N2016/II/10.) (Answer on p. 2136.) 
Mia owns a field. Various types of weed are found in Mia’s field. 
(i) State, in this context, two conditions that must be met for the numbers of a particular 
type of weed in Mia’s field to be well modelled by a Poisson distribution. [2] 
For the remainder of this question assume that these conditions are met. 


There is an average of 1.5 dandelion plants (a type of weed) per m? in Mia’s field. 

(ii) Find the probability that in 1m? of Mia’s field there are at least 2 dandelion plants. 
2] 

(iii) Find the probability that in 4m? of Mia’s field there are at most 3 dandelion plants. 
2] 

(iv) Use a suitable approximation, which should be stated, to find the probability that the 


number of dandelion plants in an 80m? area of Mia’s field is between 110 and 140 
inclusive. [4] 


The distribution of daisies (another type of weed) per m? in Mia’s field can be modelled by 
Po(A). The probability that the number of daisies in a 1m? area of the field is less than 
or equal to 2 is the same as the probability that the number of daisies in a 2m? area of the 
field is more than 2. 


(v) Write down an equation in A and solve it to find X. [4] 


Exercise 788. (9740 N2015/II/5.) (Answer on p. 2136.) 
The manager of a busy supermarket wishes to conduct a survey of the opinions of customers 
of different ages about different types of cola drink. 

(i) Give a reason why the manager would not be able to use stratified sampling. [1] 
(ii) Explain briefly how the manager could carry out a survey using quota sampling. [2] 


(iii) Give one reason why quota sampling would not necessarily provide a sample which is 
representative of the customers of the supermarket. [1] 


Exercise 789. (9740 N2015/II/6.) (Answer on p. 2136.) 


‘Droppers’ are small sweets that are made in a variety of colours. Droppers are sold in 
packets and the colours of the sweets in a packet are independent of each other. On 
average, 25% of Droppers are red. 


(i) A small packet of Droppers contains 10 sweets. Find the probability that there are at 
least 4 red sweets in a small packet. [2] 


A large packet of Droppers contains 100 sweets. 


(ii) Use a suitable approximation, which should be stated, to find the probability that a 
large packet contains at least 30 red sweets. [3] 
(iii) Yip buys 15 large packets of Droppers. Find the probability that no more than 3 of 
these packets contain at least 30 red sweets. [2] 
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Exercise 790. (9740 N2015/II/7.) (Answer on p. 2137.) 
The average number of errors per page for a certain daily newspaper is being investigated. 


(i) State, in context, two assumptions that need to be made for the number of errors per 
page to be well modelled by a Poisson distribution. [2] 


Assume that the number of errors per page has the distribution Po (1.3). 


(ii) Find the probability that, on one day, there are more than 10 errors altogether on the 
first 6 pages. [3] 
(iii) The probability that there are fewer than 2 errors altogether on the first n pages of 
the newspaper is less than 0.05. Write down an inequality in terms of n to represent 
this information, and hence find the least possible value of n. [2] 


Exercise 791. (9740 N2015/II/8.) (Answer on p. 2137.) 


A market stall sells pineapples which have masses that are normally distributed. The 
stall owner claims the mean mass of the pineapples is at least 0.9kg. Nur buys a random 
selection of 8 pineapples from the stall. The 8 pineapples have masses, in kg, as follows. 


0.80 1.000 0.82 0.85 0.93 0.96 0.81 0.89 


(i) Find unbiased estimates of the population mean and variance of the mass of pineap- 


ples. 
(ii) Test the stall owner’s claim at the 10% level of significance. [7] 
Exercise 792. (9740 N2015/II/9.) (Answer on p. 2137.) 


For events A, B and C it is given that P(A) = 0.45, P(B) = 0.4, P(C) = 0.3 and 
P(AnBnC) = 0.1. It is also given that events A and B are independent, and that 
events A and C' are independent. 


(i) Find P(B|A). [1] 
(ii) Given also that events B and C are independent, find P(A’n B'nC"). [3] 
(iii) Given instead that events B and C are not independent, find the greatest and least 

possible values of P(A‘n B'nC’). [4] 
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Exercise 793. (9740 N2015/II/10.) 


In an experiment the following information was gathered about air pressure P, measured 
in inches of mercury, at different heights above sea-level h, measured in feet. 


(Answer on p. 2138.) 















































h | 2000 | 5000 | 10000 | 15000 | 20000 | 25000 | 30000 | 35000 | 40000 | 45000 
P| 27.8 | 24.9 | 206 | 169 | 13.8 | 11.1 | 889 | 7.04 | 5.52 | 4.28 
(i) Draw a scatter diagram for these values, labelling the axes. [1] 


(ii) Find, correct to 4 decimal places, the product moment correlation coefficient between 


(a) h and P, 

(b) Inh and P, 

(c) Vh and P. [3] 
(iii) Using the most appropriate case from part (ii), find the equation which best models 

air pressure at different heights. [3] 


(iv) Given that 1 metre = 3.28 feet, re-write your equation from part (iii) so that it can 
be used to estimate the air pressure when the height is given in metres. [2] 








Exercise 794. (9740 N2015/II/11.) (Answer on p. 2139.) 
This question is about arrangements of all eight letters in the word CABBAGES. 


(i) Find the number of different arrangements of the eight letters that can be made. [2] 


(ii) Write down the number of these arrangements in which the letters are not in alpha- 
betical order. [1] 


(iii) Find the number of different arrangements that can be made with both the A’s to- 
gether and both the B’s together. [2] 


(iv) Find the number of different arrangements that can be made with no two adjacent 
letters the same. [4] 








Exercise 795. (9740 N2015/II/12.) (Answer on p. 2139.) 


In this question you should state clearly the values of the parameters of any normal distri- 
bution you use. 

The masses in grams of apples have the distribution N (300, 20) and the masses in grams 
of pears have the distribution N (200, 15), A certain recipe requires 5 apples and 8 pears. 


(i) Find the probability that the total mass of 5 randomly chosen apples is more than 
1600 grams. [2] 


(ii) Find the probability that the total mass of 5 randomly chosen apples is more than the 
total mass of 8 randomly chosen pears. [3] 


The recipe requires the apples and pears to be prepared by peeling them and removing the 
cores. This process reduces the mass of each apple by 15% and the mass of each pear by 
10%. 


(iii) Find the probability that the total mass, after preparation, of 5 randomly chosen 
apples and 8 randomly chosen pears is less than 2750 grams. [4] 
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Exercise 796. (9740 N2014/II/5.) (Answer on p. 2140.) 


An Internet retailer has compiled a list of 10000 regular customers and wishes to carry out 
a survey of customer opinions involving 5% of its customers. 


(i) Describe how the marketing manager could choose customers for this survey using 
systematic sampling. [2] 


(ii) Give one advantage and one disadvantage of systematic sampling in this context. [2] 


Exercise 797. (9740 N2014/II/6.) (Answer on p. 2140.) 


A team in a particular sport consists of 1 goalkeeper, 4 defenders, 2 midfielders and 4 
attackers. A certain club has 3 goalkeepers, 8 defenders, 5 midfielders and 6 attackers. 


(i) How many different teams can be formed by the club? [2] 
One of the midfielders in the club is the brother of one of the attackers in the club. 


(ii) How many different teams can be formed which include exactly one of the two broth- 
ers? 3] 


The two brothers leave the club. The club manager decides that one of the remaining 
midfielders can play either as a midfielder or as a defender. 


(iii) How many different teams can now be formed by the club? [3] 


Exercise 798. (9740 N2014/II/7.) (Answer on p. 2140.) 
Yan is carrying out an experiment with a fair 6-sided die and a biased 6-sided die, each 


numbered from 1 to 6. 


(i) Yan rolls the fair die 10 times. Find the probability that it shows a 6 exactly thrice. 
1] 

(ii) Yan now rolls the fair die 60 times. Use a suitable approximate distribution, which 
should be stated, to find the probability that the die shows a 6 between 5 and 8 times, 
inclusive. [3] 


ss ; _ i 
The probability that the biased die shows a 6 is 15: 
(iii) Yan rolls the biased die 60 times. Use a suitable approximate distribution, which 
should be stated, to find the probability that the biased die shows a 6 between 5 and 
8 times, inclusive. [3] 
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Exercise 799. (9740 N2014/II/8.) (Answer on p. 2141.) 


(a) Sketch a scatter diagram that might be expected when x and y related approximately 
by y = px” +t in each of the cases (i) and (ii) below. In each case your diagram should 
include 6 points, approximately equally spaced with respect to x, and with all x-values 
positive. 


(i) p and ¢ are both positive. 
(ii) p is negative and t is positive. [2] 


(b) The age in months (m) and prices in dollars (P) of a random sample of ten used cars 
of a certain model are given in the table. 





m 


11 


20 


28 


36 


AQ 


AT 


58 


62 


68 


75 





a 








112800 





102 600 





76 500 





72 000 





72 000 





69 000 





65 800 





57 000 





50 600 





47 600 








It is thought that the price after m months can be modelled by one of the formulae 


P=am+6, P=clnm+d, 


where a, 6, c and d are constants. 


(i) Find, correct to 4 decimal places, the value of the product moment correlation 
coefficient between 


(A) m and P; and 


(B) Inm and P. [2] 
(ii) Explain which of P = am+b and P =clnm+d is the better model and find the 
equation of a suitable regression line for this model. [3] 


(iii) Use the equation of your regression line to estimate the price of a car that is 50 
months old. [1] 








Exercise 800. (9740 N2014/II/9.) (Answer on p. 2141.) 


The number of minutes that the 0815 bus arrives late at my local bus stop has a normal 
distribution; the mean number of minutes the bus is late has been 4.3. A new company 
takes over the service, claiming that punctuality will be improved. After the new company 
takes over, a random sample of 10 days is taken and the number of minutes that the bus 
is late is recorded. The sample mean is t minutes and the sample variance is k? minutes’. 
A test is to be carried out at the 10% level of significance to determine whether the mean 
number of minutes late has been reduced. 


(i) State appropriate hypotheses for the test, defining any symbols that you use. [2] 
(ii) Given that k? = 3.2, find the set of values of ¢ for which the result of the test would 
be that the null hypothesis is not rejected. 4 


(iii) Given instead that t = 4.0, find the set of values of k? for which the result of the test 
would be to reject the null hypothesis. 5 








1469, Contents www.EconsPhDTutor.com 


Exercise 801. (9740 N2014/II/10.) (Answer on p. 2142.) 


A game has three sets of ten symbols, and one symbol from each set is randomly chosen to 
be displayed on each turn. The symbols are as follows. 


Setl + + + + x x x © 
Sett2 + + + x O O OO x * 
Set3 + + x x x x OO 


For example, if a + symbol is chosen from set 1, a © symbol is chosen from set 2 and a * 
symbol is chosen from set 3, the display would be +Ox. 


(i) Find the probability that, on one turn, 


(a) « * x is displayed, 1] 
(b) at least one * symbol is displayed, 2] 
(c) two xsymbols and one + symbol are displayed in any order. 3 


(ii) Given that exactly one of the symbols displayed is «, find the probability that the 
other two symbols are + and ©. A| 








Exercise 802. (9740 N2014/II/11.) (Answer on p. 2142.) 
An art dealers sells both original paintings and prints. (Prints are copies of paintings.) It 
is to be assumed that his sales of originals per week can be modelled by the distribution 
Po (2) and his sales of prints per week can be modelled by the independent distribution 
Po (11). 


(i) Find the probability that, in a randomly chosen week, 
(a) the art dealer sells more than 8 prints, [2] 
(b) the art dealer sells a total of fewer than 15 prints and originals combined. [2] 


(ii) The probability that the art dealer sells fewer than 3 originals in a period of n weeks 
is less than 0.01. Express this information as an inequality in n, and hence find the 








smallest possible integer value of n. 5 
(iii) Using a suitable approximation, which should be stated, find the probability that the 
art dealer sells more than 550 prints in a year (52 weeks). 3 
(iv) Give two reasons in context why the assumptions made at the start of this question 
may not be valid. 2| 
Exercise 803. (9740 N2013/II/5.) (Answer on p. 2143.) 


A large multi-national company has 100000 employees based in several different countries. 
To celebrate the 90th anniversary of the founding of the company, the Chief Executive 
wishes to invite a representative sample of 90 employees to a party, to be held at the 
company’s Headquarters in Singapore. 


(i) Explain how random sampling could be carried out to choose the 90 employees. Ex- 
plain briefly why this may not provide the representative sample that the Chief Ex- 
ecutive wants. [2] 

(ii) Name a more appropriate sampling method, and explain how it can be carried out to 
provide the representative sample that the Chief Executive wants. [2] 
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Exercise 804. (9740 N2013/II/6.) (Answer on p. 2143.) 


The continuous random variable Y has the distribution N ( pi o*). It is known that P(Y < 2a) = 
0.95 and P(Y <a) =0.25. Express y in the form ka, where k is a constant to be determined. 
4] 


Exercise 805. (9740 N2013/II/7.) (Answer on p. 2143.) 


On average one in 20 packets of a breakfast cereal contains a free gift. Jack buys n packets 
from a supermarket. The number of these packets containing a free gift is the random 
variable F’. 


(i) State, in context, two assumptions needed for F’ to be well modelled by a binomial 
distribution. [2] 


Assume now that F has a binomial distribution. 


(ii) Given that n = 20, find P(F = 1). [1] 
(iii) Given instead that n = 60, use a suitable approximation to find the probability that 
F is at least 5. State the parameter(s) of the distribution that you use. [3] 


Exercise 806. (9740 N2013/II/8.) (Answer on p. 2143.) For events A and B it is given 
that P(A) = 0.7, P(B|A’) = 0.8 and P (A|B’) = 0.88. Find 


(i) P(Bn A’), i 
(ii) P(A'N B’), [2] 
(iii) P(A B). [3] 
Exercise 807. (9740 N2013/II/9.) (Answer on p. 2143.) 


A motoring magazine editor believes that the figures quoted by car manufacturers for 
distances travelled per litre of fuel are too high. He carries out a survey into this by asking 
for information from readers. For a certain model of car, 8 readers reply with the following 
data, measured in km per litre. 


140 125 110 110 125 126 156 13.2 
(i) Calculate unbiased estimates of the population mean and variance. [2] 


The manufacturer claims that this model of car will travel 13.8 km per litre on average. It 
is given that the distances travelled per litre for cars of this model are normally distributed. 


(ii) Stating a necessary assumption, carry out a t-test of the magazine editor’s belief at 
the 5% significance level. [5] 


1471, Contents www.EconsPhDTutor.com 


Exercise 808. (9740 N2013/II/10.) (Answer on p. 2144.) 


(i) Sketch a scatter diagram that might be expected when z and y are related approxi- 
mately as given in each of the cases (A), (B) and (C) below. In each case your diagram 
should include 6 points, approximately equally spaced with respect to x, and with all 
x- and y-values positive. The letters a, b, c, d, e and f represent constants. 


(A) y=a+bz’, where a is positive and b is negative, 
(B) y=c+dlnz, where c is positive and d is negative, 


(C) y=et f where e is positive and f is negative. [3] 
as 


A motoring website gives the following information about the distance travelled, ykm, by 
a certain type of car at different speeds, xkmh!, on a fixed amount of fuel. 





Speed, x 88 | 96 | 104 | 112 | 120 | 128 
Distance, y | 148 | 147 | 144 | 138 | 126 | 107 
































(ii) Draw the scatter diagram for these values, labelling the axes. [1] 


(iii) Explain which of the three cases in part (i) is the most appropriate for modelling 
these values, and calculate the product moment correlation coefficient for this case.|2] 


(iv) It is required to estimate the distance travelled at a speed of 110kmh”'. Use the case 
that you identified in part (iii) to find the equation of a suitable regression line, and 
use your equation to find the required estimate. [3] 


Exercise 809. (9740 N2013/II/11.) (Answer on p. 2145.) 


A machine is used to generate codes consisting of three letters followed by two digits. Each 

of the three letters generated is equally likely to be any of the twenty-six letters of the 

alphabet A—Z. Each of the two digits generated is equally likely to be any of the nine digits 

1-9. The digit 0 is not used. Find the probability that a randomly chosen code has 
(i) three different letters and two different digits, 

(ii) the second digit higher than the first digit, 


(iii) exactly two letters the same or two digits the same, but not both, 








= NN 


(iv) exactly one vowel (A, E, I, O or U) and exactly one even digit. 
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Exercise 810. (9740 N2013/II/12.) (Answer on p. 2145.) 


A company has two departments and each department records the number of employees 
absent through illness each day. Over a long period of time it is found that the aver- 
age numbers absent on a day are 1.2 for the Administration Department and 2.7 for the 
Manufacturing Department. 


(i) State, in this context, two conditions that must be met for the numbers of absences to 
be well modelled by Poisson distributions. Explain why each of your two conditions 
may not be met. [3] 


For the remainder of this question assume that these conditions are met. You should assume 
also that absences in the two departments are independent of each other. 


(ii) Find the smallest number of days for which the probability that no employee is absent 
through illness from the Administration Department is less than 0.01. [2] 


Each employee absent on a day represents one ‘day of absence’. So, one employee absent 
for 3 days contributes 3 days of absence, and 5 employees absent on 1 day contribute 5 days 
of absence. 


(iii) Find the probability that, in a 5-day period, the total number of days of absence in 
the two departments is more than 20. [3] 


(iv) Use a suitable approximation, which should be stated together with its parameter(s), 
to find the probability that, in a 60-day period, the total number of days of absence 
in the two departments is between 200 and 250 inclusive. [4] 


Exercise 811. (9740 N2012/II/5.) (Answer on p. 2146.) 


The probability that a hospital patient has a particular disease is 0.001. A test for the 
disease has probability p of giving a positive result when the patient has the disease, and 
equal probability p of giving a negative result when the patient does not have the disease. 
A patient is given the test. 


(i) Given that p = 0.995, find the probability that 


(a) the result of the test is positive, [2] 
(b) the patient has the disease given that the result of the test is positive. [2] 


(ii) It is given instead that there is a probability of 0.75 that the patient has the disease 
given that the result of the test is positive. Find the value of p, giving your answer 
correct to 6 decimal places. [3] 
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Exercise 812. (9740 N2012/II/6.) (Answer on p. 2146.) 


On a remote island a zoologist measures the tail lengths of a random sample of 20 squirrels. 
In a species of squirrel known to her, the tail lengths have mean 14.0cm. She carries out 
a test, at the 5% significance level of whether squirrels on the island have the same mean 
tail length as the species known to her. She assumes that the tail lengths of squirrels on 
the island are normally distributed with standard deviation 3.8 cm. 


(i) State appropriate hypotheses for the test. [1] 
The sample mean tail length is denoted by %cm. 


(ii) Use an algebraic method to calculate the set of values of x for which the null hypothesis 
would not be rejected. (Answers obtained by trial and improvement from a calculator 


will obtain no marks.) [3] 
(iii) State the conclusion of the test in the case where Z = 15.8. [2] 
Exercise 813. (9740 N2012/II/7.) (Answer on p. 2146.) 


A group of fifteen people consists of one pair of sisters, one set of three brothers and ten 
other people. The fifteen people are arranged randomly in a line. 


(i) Find the probability that the sisters are next to each other. [2] 
(ii) Find the probability that the brothers are not all next to each other. [2] 
(iii) Find the probability that the sisters are next to each other and the brothers are all 
next to each other. [2] 

(iv) Find the probability that either the sisters are next to each other or the brothers are 
all next to each other or both. [2] 








Instead the fifteen people are arranged randomly in a circle. 


(v) Find the probability that the sisters are next to each other. [1] 
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Exercise 814. (9740 N2012/II/8.) (Answer on p. 2147.) 


Amy is revising for a mathematics examination and takes a different practice paper each 
week. Her marks, y% in week 2, are as follows. 





Week x 1|/2|-3|)4) 5] 6 
Percentage mark y | 38 | 63 | 67 | 75 | 71 | 82 
































(i) Draw a scatter diagram showing these marks. [1] 
(ii) Suggest a possible reason why one of the marks does not seem to follow the trend.[1] 
(iii) It is desired to predict Amy’s marks on future papers. Explain why, in this context, 
neither a linear nor a quadratic model is likely to be appropriate. [2] 


It is decided to fit a model of the form In(L - y) = a+ bx, where L is a suitable constant. 
The product moment correlation coefficient between x and In(L - y) is denoted by r. The 
following table gives values of r for some possible values of L. 


L| 91 92 93 
r | ~0.929944 -0.929918 





(iv) Calculate the value of r for L = 91, giving your answer correct to 6 decimal places. [1] 


(v) Use the table and your answer to part (iv) to suggest with a reason which of 91, 92 








or 93 is the most appropriate value for L. [1] 
(vi) Using the value for L, calculate the values of a and b, and use them to predict the 
week in which Amy will obtain her first mark of at least 90%. [4] 
(vii) Give an interpretation, in context, of the value of L. [1] 
Exercise 815. (9740 N2012/II/9.) (Answer on p. 2148.) 


In an opinion poll before an election, a sample of 30 voters is obtained. 


(i) The number of voters in the sample who support the Alliance Party is denoted by 
A. State, in context, what must be assumed for A to be well modelled by a binomial 
distribution. [2] 


Assume now that A has the distribution B (30, p). 
(ii) Given that p = 0.15, find P(A =3 or 4). [2] 
(iii) Given instead that p = 0.55, explain whether it is possible to approximate the distri- 
bution of A with 
(a) a normal distribution, 
(b) a Poisson distribution. [3] 


(iv) For an unknown value of p it is given that P(A = 15) = 0.06864 correct to 5 decimal 
places. Show that p satisfies an equation of the form p(1- p) = k, where k is a constant 
to be determined. Hence find the value of p to a suitable degree of accuracy, given 
that p< 0.5. [5] 
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Exercise 816. (9740 N2012/II/10.) (Answer on p. 2148.) 


Gold coins are found scattered throughout an archaeological site. 


(i) State two conditions needed for the number of gold coins found in a randomly chosen 
region of area 1 square metre to be well modelled by a Poisson distribution. [2] 


Assume that the number of gold coins in 1 square metre has the distribution Po (0.8). 


(ii) Find the probability that in 1 square metre there are at least 3 gold coins. [1] 


(iii) It is given that the probability that 1 gold coin is found in x square metres is 0.2. 
Write down an equation for x, and solve it numerically given that x < 1. [2] 


(iv) Use a suitable approximation to find the probability that in 100 square metres there 
are at least 90 gold coins. State the parameter(s) of the distribution that you use. [3] 








Pottery shards are also found scattered throughout the site. The number of pottery shards 
in 1 square metre is an independent random variable with the distribution Po(3). Use 
suitable approximations, whose parameters should be stated, to find 


(v) the probability that in 50 square metres the total number of gold coins and pottery 


shards is at least 200, [4] 
(vi) the probability that in 50 square metres there are at least 3 times as many pottery 
shards as gold coins. [3] 
Exercise 817. (9740 N2011/II/5.) (Answer on p. 2149.) 
The continuous random variable X has the distribution N (n, a), It is known that 
P(X < 40.0) = 0.05 and P(X < 70.0) = 0.975. Calculate the values of wand o. [4] 
Exercise 818. (9740 N2011/II/6.) (Answer on p. 2149.) 


It is desired to interview residents of a city suburb about the types of shop to be opened 
in a new shopping mall. In particular it is necessary to interview a representative range of 
ages. 
(i) Explain how a quota sample might be carried out in this context. [2] 
(ii) Explain a disadvantage of quota sampling in the context of your answer to part (i). 
[1] 
(iii) State the name of a method of sampling that would not have this disadvantage, and 
explain whether it would be realistic to use this method in this context. [2] 
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Exercise 819. (9740 N2011/II/7.) (Answer on p. 2149.) 


When I try to contact (by telephone) any of my friends in the evening, I know that on 
average the probability that I succeed is 0.7. On one evening I attempt to contact a fixed 
number, n, of different friends. If I do not succeed with a particular friend, I do not 
attempt to contact that friend again that evening. The number of friends whom I succeed 
in contacting is the random variable R. 


(i) State, in the context of this question, two assumptions needed to model R by a 
binomial distribution. [2] 
(ii) Explain why one of the assumptions stated in part (i) may not hold in this context. 
1] 
Assume now that these assumptions do in fact hold. 


(iii) Given that n = 8, find the probability that R is at least 6. [1] 


(iv) Given that n = 40, use an appropriate approximation to find P(R< 25). State the 
parameters of the distribution you use. [4] 


Exercise 820. (9740 N2011/II/8.) (Answer on p. 2150.) 


(i) Sketch a scatter diagram that might be expected for the case when z and y are related 
approximately by y = a+bx”, where a is positive and b is negative. Your diagram should 
include 5 points, approximately equally spaced with respect to x, and with all x- and 
y-values positive. [1] 


The table gives the values of seven observations of bivariate data, x and y. 





x| 2.0 | 2.5 | 3.0 | 3.5 | 4.0] 4.5 | 5.0 
18.8 | 16.9 | 14.5 | 11.7 | 8.6 | 4.9 | 0.8 



































(ii) Calculate the value of the product moment correlation coefficient, and explain why 
its value does not necessarily mean that the best model for the relationship between 
x and y is y=c+ dz. [2] 
(iii) Explain how to use the values obtained by calculating product moment correlation 


coefficients to decide, for this data, whether y = a+ bx” or y = c+ dz is the better 
model. [1] 


(iv) It is desired to use the data in the table to estimate the value of y for which x = 3.2. 
Find the equation of the least-squares regression line of y on x”. Use your equation 
to calculate the desired estimate. [3] 
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Exercise 821. (9740 N2011/II/9.) (Answer on p. 2150.) 


Camera lenses are made by two companies, A and B. 60% of all lenses are made by A and 
the remaining 40% by B. 5% of the lenses made by A are faulty. 7% of the lenses made by 
B are faulty. (Author’s remark: Assume that there are infinitely many lenses.) 


(i) One lens is selected at random. Find the probability that 


(a) it is faulty, [2] 

(b) it was made by A, given that it is faulty. [1] 
(ii) Two lenses are selected at random. Find the probability that 

(a) exactly one of them is faulty, [2] 

(b) both were made by A, given that exactly one is faulty. [3] 
Exercise 822. (9740 N2011/II/10.) (Answer on p. 2150.) 


In a factory, the time in minutes for an employee to install an electronic component is a 
normally distributed continuous random variable 7. The standard deviation of T is 5.0 
and under ordinary conditions the expected value of T' is 38.0. After background music is 
introduced into the factory, a sample of n components is taken and the mean time taken 
for randomly chosen employees to install them is found to be ¢ minutes. A test is carried 
out, at the 5% significance level, to determine whether the mean time taken to install a 
component has been reduced. 


(i) State appropriate hypotheses for the test, defining any symbols you use. [2] 
(ii) Given that n = 50, state the set of values of t for which the result of the test would be 
to reject the null hypothesis. [3] 


(iii) It is given instead that ¢ = 37.1 and the result of the test is that the null hypothesis is 
not rejected. Obtain an inequality involving n, and hence find the set of values that 
n can take. [4] 


Exercise 823. (9740 N2011/II/11.) (Answer on p. 2151.) 


A committee of 10 people is chosen at random from a group consisting of 18 women and 
12 men. The number of women on the committee is denoted by R. 


(i) Find the probability that R = 4. [3] 


(ii) The most probable number of women on the committee is denoted by r. By using the 
fact that P(R=r)>P(R=r+1), show that r satisfies the inequality 


(r+1)!(17-r)!(Q-r)l(r +3)! > rl (18-1)! (10-1)! (r +2)! 
and use this inequality to find the value of r. [5] 
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Exercise 824. (9740 N2011/II/12.) (Answer on p. 2152.) 


The number of people joining an airport check-in queue in a period of 1 minute is a random 
variable with the distribution Po (1.2). 


(i) Find the probability that, in a period of 4 minutes, at least 8 people join the queue. 
[1] 

(ii) The probability that no more than 1 person joins the queue in a period of t seconds 
is 0.7. Find an equation for t. Hence find the value of t, giving your answer correct 
to the nearest whole number. [4] 


(iii) The number of people leaving the same queue in a period of 1 minute is a random 
variable with the distribution Po(1.8). At 0930 on a certain morning there are 35 
people in the queue. Use appropriate approximations to find the probability that 
by 0945 there are at least 24 people in the queue, stating the parameters of any 
distributions that you use. (You may assume that the queue does not become empty 


during this period.) [5] 
(iv) Explain why a Poisson model would probably not be valid if applied to a time period 
of several hours. [1] 
Exercise 825. (9740 N2010/II/5.) (Answer on p. 2152.) 


At an international athletics competition, it is desired to sample 1% of the spectators to 
find their opinions of the catering facilities. 


(i) Give a reason why it would be difficult to use a stratified sample. [1] 
(ii) Explain how a systematic sample could be carried out. [2] 
Exercise 826. (9740 N2010/II/6.) (Answer on p. 2152.) 


The time required by an employee to complete a task is a normally distributed random 
variable. Over a long period it is known that the mean time required is 42.0 minutes. 
Background music is introduced in the workplace, and afterwards the time required, t 
minutes, is measured for a random sample of 11 employees. The results are summarised as 
follows. 


n=11, Sot = 454.3, yale 4g. 


(i) Find unbiased estimates of the population mean and variance. 


(ii) Test, at the 10% significance level, whether there has been a change in the mean time 
required by an employee to complete the task. [7] 
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Exercise 827. (9740 N2010/II/7.) (Answer on p. 2153.) 
For events A and B it is given that P(A) =0.7, P(B) =0.6 and P(A|B’) = 0.8. Find 


(i) P(AnB’), 2] 
(ii) P(AUB), [2] 
(iii) P(B"|A). [2] 
For a third event C, it is given that P(C’) = 0.5 and that A and C are independent. 

(iv) Find P(A’NC). [2] 

(v) Hence state an inequality satisfied by P (A’n Bn C).?? [1] 
Exercise 828. (9740 N2010/II/8.) (Answer on p. 2153.) 


The digits 1, 2, 3, 4 and 5 are arranged randomly to form a five-digit number. No digit is 
repeated. Find the probability that 


(i) the number is greater than 30000, [1] 
(ii) the last two digits are both even, [2] 
(iii) the number is greater than 30000 and odd. [4] 
Exercise 829. (9740 N2010/II/9.) (Answer on p. 2153.) 


In this question you should state clearly the values of the parameters of any normal distri- 
bution you use. 


Over a three-month period Ken makes X minutes of peak-rate telephone calls and Y min- 
utes of cheap-rate calls. X and Y are independent random variables with the distributions 
N (180, 30”) and N (400, 60°) respectively. 


(i) Find the probability that, over a three-month period, the number of minutes of cheap- 
rate calls made by Ken is more than twice the number of minutes of peak-rate calls. 
4] 


Peak-rate calls cost $0.12 per minute and cheap-rate calls cost $0.05 per minute. 


(ii) Find the probability that, over a three-month period, the total cost of Ken’s calls is 


ereater than $45. [3] 
(iii) Find the probability that the total cost of Ken’s peak-rate calls over two independent 
three-month periods is greater than $45. [3] 





°°3This question is terribly vague. A trivial but perfectly correct answer would be P(A’n BNC) > 0, but 
I suspect that any smart aleck who wrote this didn’t get the mark. 
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Exercise 830. (9740 N2010/II/10.) (Answer on p. 2153.) 


A car is placed in a wind tunnel and the drag force F for different wind speeds v, in 
appropriate units, is recorded. The results are shown in the table. 





vj/0O}| 4} 8 | 12] 16 | 20 | 24 | 28 | 32 | 36 
FY) QO} 2.5] 5.1 | 8.8 | 11.2 | 13.6 | 17.6 | 22.0 | 27.8 | 33.9 












































(i) Draw the scatter diagram for these values, labelling the axes clearly. [2] 


It is thought that the drag force F' can be modelled by one of the formulae 
F=atbv or F=c+dv" 


where a, 6, c and d are constants. 


(ii) Find, correct to 4 decimal places, the value of the product moment correlation coeff- 
cient between 
(a) v and F, 
(b) v? and F. [2] 
(iii) Use your answers to parts (i) and (ii) to explain which of F = a+ bv or F =c+ dv? is 
the better model. [1] 


(iv) It is required to estimate the value of v for which F = 26.0. Find the equation of a 
suitable regression line, and use it to find the required estimate. Explain why neither 
the model F = a+ bv nor the model F = c+ dv” should be used.“ [4] 


Exercise 831. (9740 N2010/II/11.) (Answer on p. 2154.) 


In this question you should state clearly all distributions that you use, together with the 
values of the appropriate parameters. 


The number of telephone calls received by a call centre in one minute is a random variable 
with distribution Po (3). 


(i) Find the probability that exactly 8 calls are received in a randomly chosen period of 
4 minutes. 2 

(ii) Find the length of time, to the nearest second, for which the probability that no calls 
are received is 0.2. 3| 

(iii) Use a suitable approximation to find the probability that, on a randomly chosen 
working day of 12 hours, more than 2200 calls are received. 4 








A working day of 12 hours on which more than 2200 calls are received is said to be ‘busy’. 


(iv) Find the probability that, in six randomly chosen working days, exactly two are busy. 
[2| 

(v) Use a suitable approximation to find the probability that, in 30 randomly chosen 
working days of 12 hours, fewer than 10 are busy. [4] 





°°4T have changed the wording of this sentence slightly. 
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Exercise 832. (9740 N2009/II/5.) (Answer on p. 2155.) 


A cinema manager wishes to take a survey of opinions of cinema-goers. Describe how a 
quota sample of size 100 might be obtained, and state one disadvantage of quota sampling. 


3] 


Exercise 833. (9740 N2009/II/6.) (Answer on p. 2155.) 


The table gives the world record time, in seconds above 3 minutes 30 seconds, for running 
1 mile as at lst January in various years. 





Year, x | 1930 | 1940 | 1950 | 1960 | 1970 | 1980 | 1990 | 2000 
Time, é | 40.4 | 36.4 | 31.3 | 24.5 | 21.1 | 19.0 | 16.3 | 13.1 












































(i) Draw a scatter diagram to illustrate the data. [2] 
(ii) Comment on whether a linear model would be appropriate, referring both to the 
scatter diagram and the context of the question. [2] 

(iii) Explain why in this context a quadratic model would probably not be appropriate for 
long-term predictions. [1] 

(iv) Fit a model of the form Int = a+ bz to the data and use it to predict the world record 
time as at lst January 2010. Comment on the reliability of your prediction. [3] 
Exercise 834. (9740 N2009/II/7.) (Answer on p. 2155.) 


A company buys p% of its electronic components from supplier A and the remaining 
(100-p)% from supplier B. The probability that a randomly chosen component sup- 
plied by A is faulty is 0.05. The probability that a randomly chosen component supplied 
by B is faulty is 0.03. 


(i) Given that p = 25, find the probability that a randomly chosen component is faulty.[2] 
(ii) For a general value of p, the probability that a randomly chosen component that is 


ts) 
faulty was supplied by A is denoted by f (p). Show that f (p) = Taps Prove by 
differentiation that f is an increasing function for 0 < p < 100, and explain what this 


statement means in the context of the question. [6] 


Exercise 835. (9740 N2009/II/8.) (Answer on p. 2156.) 
Find the number of ways in which the letters of the word ELEVATED can be arranged if 
(i) there are no restrictions, [1] 
(ii) T and D must not be next to one another, [2] 
(iii) consonants (L, V, T, D) and vowels (E, A) must alternate, [3] 
3 


(iv) between any two Es there must be at least 2 other letters. 
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Exercise 836. (9740 N2009/II/9.) (Answer on p. 2156.) 


The thickness in cm of a mechanics textbook is a random variable with the distribution 
N(2.5,0.17). 


(i) The mean thickness of n randomly chosen mechanics textbooks is denoted by M cm. 
Given that P (M > 2.53) = 0.0668, find the value of n. [3] 


The thickness in cm of a statistics textbook is a random variable with the distribution 
N (2.0, 0.087). 


(ii) Calculate the probability that 21 mechanics textbooks and 24 statistics textbooks will 
fit into a bookshelf of length 1m. State clearly the mean and variance of any normal 
distribution you use in your calculation. [3] 

(iii) Calculate the probability that the total thickness of 4 statistics textbooks is less than 
three times the thickness of 1 mechanics textbook. State clearly the mean and variance 


of any normal distribution you use in your calculation. [3] 
(iv) State an assumption needed for your calculation in parts (ii) and (iii). [1] 
Exercise 837. (9740 N2009/II/10.) (Answer on p. 2156.) 


A company supplies sugar in small packets. The mass of sugar in one packet is denoted by 
X grams. The masses of a random sample of 9 packets are summarised by 


eased, > 2° = 835.92: 
(i) Calculate unbiased estimates of the mean and variance of X. [2] 


The mean mass of sugar in a packet is claimed to be 10 grams. The company directors 
want to know whether the sample indicates that this claim is incorrect. 


(ii) Stating a necessary assumption, carry out a t-test at the 5% significance level. Explain 
why the Central Limit Theorem does not apply in this context. [7] 


(iii) Suppose now that the population variance of X is known, and that the assumption 
made in part (ii) is still valid. What change would there be in carrying out the test? 


[1] 
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Exercise 838. (9740 N2009/II/11.) (Answer on p. 2157.) 


A fixed number, n, of cars is observed and the number of those cars that are red is denoted 
by R. 


(i) State, in context, two assumptions needed for R to be well modelled by a binomial 


distribution. [2] 
Assume now that R has the distribution B(n,p). [2] 
(ii) Given that n = 20 and p=0.15, find P(4< R<8). [2] 


(iii) Given that n = 240 and p = 0.3, find P (R < 60) using a suitable approximation, which 
should be clearly stated. 

(iv) Given that n = 240 and p = 0.02, find P (R = 3) using a suitable approximation, giving 
your answer correct to 4 decimal places and explaining why the approximation is 


appropriate in this case. [3] 

(v) Given that n = 20 and P(R =0 or 1) = 0.2, write down an equation for the value of p, 
and find this value numerically. [2] 
Exercise 839. (9740 N2008/II/5.) (Answer on p. 2157.) 


A school has 950 pupils. 


(i) A sample of 50 pupils is to be chosen to take part in a survey. Describe how the 
sample could be chosen using systematic sampling. [2] 


The purpose of the survey is to investigate pupils’ opinions about the sports facilities 
available at the school. 


(ii) Give a reason why a stratified sample might be preferable in this context. [2] 


Exercise 840. (9740 N2008/II/6.) (Answer on p. 2158.) 


In mineral water from a certain source, the mass of calcium, X mg, in a one-litre bottle is 
a normally distributed random variable with mean jz. Based on observations over a long 
period, it is known that ys = 78. Following a period of extreme weather, 15 randomly chosen 
bottles of the water were analysed. The masses of calcium in the bottles are summarised 
by 


e100, + e707 200.0. 


Test, at the 5% significance level, whether the mean mass of calcium in a bottle has changed. 
6] 
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Exercise 841. (9740 N2008/II/7.) (Answer on p. 2158.) 


A computer game simulates a tennis match between two players, A and B. The match 
consists of at most three sets. Each set is won by either A or B, and the match is won by 
the first player to win two sets. 


The simulation uses the following rules. 


e The probability that A wins the first set is 0.6. 


e For each set after the first, the conditional probability that A wins that set, given that 
A won the preceding set, is 0.7. 


e For each set after the first, the conditional probability that B wins that set, given that 
B won the preceding set, is 0.8. 


Calculate the probability that 


(i) A wins the second set, [2] 
(ii) A wins the match, [3] 
(iii) B won the first set, given that A wins the match. [3] 
Exercise 842. (9740 N2008/II/8.) (Answer on p. 2158.) 


A certain metal discolours when exposed to air. To protect the metal against discolouring, 
it is treated with a chemical. In an experiment, different quantities, xml, of the chemical 
were applied to standard samples of the metal, and the times, t hours, for the metal to 
discolour were measured. The results are given in the table. 





x | 1.2 | 2.0 | 2.7 | 3.8 | 4.8 | 5.6 | 6.9 
2.2 | 4.5 | 5.8 | 7.3 | 7.6 | 9.0 | 9.9 



































(i) Calculate the product moment correlation coefficient between x and t, and explain 
whether your answer suggests that a linear model is appropriate. [3] 


(ii) Draw a scatter diagram for the data. [1] 


One of the values t appears to be incorrect. 


(iii) Indicate the corresponding point on your diagram by labelling it P, and explain why 
the scatter diagram for the remaining points may be consistent with a model of the 
form t=a+blna. [2] 


(iv) Omitting P, calculate least square estimates of a and b for the model t = a+ blnz. [2] 
(v) Estimate the value of t at the value of x corresponding to P. [1] 


(vi) Comment on the use of the model in part (iv) in predicting the value of t when zx = 8.0. 


1] 
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Exercise 843. (9740 N2008/II/9.) (Answer on p. 2159.) 


A shop sells two types of piano, ‘grand’ and ‘upright’. The mean number of grand pianos 
sold in a week is 1.8. 


(i) Use a Poisson distribution to find the probability that in a given week at least 4 grand 
pianos are sold. [2] 


The mean number of upright pianos sold in a week is 2.6. The sales of the two types of 
piano is independent. 


(ii) Use a Poisson distribution to find the probability that in a given week the total number 








of pianos sold is exactly 4. 2 
(iii) Use a normal approximation to the Poisson distribution to find the probability that 
the number of grand pianos sold in a year of 50 weeks is less than 80. 4 
(iv) Explain why the Poisson distribution may not be a good model for the number of 
grand pianos sold in a year. 2 
Exercise 844. (9740 N2008/II/10.) (Answer on p. 2159.) 


A group of diplomats is to be chosen to represent three islands, kK, L and M. The group 
is to consist of 8 diplomats and is chosen from a set of 12 diplomats consisting of 3 from 
kK, 4 from LZ and 5 from M. Find the number of ways in which the group can be chosen if 
it includes 








(i) 2 diplomats from K, 3 from L and 3 from MV, [2] 
(ii) diplomats from L and M only, [2] 
(iii) at least 4 diplomats from M, [2] 
(iv) at least 1 diplomat from each island. [4] 
Exercise 845. (9740 N2008/II/11.) (Answer on p. 2160.) 


The random variable X has the distribution N (50, 8°). Given that X, and X» are two 
independent observations of X, find 


i. POat S10), [2] 
2. P(X, > X24 15). [3] 


The random variable Y is related to X by the formula Y = aX +b, where a and 6b are 
constants with a> 0. 


3. Given that P(Y < 74) = P(Y > 146) = 0.0668, find the values of E(Y) and Var (Y), and 
hence find the values of a and b. [7] 


Exercise 846. (9233 N2008/I/1.) (Answer on p. 2160.) 


On a bookshelf there are 15 different books; 6 have red covers, 5 have blue covers and 4 
have green covers. All the red books are to be kept together, all the blue books are to be 
kept together and all the green books are to be kept together. In how many ways can the 
15 books be arranged on the bookshelf? [3] 
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Exercise 847. (9233 N2008/II/23.) (Answer on p. 2160.) 


The events A, B and C are such that P(A) = 0.2, P(C) = 0.4, P(AUB) = 0.4 and 
P(BnC) =0.1. Given that A and B are independent, find P (B) and show that B and C 
are also independent. [4] 


Exercise 848. (9233 N2008/II/26.) (Answer on p. 2160.) 
The number of times that an office photocopying machine breaks down in a week follows a 
Poisson distribution with mean 3. Find the probability that 

(i) the machine will break down more than twice in a given week, [2] 
(ii) the machine will break down at most three times in a period of four weeks. [3] 


(iii) Use a suitable approximation to find the probability that the machine will break down 
more than 50 times in a period of 16 weeks. [4] 


Exercise 849. (9233 N2008/II/27.) (Answer on p. 2160.) 


The masses of a certain type of electronic component produced by a machine are normally 
distributed with mean 32.40g. The machine is adjusted and a sample of 80 components 
is now taken and is found to have a mean mass 32.00g. The unbiased estimate of the 
population variance, calculated from this sample, is 2.892 g?. 








(i) Test at the 5% significance level whether this indicates a change in the mean. [5] 
(ii) Explain what you understand by the phrase ‘at the 5% significance’ in the context of 
this question. [2] 

(iii) Find the least level of significance at which this sample would indicate a decrease in 
the population mean. [3] 
Exercise 850. (9233 N2008/II/29.) (Answer on p. 2161.) 


Mr Sim and Mr Lee work in the same office and are expected to arrive by 9 a.m. each day. 
Both men drive to work. 


(i) The time taken for Mr Sim’s journey follows a normal distribution with mean 50 
minutes and standard deviation 4 minutes. Given that he regularly leaves home at 
8.05 a.m., find the probability that he will be late no more than once in a working 
week of 5 days. [5] 


(ii) Mr Lee’s journey time follows a normal distribution with mean 40 minutes and stan- 
dard deviation 5 minutes. Mr Lee leaves home at 8.10 a.m. each day. Find the 
probability that Mr Sim will arrive at work before Mr Lee on any particular day. [5] 


(iii) Find the probability that in a working week of 5 days, Mr Sim arrives at work before 
Mr Lee on at least 3 days. [2] 
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Exercise 851. (9233 N2008/II/30.) (Answer on p. 2161.) 


(i) The masses of valves produced by a machine are normally distributed with mean ju 
and standard deviation a. 12% of the valves have mass less than 86.50 g and 20% have 
mass more than 92.25g. Find yu and o. [4] 


(ii) The setting of the machine is adjusted so that the mean mass of the valves produced 
is unchanged, but the standard deviation is reduced. Given that 80% of the valves 
now have a mass within 2g of the mean, find the new standard deviation. [3] 


(iii) After the machine has been adjusted, a random sample of n valves is taken. Find the 
smallest value of n such that the probability that the sample mean exceeds p by at 
least 0.50g is at most 0.1. [5] 


Exercise 852. (9740 N2007/II/5.) (Answer on p. 2161.) 


(i) Give a real-life example of a situation in which quota sampling could be used. Explain 
why quota sampling would be appropriate in this situation, and describe briefly any 


disadvantage that quota sampling has. [4] 
(ii) Explain briefly whether it would be possible to use stratified sampling in the situation 
you have described in part (i). [1] 
Exercise 853. (9740 N2007/II/6.) (Answer on p. 2162.) 


In a large population, 24% have a particular gene A, and 0.3% have gene B. Find the 
probability that, in a random sample of 10 people from the population, at most 4 have 
gene A. [2] 


A random sample of 1000 people is taken from the population. Using appropriate approx- 
imations, find 


(i) the probability that between 230 and 260 inclusive have gene A, [3] 
(ii) the probability that at least 2 but fewer than 5 have gene B. [2] 
Exercise 854. (9740 N2007/II/7.) (Answer on p. 2162.) 


A large number of students in a college have completed a geography project. The time, 
x hours, taken by a student to complete the project is noted for a random sample of 150 
students. The results are summarised by 


ya = 4626, >> 2? = 147691. 
(i) Find unbiased estimates of the population mean and variance. [2] 


(ii) Test, at the 5% significance level, whether the population mean time for a student to 
complete the project exceeds 30 hours. [4] 
(iii) State giving a valid reason, whether any assumptions about the population are needed 
in order for the test to be valid. 1 
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Exercise 855. (9740 N2007/II/8.) (Answer on p. 2162.) 


Chickens and turkeys are sold by weight. The masses, in kg, of chickens and turkeys are 
modelled as having independent normal distributions with means and standard deviations 
as shown in the table. 





Mean Mass | Standard Deviation 
Chickens 2d 0.5 
Turkeys 10.5 2.1 























Chickens are sold at $3 per kg and turkeys at $5 per kg. 
(i) Find the probability that a randomly chosen chicken has a selling price exceeding $7. 


[2] 
(ii) Find the probability of the event that both a randomly chosen chicken has a selling 
price exceeding $7 and a randomly chosen turkey has a selling price exceeding $55.[3] 


(iii) Find the probability that the total selling price of a randomly chosen chicken and a 


randomly chosen turkey is more than $62. [4] 
(iv) Explain why the answer to part (iii) is greater than the answer to part (ii). [1] 
Exercise 856. (9740 N2007/II/9.) (Answer on p. 2163.) 


A group of 12 people consists of 6 married couples. 


(i) The group stand in a line. 


(a) Find the number of different possible orders. [1] 
(b) Find the number of different possible orders in which each man stands next to his 
wife. [3] 


(ii) The group stand in a circle. 


(a) Find the number of different possible arrangements. [1] 

(b) Find the number of different possible arrangements if men and women alternate. 
[2] 

(c) Find the number of different possible arrangements if each man stands next to 
his wife and men and women alternate. [2] 
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Exercise 857. (9740 N2007/II/10.) (Answer on p. 2163.) 
A player throws three darts at a target. The probability that he is successful in hitting the 
target with his first throw is 3° For each of his second and third throws, the probability of 
success is 

e twice the probability of success on the preceding throw if that throw was successful, 


e the same as the probability of success on the preceding throw if that throw was unsuc- 
cessful. 





Construct a probability tree showing this information. [3] 
Find 
(i) the probability that all three throws are successful, [2] 
(ii) the probability that at least two throws are successful, [2] 
(iii) the probability that the third throw is successful given that exactly two of the three 
throws are successful. [4] 
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Exercise 858. (9740 N2007/II/11.) (Answer on p. 2163.) 


Research is being carried out into how the concentration of a drug in the bloodstream varies 
with time, measured from when the drug is given. Observations at successive times give 
the data shown in the following table. 





Time (t minutes) 15 | 30 | 60 | 90 | 120 | 150 | 180 | 240 | 300 
Concentration (~ micrograms per litre) | 82 | 65) 43) 37) 22 | 19 12) 6 | 2 









































It is given that the value of the product moment correlation coefficient for this data is 
—0.912, correct to 3 decimal places. The scatter diagram for the data is shown below. 


x (micrograms per litre) 


100 





























t (minutes) 





0 50 100 150 200 250 300 300 


(i) Calculate the equation of the regression line of x on t. [2] 
(ii) Calculate the corresponding estimated value of x when t = 300, and comment on the 
suitability of the linear model. [2] 


The variable y is defined by y = Inz. For the variables y and f, 
(iii) calculate the product moment correlation coefficient and comment on its value, [2] 
(iv) calculate the equation of the appropriate regression line. [3] 


(v) Use a regression line to give the best estimate that you can of the time when the drug 
concentration is 15 micrograms per litre. [2] 
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Exercise 859. (9233 N2007/1/4.) (Answer on p. 2164.) 


The diagram shows two straight lines, ABCD and AEFGHIJ, which intersect at A. 
Triangles are to be drawn using three of the points A, B, C, D, E, F, G, H, I, J as 
vertices. 


D 





J 
(i) How many different triangles can be drawn which have the point A as one of the 
vertices? [1] 
(ii) How many different triangles in total can be drawn? [4] 
Exercise 860. (9233 N2007/II/23.) (Answer on p. 2164.) 


(i) A random sample of size 100 is taken from a population with mean 30 and standard 
deviation 5. Find an approximate value for the probability that the sample mean lies 


between 29.2 and 30.8. [6] 
(ii) Giving a reason, state whether it is necessary to make any assumptions about the 
distribution of the population. [1] 
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Exercise 861. (9233 N2007/II/25.) (Answer on p. 2164.) 


The numbers of men and women studying Chemistry, Physics and Biology at a college are 
given in the following table. 





Chemistry | Physics | Biology 
Men 12 16 a2 
Women 8 ile 20 


























One of these students is chosen at random by a researcher. Events IM, W, C and B are 
defined as follows. 


M: the student chosen is a man 
W: the student chosen is a woman. 
C’: the student chosen is studying Chemistry 
B: the student chosen is studying Biology 
Find 
(i) P(W|B). 
(ii) P(B|W). 
(iii) P(BUW). 


opine 


State, with a reason in each case, whether W and B are independent, and whether MV an 
C’ are mutually exclusive. 


1 Qu 








= 


Exercise 862. (9233 N2007/II/26.) (Answer on p. 2164.) 


At a fire station, each call-out is classified as either genuine or false. Call-outs occur at 
random times. On average, there are two genuine call-outs in a week, and one false call-out 
in a two-week period. 


(i) Calculate the probability that there are fewer than 6 genuine call-outs in a randomly 


chosen two-week period. [?] 
(ii) Using a suitable approximation, calculate the probability that the total number of 
call-outs in a randomly chosen six-week period exceeds 19. [?] 
Exercise 863. (9233 N2007/II/27.) (Answer on p. 2165.) 


An oil mixture is produced by mixing L litres of light oil with H litres of heavy oil. The 
random variables L and H are independent normal variables. The expected value of L is 5 
and its standard deviation is 0.1. The expected value of H is 3 and its standard deviation 
is 0.05. 


(i) Find the probability that the volume of the mixture lies between 7.9 litres and 8.2 
litres. 6] 


The density of light oil is 0.74 kilograms per litre, and the density of heavy oil is 0.86 
kilograms per litre. 
(ii) Find the probability that the mass of the mixture lies between 6.1kg and 6.2kg. [6] 


Mass 
Volume 
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[Density is defined by Density = 


Exercise 864. (9233 N2006/I/4.) (Answer on p. 2165.) 


A box contains 8 balls, of which 3 are identical (and so are indistinguishable from one 
another) and the other 5 are different from each other.°® 3 balls are to be picked out of the 
box; the order in which they are picked out does not matter. Find the number of different 
possible selections of 3 balls. [4] 


Exercise 865. (9233 N2006/II/23.) (Answer on p. 2165.) 


Two fair dice, one red and the other green, are thrown. 


A is the event: The score on the red die is divisible by 3. 
B is the event: The sum of two scores is 9. 


(i) Justifying your conclusion, determine whether A and B are independent. [3] 
(ii) Find P(AUB). [2] 
Exercise 866. (9233 N2006/II/25.) (Answer on p. 2165.) 


The mass of vegetables in a randomly chosen bag has a normal distribution. The mass of 
the contents of a bag is supposed to be 10kg. A random sample of 80 bags is taken and 
the mass of the contents of each bag, x grams, is measured. The data are summarised by 


>) (z- 10000) =-2510, >> (x-10 000)* = 2010203. 
(i) Test, at the 5% significance level, whether the mean mass of the contents of a bag is 
less than 10kg. [7] 


(ii) Explain, in the context of the question, the meaning of ‘at the 5% significance level’. 


1] 


Exercise 867. (9233 N2006/II/26.) (Answer on p. 2165.) 


In a weather model, severe floods are assumed to occur at random intervals, but at an 
average rate of 2 per 100 years. 


(i) Using this model, find the probability that, in a randomly chosen 200-year period, 
there is exactly one severe flood in the first 100 years and exactly one severe flood in 
the second 100 years. [3] 
(ii) Using the same model, and a suitable approximation, find the probability that there 


are more than 25 severe floods in 1000 years. [5] 





565 Assume also that each of the latter 5 balls is different from each of the first 3. 
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Exercise 868. (9233 N2006/II/28.) (Answer on p. 2166.) 


Observations are made of the speeds of cars on a particular stretch of road during daylight 
hours. It is found that, on average, 1 in 80 cars is travelling at a speed exceeding 125kmh"?, 
and 1 in 10 is travelling at a speed less than 40kmh‘t. 


(i) Assuming a normal distribution, find the mean and the standard deviation of this 
distribution. [4] 


(ii) A random sample of 10 cars is to be taken. Find the probability that at least 7 will 
be travelling at a speed in excess of 40kmh7?. [3] 
(iii) A random sample of 100 cars is to be taken. Using a suitable approximation, find the 
probability that at most 8 cars will be travelling at a speed less than 40kmh™!. [3 
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139. All Past-Year Questions, Listed and Categorised 
There are in total 200 points, so each point accounts for 0.5% of your final A-Level grade. 


Below I list points for curveball questions” in red. 


The clickable four-digit numbers are the page numbers for the Questions and Answers. 


139.1. 2019 (9758) 





©6See Preface/Rant—p. xlix. 
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139.2. 2018 (9758) 
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139.3. 2017 (9758) 


There are in total 200 points, so each point accounts for 0.5% of your final A-Level grade. 
Below I list points for curveball questions”®’ in red. 


The clickable four-digit numbers are the page numbers for the Questions and Answers. 





°©7See Preface/Rant—p. xlix. 


1498, Contents www.EconsPhDTutor.com 































































































Paper I: Pure Mathematics [100] 
Q A Part Topics Points 
1 1422 2088 Calc. Maclaurin 4=4 
1374 1994 F&G graphs, absolute value 2+4=6 
3 1499 2088 one differentiation, stationary points, 44327 
turning points, maximum, minimum 7 
4 1375 1995 F&A conic sections, differentiation, eh 
asymptotes, transformations 
factorisation, Remainder Theorem, 7 
| ee ue B&G differentiation, quadratic sla 
6 | 1402 2044 | Vectors vector equations, lines, planes 2+3+3=8 
t | (AQ2 2089 Calc. integration, trigonometry 3+5=8 
8 | 1410 2058 | Complex quadratic, factorisation 3+44+3=10 
9 | 1390 | 2024 | S&S summation, limits, Maclaurin e ; oe =13 
10! 1402 | 2044 | Vectors vector Pastors, 223) 44+445=18 
scalar product, quadratic 7 
11 | 1422 2089 Calc. differential equations : : | =13 
Paper II, Section A: Pure Mathematics [40] 
1 1375 1996 F&C parametric, differentiation, 345=8 
equations, points 
2 | 1390 2024 S&S arn Esene oh aan 24+4+3=9 
geometric progression 
3 1376 1996 F&C inverse, composite functions A+2+_ 12 
transformations, conic sections 4+2- 
4} 1423 | 2090 | Calc. Sep caaaranic 44+44+3=11 
integration, volume 
Section B: Probability and Statistics [60] 
5 | 1459 2136 P&S 34+24+2=7 
6 | 1459 2136 P&S 24+3+4=9 
7 | 1459 | 2136 | P&S vie a0 
5+2 
8 | 1460 2136 | P&S oe tage 
3+2 
2+1+ 
9 | 1461 2136 | P&S eto 
1+3+ 
1+1 
10| 1461 | 2136 | P&S Pos 20 
4+4 
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Paper I: Pure Mathematics [100] 










































































Q A Part Topics Points 
1! Deere 1996 F&G inequalities 24+3=5 
2.| 1423 2091 Calc. differentiation, calculator 24+3=5 
3 | 1376 1997 2+4=6 
4 | 1390 | 2025 4+3=7 
5 | 1403 | 2045 2+4+2=8 
6 1391 2024 9+2+3=10 
7 | 1411 2058 59+5=10 
8 1423 2091 9+3+3=11 
9 1424 2092 14+6+34+2=12 
10 | 1377 1998 3+54+3+2=13 
11 | 1403 2046 9+54+3=13 
Paper II, Section A: Pure Mathematics [40] 
1 1424 2093 tet 
2 1425 2093 3+2+5=10 
3 1425 2094 4+3+4=11 
4 1411 2059 2+44+3+3=12 
Section B: Probability and Statistics [60] 
9) 1462 2136 P&S 2+14+2=5 
6 | 1463 2136 P&S 1+14+1+4+3=10 
7 | 1463 2136 P&S 1+3+3+3=10 
8 1464 2136 P&S 24+14+2+3+3=11 
9 1464 2136 P&S 2+4+4=10 
10 | 1465 2136 P&S 24+24+2+4+4=14 
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Paper I: Pure Mathematics [100] 




















































































































Q A Part Topics Points 
1 | 1377 1998 44+2+1=7 
2 | 1377 Seo 3+3=6 
3 | 1425 2095 2+3=5 
4 | 1426 2096 6=6 
5 | 1378 2000 boa 2S] 
6 | 1426 2096 2+6=8 
7 | 1403 2047 2+3+5=10 
8 | 1391 2026 4+4+3=11 
9 | 1411 2059 9+44+4=13 
10 | 1426 2097 4+2+6=12 
11 | 1427 2098 3+6+4+34+3=15 
Paper II, Section A: Pure Mathematics [40] 
1 | 1427 | 2099 1+5=6 
2 | 1404 2048 2+54+3=10 
3 | 1378 2001 2+3+5=10 
4 | 1391 | 2026 6+1+4+3=14 
Section B: Probability and Statistics [60] 
5 | 1465 | 2136 P&s 1+2+1=4 
6 | 1465 2136 P&S ee eg eae b 
7 | 1466 2137 P&S 2or l= 7 
8 | 1466 2137 P&S 1=7 
9 | 1466 2137 P&S 1+3+4=8 
10 | 1467 2138 P&S 14+34+3+2=9 
11} 1467 | 2139 P&s 2+1+2+4=9 
12 | 1467 2139 P&S 2+3+4=9 
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Paper I: Pure Mathematics [100] 































































































Q A Part Topics Points 
1 | 1378 2001 4+1=5 
2 | 1427 2099 6=6 
3 | 1404 2048 24+2+12=5 
4 | 1379 2001 4+1=5 
5 | 1412 2060 4+3=7 
6 | 1392 2027 ot3+24+2=12 
7 | 1428 2100 2+2+3+4=11 
8 | 1428 2101 1+4+4=9 
9 | 1404 2048 4+4+5=13 
10 | 1429 2102 1+54+14+1+5=13 
11 | 1429 2103 64+2+3+3=14 
Paper II, Section A: Pure Mathematics [40] 
1 | 1379 2002 3+4=7 
2 | 1430 2104 9=9 
3 | 1392 2028 2+4+5=11 
4 | 1412 2060 2+44+3+4=13 
Section B: Probability and Statistics [60] 
5 | 1468 2140 P&S 2+2=4 
6 | 1468 2140 P&S 2+34+3=8 
7 | 1468 2140 P&S leo+e2 
8 | 1469 2141 P&S 2+2+3+1=8 
9 | 1469 2141 P&S 2+44+3=9 
10 | 1470 2142 P&S 14+24+3+4=10 
11 | 1470 | 2142 P&S 24+2+5+3+2=14 
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Paper I: Pure Mathematics [100] 





































































































Q A Part Topics Points 
1 | 1404 2048 2+3=5 
2 | 1379 2002 5=5 
3 | 1380 2002 4+2=6 
4 | 1412 2061 2+3+3=8 
5 | 1430 2104 3+5=8 
6 | 1405 2049 1+1+5=7 
7 | 1393 2029 3+2+4=9 
8 | 1412 2062 2+44+3=9 
9 | 1393 2029 5+54+3=13 
10| 1430 2105 44+2+3+4=13 
11| 1431 2106 3+5+3+6=17 
Paper II, Section A: Pure Mathematics [40] 
1 | 1380 2003 2+4=6 
2 | 1432 2107 3+6=9 
3 | 1432 2108 7+5=12 
4 |) 1405 2049 3+4+6=13 
Section B: Probability and Statistics [60] 
5 | 1470 2143 P&S 2+2=4 
6 | 1471 2143 P&S 4=4 
7 | 1471 2143 P&S 2+ 14326 
8 | 1471 2143 P&S 1+2+3=6 
9 | 1471 2143 P&S 2+9=7 
10| 1472 2144 P&S 3+14+24+3=9 
11| 1472 2145 P&S 2+2+4+4=12 
12 | 1473 2145 P&S 34+2+3+4=12 
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139.8. 2012 (9740) 
Paper I: Pure Mathematics [100] 
Q A Part Topics Points 
1 | 1380 2004 4=4 
2) 1432 2108 2+3+1=6 
3 | 1393 2030 2+2+4=8 
4A | 1433 2109 4+4=8 
5 | 1405 2050 4+4=8 
6 |) 1413 2063 2+2+4=8 
7 | 1381 2004 2+3+4=9 
8 | 1433 2109 4+3+2=9 
9 | 1406 2050 3+5+4=12 
10| 1434 2110 7+9=12 
11| 1434 2111 5+3+54+3=16 
Paper II, Section A: Pure Mathematics [40] 
1 | 1435 Allez 3+5=8 
2 | 1413 | 2063 2+2+2+3=9 
3 | 1381 |) 2005 1+3+1+1+4=10 
4A | 1394 2031 9+54+3=13 
Section B: Probability and Statistics [60] 
5 | 1473 2146 P&S 22 p= 7 
6 | 1474 2146 P&S Leste = 6 
7 | 1474 2146 P&S 2+2424+24+1=9 
8 | 1475 | 2147 P&S 14+14+2+14+1+4+1=11 
9 |) 1475 2148 P&S 2+2+3+5=12 
10| 1476 2148 P&S 2+14+24+34+4+3=15 
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139.9. 2011 (9740) 





Paper I: Pure Mathematics [100] 


































































































Q A Part Topics Points 
1 | 1381 2007 4=4 
2 | 1381 2007 3+2=5 
3 | 1435 Z1i2 2424327 
4 |) 1435 ABP: 3+3+2=8 
5 | 1435 2lls aria S at 
6 | 1394 2032 2+3+6=11 
7 | 1406 | 2051 6+2+1+2=11 
8 | 1436 2114 2+9+3+2=12 
9 | 1395 2033 6+4=10 
10| 1413 2064 4+3+14+1+1=10 
11 | 1406 2051 4+4+4+3=15 
Paper II, Section A: Pure Mathematics [40] 
1 | 1413 2065 3+24+3=8 
2 | 1436 2114 3+6=9 
3 | 1382 2007 4+4+3=11 
4 | 1436 2115 5+14+6=12 
Section B: Probability and Statistics [60] 
5 | 1476 2149 P&S 4=4 
6 | 1476 2149 P&S 2+142=5 
7 | 1477 | 2149 P&S 2+1+1+4=8 
8 | 1477 | 2150 P&S 14+2+14+3=7 
9 | 1478 2150 P&S 21 243=8 
10| 1478 2150 P&S 24+3+4=9 
11| 1478 2151 P&S 3+5=8 
12 | 1479 2152 P&S 14+4+5+1=11 
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139.10. 2010 (9740) 





Paper I: Pure Mathematics [100] 































































































Q A Part Topics Points 
1 | 1407 2052 2+3=5 
2 | 1437 2115 3+3=6 
3 | 1395 2034 3+2=5 
4) 1437 2115 4+4=8 
5 | 1382 2008 5+3=8 
6 | 1437 2116 2+2+44+2=10 
7 | 1438 2116 7+4=11 
8 | 1414 2066 24+34+44+2=11 
9 | 1438 ziiy 64+2+24+2=12 
10 | 1407 2052 2+44+34+3=12 
11] 1439 2118 4+4+4=12 
Paper II, Section A: Pure Mathematics [40] 
1 | 1414 2067 2+9=7 
2 | 1395 2034 9+4+2=11 
3 | 1439 2119 94+24+24+2=11 
4 | 1383 | 2009 14+2+2+3+3=11 
Section B: Probability and Statistics [60] 
5 | 1479 2152 P&S 14+2=3 
6 | 1479 2152 P&S 1=7 
7 | 1480 2153 P&S 24242424129 
8 | 1480 2153 P&S 14+2+4=7 
9 | 1480 2153 P&S 4+3+3=10 
10 | 1481 | 2153 P&S 24+2+1+4=9 
11| 1481 2154 P&S 24+34+4424+4=15 
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139.11. 


2009 (9740) 





Paper I: Pure Mathematics [100] 































































































Q A Part Topics Points 
1 | 1383 2010 4+2=6 
2 | 1439 2119 5=5 
3 | 1396 2035 24or22S7 
4 | 1439 21240) 2545 =8 
5 | 1396 2035 4+4=8 
6 | 1383 2011 44+24+2=8 
7 | 1440 2120 59+4=9 
8 | 1396 2036 4+3+4=11 
9 | 1414 2068 94+24+5=12 
10 | 1407 2052 3+4+5=12 
11) 1440 | 2121 2+4+4+2+2=14 
Paper II, Section A: Pure Mathematics [40] 
1 | 1440 Zee 1+3+4=8 
2 | 1407 | 2053 2+2+2+4=10 
3. | 1384 2012 5+2+3=10 
4 | 1441 2123 5+7=12 
Section B: Probability and Statistics [60] 
5 | 1482 2155 P&S 3=3 
6 | 1482 2159 P&S 24+24+14+3=8 
7 | 1482 2155 P&S 2+6=8 
8 | 1482 2156 P&S 14+24+34+3=9 
9 | 1483 | 2156 P&S 3+3+3+1=10 
10 | 1483 2156 P&S 2+7+1=10 
11 | 1484 | 2157 P&S 2424343 +2=12 
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139.12. 2008 (9740) 





Paper I: Pure Mathematics [100] 































































































Q A Part Topics Points 
1 | 1441 2120 4=4 
2 | 1397 | 2036 5=5 
3 | 1408 2053 1+3+2=6 
4 | 1441 2124 2 ze Se 
5 | 1442 2124 3+4=7 
6 | 1442 2125 o+to=l1 
7 | 1442 2125 10= 1 
8 | 1414 | 2069 3+4+4=1. 
9 | 1384 2012 gt24+145=1 
10) 1397 | 2036 9+2+3+4=1: 
11 | 1408 2053 24+44394+2+4=1 
Paper II, Section A: Pure Mathematics [40] 
1 | 1442 2126 2+34+143=9 
2 | 1443 2126 3+34+3=9 
3 | 1415 | 2070 24+2+3+4=1 
4 | 1385 | 2016 2+34+1+5=1 
Section B: Probability and Statistics [60] 
5 | 1484 2157 P&S 24+2=4 
6 | 1484 2158 P&S 6=6 
7 | 1485 2158 P&S 2373028 
8 | 1485 | 2158 P&S ot le 2e2e1 eli 
9 | 1486 2159 P&S 24+2+44+2=1 
10 | 1486 2159 P&S 24+24+2+4=1 
11 | 1486 2160 P&S 2637 SA. 
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139.13. 2007 (9740) 





Paper I: Pure Mathematics [100] 































































































Q A Part Topics Points 
1 | 1385 2018 14+4=5 
2 | 1386 2018 3+3=6 
3 | 1415 2073 3+4=7 
4) 1445 2130 6+1=7 
5 | 1386 2019 4+3=7 
6 | 1408 2054 2+3+4=9 
7 | 1415 2074 3+4+3=10 
8 | 1409 2054 o+a+3=11 
9 | 1398 | 2038 2+24+34+2+2=11 
10 | 1398 2038 4+5+5=14 
11| 1446 2130 2+64+5=13 
Paper II, Section A: Pure Mathematics [40] 
1 | 1386 2019 6=6 
2.| 1389 2039 4+2+2+2=10 
3 | 1446 2131 4+5+2=11 
4 | 1446 2131 64+5+2=13 
Section B: Probability and Statistics [60] 
5 | 1488 2161 P&S 4+1=5 
6 | 1488 2162 P&S 2G 2=7T 
7 | 1488 2162 P&S 2+44+1=7 
8 | 1489 2162 P&S 2+3+4+1=10 
9 | 1489 2163 P&S l+3+¢142+2=9 
10 | 1490 2163 P&S 34+2+2+4=11 
11| 1491 2163 P&S 2424+24+34+2=11 
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139.14. 2008 (9233) 


The 9233 syllabus was significantly heftier. In particular, in addition to “Pure Mathemat- 
ics” and “Probability and Statistics”, there was also “Particle Mechanics”. This textbook 
has omitted the questions on Particle Mechanics and also any other questions that would 
also be out of the 9740 syllabus. (This explains why there seem to be some missing ques- 


tions. ) 


The format of the papers was also somewhat more complicated. The last question of Paper 
1 was an either-or question (i.e. examinees had a choice of doing one of two questions 
given). Paper 2 contained four sections, of which only Section A (Pure Mathematics) was 
mandatory and examinees had to choose to do one of Sections B, C, or D. And again, the 


last question of each of these four sections was an either-or question. 


Note also that Permutations and Combinations fell under “Pure Mathematics”. 





Paper 1: Pure Mathematics 








































































































Q A Part Topics Points 
1 | 1486 2160 3=3 
2 | 1443 2126 4+5=9 
3 | 1443 2127 5=5 
4 | 1444 2127 4=4 
6 | 1444 2127 34+2=5 
8 | 1444 2127 5=5 
9 | 1415 2071 2+2+4=8 
10 | 1444 2128 3+5=8 
11] 1408 2054 5+4=9 
13 | 1444 2128 5+2+5=12 
14] 1445 2129 6+6=12 
14] 1385 2014 4+3+1+2=10 
Paper 2, Section A: Pure Mathematics 
1 | 1445 2129 3=3 
2 | 1397 | 2037 6=6 
3 | 1415 2073 3+4=7 
5 746 2129 5+3=8 
Paper 2, Sections B—D: Probability and Statistics 
23 | 1487 | 2160 P&S 4=4 
26 1487 | 2160 P&S 2+3+4=9 
27| 1487 | 2160 P&S 5+2+3=10 
29} 1487 | 2161 P&S 5+5+2=12 
30 | 1488 2161 P&S 4+3+5=12 
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139.15. 2007 (9233) 





Paper 1: Pure Mathematics 








































































































Q A Part Topics Points 
2 | 1446 2131 a= 5 
3 | 1447 2132 5=5 
4 | 1492 2164 L+4=5 
7 | 1409 2054 reo 
8 | 1447 2132 3+4=7 
9 | 1416 2075 5+3=8 
10 | 1447 2132 3+5=8 
11 | 1447 Zio 9=9 
13 | 1448 2133 34+44+2+3=12 
14] 1399 2040 6=6 
14] 1448 2133 4+4+4=12 
Paper 2, Section A: Pure Mathematics 
1 | 1399 2040 =o 
2 | 1409 2054 2+5=7 
4 | 1387 2020 2+34+3=8 
5 | 1416 2075 3+24+34+2=10 
Paper 2, Sections B—D: Probability and Statistics 
23 1492 2164 P&S 6+1=7 
25 | 1493 | 2164 P&S 1+1+2+4=8 
26 | 1493 2164 P&S fs? 
27 | 1493 2165 P&S 6+6=12 
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139.16. 2006 (9233) 





Paper 1: Pure Mathematics 
















































































Q A Part Topics Points 
1 | 13899 2040 4=4 
3 | 1387 | 2020 3+1=4 
4 | 1494 2165 4=4 
5 | 1416 2076 34+2=5 
6 | 1416 2077 3+3=6 
7 | 1449 2130 6=6 
8 | 1449 2134 t=7 
9 | 1449 2134 2+6=8 
11] 1399 2040 4+1+4=9 
14] 1450 2135 24+2+34+2+3=12 
14] 1409 2055 7+2=9 
Paper 2, Section A: Pure Mathematics 
1387 | 2021 5=5 
1450 2135 3+3=6 
Paper 2, Sections B—D: Probability and Statistics 

23} 1494 2165 P&S 34+2=5 
25} 1494 2165 P&S 7+1=8 
26} 1494 2165 P&S 3+5=8 
28} 1495 2166 P&S 4+3+3=10 
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140. H1 Maths Questions (2016-19) 


140.1. Hi Maths 2019 Questions 


Answers for Probability and Statistics questions to be written. 
Exercise 869. (8865 N2019/1.) (Answer on p. 1518.) 


An online company sells three styles of television: Style A, Style B and Style C. Last month, 
the company sold 22 Style A, 16 Style B and 8 Style C televisions and the total income 
from these sales was $96480. The income from the Style A sales was $3120 more than 
the income from Style B sales. This month, the company has sold two more of each of the 
three styles of television than last month and the total income has increased by $13 260. 
The prices instead of the televisions have not changed. 


Find the price of a Style B television. [5] 
Exercise 870. (8865 N2019/2.) (Answer on p. 1518.) 











The diagram shows a sketch of the curve with equation y = x + In(3a2+4). The points A 
and B are where the curve cuts the x-axis and the y-axis respectively. 

(i) Find the coordinates of A, correct to 3 significant figures. [1] 
(ii) Find the equation of the asymptote to the curve. [1] 


(iii) Without using a calculator, find the equation of the tangent to the curve at the point 
B, giving your answer in the form y = mx +c, where m and c are exact constants. [5] 


Exercise 871. (8865 N2019/3.) (Answer on p. 1518.) 


(i) Without using a calculator, solve the simultaneous equations 
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y = 2x7 + 6a — 3, 
y=1lr+9. [4] 


(ii) Hence, or otherwise, solve the inequality 


2x7 +6r-3<1lxr+9. [2] 
Exercise 872. (8865 N2019/4.) (Answer on p. 1519.) 
il 
(i) Differentiate t———, with respect to z. [2] 
(2 - 32) 


2 
a 

(ii) Differentiate (2va - “| with respect to x, giving your answer in the form p+ s 
a £ 


where p and q are constants to be determined. [3] 
1 
(iii) Without using a calculator, find the exact value of [ (2? +2-e7*) dz. [4] 
0 
Exercise 873. (8865 N2019/5.) (Answer on p. 1519.) 


Mr Tan has just set up a company to manufacture and sell computers. He will monitor the 
profit from the sale of his computers over a period of six years, before he decides whether 
to continue with his business. 


Mr Tan is a mathematician and he wants to model his total profit, P thousand dollars, at 
time t years. He believes that 


P=kx15'-0.6, 


where k is a positive constant. 

(i) Explain why k = 0.6. 

(ii) Sketch the graph of P against t for 0<t<6. 
(iii) Find the value of P when t = 6. 


(iv) Find the rate at which P is increasing when t = 3. 





oe, 





Mr Tan is also interested in modelling his costs, C thousand dollars per year, at any point 
in time. ‘The model he uses is 


C=2P—10F + 25¢ +10, for 0<t<6. 

(v) Use differentiation to find the values of t which give stationary points on the graph of 
C against t. For each point, justify whether it is a minimum or a maximum. [5] 

(vi) Sketch the graph of C against t, stating the coordinates of any intersections with the 
coordinate axes. [2] 

6 
(vii) Use your calculator to find the value of [ ti — 10¢? + 25¢ + 10) dt. In the context of 
0 

the question, what does this value represent? [2] 
Exercise 874. (8865 N2019/6.) (Answer on p. 1515.) 
(i) Find the number of different arrangements of the 9 letters of the word CHEMISTRY 
in which the two vowels (E and I) are next to each other. [2] 
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(ii) Find the number of ways in which 4 letters can be selected from the 9 letters of the 
word CHEMISTRY, given that at least one vowel must be included. [3] 


A874 (8865 N2019/6). XXX 
Exercise 875. (8865 N2019/7.) (Answer on p. 1515.) 


In a large insurance company, the most successful members of the sales team are rewarded 
with an annual bonus. In 2016 exactly 32% of the members of the sales team received the 
annual bonus. 


E(dhterhindsthe ro karditeathataeetha benny eetbemareceived the bonus in 2016. [1] 


(b) Find the probability that at least five of them received the bonus in 2016. [2] 


(ii) For random samples of size eight, find the mean and variance of the number of members 


of the sales team who received the bonus in 2016. [2] 
A875 (8865 N2019/7). XXX 
Exercise 876. (8865 N2019/8.) (Answer on p. 1515.) 
Two events A and B are such that P(A) = 0.6, P(B) =0.4 and P(A|B) =0.1. 
(i) Find the probability that both A and B occur. [2] 
(ii) Describe in words what is meant by P(A’n B). [2] 
(iii) Find P(A’UB). [2] 
A876 (8865 N2019/8). XXX 
Exercise 877. (8865 N2019/9.) (Answer on p. 1515.) 


A large number of students at a college take two tests in science: a written test marked out 
of 30 and a practical test marked out of 5. The table shows the marks, x, for the written 
test and the marks, y, for the practical test, for a random sample of 9 students. 





x| 16 | 20 | 25 | 19 | 17 | 30 | 21 | 24 | 25 
y | 3.1/3.6 | 3.9 | 3.4 | 2.9 | 4.8 | 3.9 | 3.8 | 3.7 









































(i) Give a sketch of the scatter diagram for the data, as shown on your calculator. [2] 


(ii) Find the product moment correlation coefficient and comment on its value in the 
context of the data. [2] 


(iii) Find the equation of the regression line of y on x, giving your answer in the form 
y = ax +b, with the values of a and 6 correct to 3 significant figures. Sketch this line 
on your scatter diagram. [2] 


(iv) Use the equation of your regression line to calculate an estimate for the mark obtained 
in the practical test by a student who obtained 22 marks in the written test. Comment 
on the reliability of your estimate. [2] 


A877 (8865 N2019/9). XXX 
Exercise 878. (8865 N2019/10.) (Answer on p. 1516.) 


Mia’s sock drawer contains 10 red socks, 8 pink socks and 7 blue socks. She chooses 2 socks 
at random from the drawer. 


(i) Draw a tree diagram to represent this situation, showing all possible outcomes. [2] 
(ii) Find the probability that both socks are red. [1] 
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(iii) Find the probability that the 2 socks are of different colours. [3] 


Mia now chooses a third sock from the drawer, without replacing the other 2 socks. 


(iv) Find the probability that Mia now has at least 2 red socks. [4] 
A878 (8865 N2019/10). XXX 
Exercise 879. (8865 N2019/11.) (Answer on p. 1516.) 


The times taken, in minutes, for runners from two large athletics clubs, the Arrows and 
the Beavers, to run 5000 metres have independent normal distributions with means and 
standard deviations as shown in the following table. 





Standard 
Deviation 
Arrows | 14.8 0.55 
Beavers | 15.2 0.65 


Mean 























(i) Find the probability that the time taken to run 5000 metres by a randomly chosen 
runner from the Arrows is more than 15.0 minutes. [1] 


(ii) Find the probability that the sum of the times taken by 2 randomly chosen runners 
from the Arrows and 3 randomly chosen runners from the Beavers is less than 75 
minutes. [4] 


In a competition, teams of 6 runners enter a 5000-metre race. The score for each team is 
the sum of the times taken, in minutes, by the 6 runners to complete the race. The team 
with the lowest score wins the competition. The Arrows and the Beavers each enter a team 
of 6 randomly chosen runners from their clubs. 


(iii) Find the probability that the score of the Arrows team is within +5 of the score of the 
Beavers team. [4] 


Any team that scores fewer than 90 wins a medal. 


(iv) Find the probability that neither the Arrows nor the Beavers win a medal. [4] 
A879 (8865 N2019/11). XXX 
Exercise 880. (8865 N2019/12.) (Answer on p. 1517.) 


A baker states that a certain a small cake that he produces has a mean mass of 100 grams. 
A food inspector wishes to investigate whether the mean mass of these cakes is actually 
less than 100 grams. The food inspector selects a random sample of 60 cakes of this type. 
The masses, x, in grams, are summarised by 


Six =5928, Ya? =587000. 


(i) Find unbiased estimates of the population mean and variance. [3] 
(ii) Determine the conclusion the food inspector should reach if she carries out a test at 
the 5% significance level. [5] 


The baker still states that the mean mass of these cakes is 100 grams. Using a 5% signif- 
icance level, a test is carried out on a new random sample of 60 cakes of this type with 
the null hypothesis ~ = 100 and the alternative hypothesis ~ # 100, where grams is the 
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population mean mass. The test indicates that there is sufficient evidence to reject the 
baker’s statement. 


(iii) Find the set of values within which the mean mass of this sample must lie. [5] 


A880 (8865 N2019/12). XXX 
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140.2. H1 Maths 2019 Answers 


A869 (8865 N2019/1). Let A, B, and C be the price ($) of a Style A, Style B, and Style 
C television, respectively. Then the information we were given may be written as 


224+16B+8C 296480, 224216B+3120, 2(A+B+C) 213260. 
Rearrange 2 to get 22A-16B <3 120. 
1 
Take ++ = < _-4x 3 to get 


1 1 
22A + 16B + 8C' += x (224 - 16B) -4 x 2(A+ B+C) = 96480 + 5 x 3120-4 x 13 260, 


2% 954=45000 or A21800. 


Plug 2 into = to get 22 x 1800 - 16B = 3120 or B = (22 x 1800 - 3120) /16 = 2280. 
The price of a Style B television is $2280. *% 
A870 (8865 N2019/2)(i) You’re supposed to just mindlessly use your calculator: 





A x (-0.677,0). a 
(ii) 32+ 4=0 or = -4/3. 
(iii) B = (0,ln4). Compute a =1+ = rie 2 ae : Hence, the requested tangent 





line is 
t c 
y—In4= 7 (x-0) or y= qr+ ind. 
A871 (8865 N2019/3)(i) Plug the second equation into the first and rearrange: 
llr +9 =227+62-3 = 0 = 227 - 5x - 12. 


By the quadratic formula, 


5+ (-5) -4(2)(-12)_52V25+96 541l_, 3 


D(2) 4 , "= 





C= 


15 
Plugging these values of x into the second given equation, we get y = 53, rigs Hence, the 


solutions to the pair of given simultaneous equations are 
3 15 
(4,53) and (-5 -=). 


(ii) In the quadratic polynomial 2x? — 5a - 12, the coefficient on x” is 2 > 0. Hence, the 
eraph of the quadratic equation y = 2x? — 5a - 12 is U-shaped. 
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3 4 
9 


So, 227+62-3<1lr+9 <=> 02227-52-12 <> ve|-5,4]. 

















i - 12 
A872 (8865 N2019/4) (i) © ——— oe ® 
dx (2 - 32)" (2-32) (2-32) 
2 

iy 3 \ 3 1 ° 1 1 7 
(i) 2 (2ve-5] -2(2ve-—.)(s *s3a)- VEL 73/2 (22 - 3) (20+3) = (40° -9)= 

: © 

a 
1 2 1 
(iit) f (x? + 2-0) de =[7 +204 50% =(54+2+50°)-(0+0+5)- =a oe ® 
0 2 2° In 2 2) 6 2 
A873 (8865 N2019/5)(i) At t=0, P=0=kx1.5'-0.6 <= k=0.6. 
(ii) A p 
xX 
t 
> 

(iii) P (6) = 0.6 x 1.5°° — 0.6 x 6.23. 

dP dP 
iv) — =6(In1.5) 1.5". =| = 6 (ln 1.5) 1.6? 8.21, 
(iv) = 6(In1.5)1.5'. So, ls 6 (In 1.5) 1.53 8 

_ 20+ y/(-20)? — 4(3) (25 /100 

@)\ = 23P = oe =0 = it= Seca Saal Je me 

dt dt 't=8 2 (3) 6 
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20410 | K 5 
Goes 
2: ; ; dc 
The coefficient on t* is 3 > 0, so the graph is U-shaped. Sign diagram for ae 
+ - | + 
5 5 
3 


dC 5 dC 5 5 
pute ap > 0 to the left of 3 and 7 < 0 to the right of 3? the point 3 is a strict local 
minimum. 


dc dc 
Since ai < 0 to the left of 5 and ae > 0 to the right of 5, the point 5 is a strict local 
maximum. 








(vi) Ac 
(0, 10) 
> 
t 
(vii) 114. Total costs over the first six years. ® 
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140.3. Hi Maths 2018 Questions 


Answers for Probability and Statistics questions to be written. 











Exercise 881. (8865 N2018/1.) (Answer on p. 1526.) 
Find the values of the real constant k for which the equation 3k? - (k-6) 2-2 = 0 has 
two distinct roots. [4] 
Exercise 882. (8865 N2018/2.) (Answer on p. 1526.) 
(i) Differentiate 31n (42° + 2). [2] 
an = 2 9] 

(ii) Integrate Fe [3] 
Exercise 883. (8865 N2018/3.) (Answer on p. 1526.) 


Three large companies P, Q and R recruit graduates. The companies take part in a scheme 
to train the graduates as accountants, financial advisers or consultants. The training cost 
for each accountant is the same for each company, as are the training costs for each financial 
adviser and each consultant. 


Last year, Company P paid $294100 to train 5 accountants, 12 financial advisers and 8 
consultants. 


Company Q paid $270100 to train 9 accountants, 7 financial advisers and 3 consultants. 
Company R paid $122 700 to train 3 accountants and 6 financial advisers. 


(i) Express this information as 3 linear equations and hence find the cost of training one 
financial adviser last year. [5] 


Company S decides to join the scheme this year. The company will train an equal number 
of accountants and financial advisers but no consultants. Training costs have increased by 
10%. The company has $200 000 to spend on this training. 


(ii) What is the greatest possible number of accountants that company S will be able to 
train this year? [3] 
Exercise 884. (8865 N2018/4.) (Answer on p. 1526.) 


A curve C has equation y = x + e!?”. 


(i) Find the exact coordinates of the turning point of C. [3] 


(ii) Sketch the graph of C, stating the coordinates of any points of intersection with the 
axes. [2] 


(iii) Find the equation of the tangent to C' at the point where x = 2, giving your answer in 
the form y = ma+c, with m and c correct to 3 significant figures. [3] 








(iv) Find [ ydzx and hence find the area between C’, the x-axis and the lines x = 0 and 
x = 1, giving your answer in terms of e. [3] 
Exercise 885. (8865 N2018/5.) (Answer on p. 1527.) 


A company buys televisions from a manufacturer and stores them in a warehouse before 
selling them in it retail outlets. Space in the warehouse is limited. On every occasion 
throughout the year when the stock in the warehouse is low, the manager places an order 
for x televisions, where x is a fixed positive integer. He estimates his total cost for the year 
using a model with three components: 
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e the purchase from the manufacturer, 

e the storage cost, 

e the ordering cost. 

Next year, the manager estimates that the company will need to buy 1200 televisions. ‘The 


purchase order will be $200 each, the storage cost will be $6x for the year and the ordering 
cost will be $50 per order. 


(i) Show that the estimated total cost for the year, $C, is given by 


60 000 
C = 6x + 240000 + ——. [1] 
4 


(ii) The manager wishes to determine the value of x that will minimise C. Use differ- 
entiation to find the minimum value of C, justifying that this value is a minimum. 


5] 


A new manager takes over at the company and he wants to maximise the profits on sales 
of televisions. He uses a new model in which the total cost per year is constant and equal 
to $240 000. 


(iii) Is this a reasonable model? Justify your answer. [1] 


Market research suggests that if televisions at $300 each, then 1200 will be sold per year. 
If they are priced at $700 each, then none will be sold. Between these values, the graph of 
the number of sales against the selling price will be a straight line. 


Assume that the total cost per year is $240000, the profit per year is $P, and the selling 
price is $S. 


(iv) Show that P = 21005 - 3S? — 240000. [3] 
(v) Find the value of S that leads to the maximum profit. [2] 
Exercise 886. (8865 N2018/6.) (Answer on p. 1522.) 


In a large population, the proportion of people who have blood group A is 40%. A sample 
of 6 people is randomly chosen from the population. 

(i) Find the probability that exactly 2 of these people have blood group A. [1] 
(ii) Find the probability that at least 4 of these people do not have blood group A. _ [3] 
A886 (8865 N2018/6). XXX 
Exercise 887. (8865 N2018/7.) (Answer on p. 1523.) 


A music producer is forming a new band consisting of 3 guitarists, 1 drummer and 2 
vocalists. The members of the band will be selected from the 8 guitarists, 4 drummers and 
6 vocalists who have passed the first stage of the auditions. 


(i) How many different selections of the band can the producer make? [2] 
Three of the guitarists are from the same family. 


(ii) Given that at most one of these three can be in the band, how many different selections 
of guitarists can the producer make? [2] 


(iii) Given instead that the producer selects the guitarists at random, find the probability 
that the band will contain none of these three. [2] 
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A887 (8865 N2018/7). XXX 


Exercise 888. (8865 N2018/8.) (Answer on p. 1523.) 
Two events A and B are such that P(A) =p, P(B) = 2p, P(A|B) = 0.3 and P(Au B) =0.8. 
(i) Find the value of p. [4] 


(ii) Explain what is meant by P (A‘’n B) and find its value. 


A888 (8865 N2018/8). XXX 
Exercise 889. (8865 N2018/9.) (Answer on p. 1523.) 


At a certain college, the numbers of male and female studying Art, Business and Statistics 
are as shown in the following table. Each student studies exactly one of these three subjects. 


= 

















Art | Business | Statistics Total 
Male 54 72 34 160 
Female | 36 AO 64 140 
Total 90 112 98 300 




















A student is chosen at random from these 300 students. Let 
A be the event that the student studies Art, 
B be the event that the student studies Business, 
F be the event that the student is female, 
M be the event that the student is male. 

(i) Find P(B). 
(ii) Find P(FuB). 
(iii) Find P(Mn A). 


(iv) Determine whether the events M and A are independent. 





Se Ne 





On another occasion, 3 of these students are chosen at random, without replacement. 


(v) Find the probability that exactly 2 are studying Art, giving your answer correct to 3 


decimal places. [3] 
A889 (8865 N2018/9). XXX 
Exercise 890. (8865 N2018/10.) (Answer on p. 1524.) 


The Harriers is an athletics club with a large number of members. At the beginning of 
each season, the athletes are timed running 100 metres and running 10000 metres. For 
a random sample of 8 athletes, the times, x seconds, to run 100 metres and the times, y 
minutes, to run 10000 metres, are shown in the following table. 





Athlete | A B C D BE PB G H 
ti 12.6 | 11.9 | 10.8 | 12.2 | 14.1 | 15.6 | 14.8 | 11.1 
y 37.4 | 40.6 | 42.2 | 39.2 | 34.1 | 33.1 | 32.6 | 42.2 









































(i) Give a sketch of the scatter diagram for the data, as shown on your calculator. — [2] 


(ii) Find the product moment correlation coefficient and comment on its value in the 
context of the data. [3] 
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(iii) Find the equation of the regression line of y on x, in the form y = mz +c, giving the 
values of m and c correct to 2 decimal places. Sketch this line on your scatter diagram. 


2] 
(iv) Calculate an estimate of the time taken to run 10000 metres by an athlete who runs 
100 metres in 13.1 seconds. [1] 
(v) Give a reason why you would expect this estimate to be reliable. [1] 


A890 (8865 N2018/10). XXX 
Exercise 891. (8865 N2018/11.) (Answer on p. 1524.) 


A website states that the mean length of adults in a particular species of fish is 30cm. 
A scientist claims that the true mean length is greater than 30cm. To test this claim, a 
random sample of 100 adult fish of this species is captured from a lake. The lengths of the 
fish, «cm, are summarised by 


Si (e-30)=15 and YS \(4-30)* =82. 


(i) Find unbiased estimates of the population mean and variance. [3] 


(ii) Test at the 2.5% significance level whether the scientist’s claim is supported by the 
data. [4] 


(iii) State, with a reason, whether it is necessary to assume that the lengths of these fish 
are distributed normally for the test to be valid. [1] 








A new random sample of 100 adult fish of this species is captured from a different lake. 
The mean length of the fish in this sample is mcm and the population variance is 0.9 cm?. 
A test at the 10% significance level supports the scientist’s claim that the mean length of 
the fish is greater than 30cm. 


(iv) Find the range of possible values of m. [3] 
A891 (8865 N2018/11). XXX 
Exercise 892. (8865 N2018/12.) (Answer on p. 1525.) 


A company produces two electrical components, Type A and Type B. The masses of Type 
A components are normally distributed with mean 250 grams and standard deviation 3 
grams. The masses of Type B components are normally distributed with mean 240 grams 
and standard deviation 4 grams. The masses of the electrical components are independent 
of each other. 


(i) Find the probability that the mass of a randomly chosen Type A component is more 
than 2% above the mean mass. [2] 


(ii) Find the probability that the mass of a randomly chosen Type A component is greater 
than the mass of a randomly chosen Type B component. [3] 


(iii) Find the probability that the total mass of 6 randomly chosen Type A components 
and 3 randomly chosen ‘Type B components is between 2190 grams and 2230 grams. 
[4] 

The cost of producing a Type A component is $0.02 per gram. 

The cost of producing a Type B component is $0.03 per gram. 


(iv) Find the probability that the cost of producing 10 Type B components exceeds the 
cost of producing 10 Type A components by more than $22.50. [5] 
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A892 (8865 N2018/12). XXX 
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140.4. H1 Maths 2018 Answers 


A881 (8865 N2018/1). The equation has two distinct roots if and only if the discriminant 
is positive: 


(k — 6)? — 4 (3k) (-2) = k2 +36 — 12k + 24k = k? + 12k + 36 > 0. 


The coefficient on k? is 1 > 0, so that the graph of y = k? + 12k +36 = (k+ 6) is U-shaped 


and just touches the horizontal axis at k = —-6. Hence, 5 holds if and only k # -6. 


d a.1092 S62 
A882 (8865 N2018/2)(i) 7 on (42° +2) - _ _ | 


2_9 2 2 
(ii) [ = dz = ‘ 35/2 — 9M? da = 3245! — gl? 4.0 = 8 5/2 — Ag? + C, 
Sx 5 if 5 
A883 (8865 N2018/3)(i) Let a, f, and c be last year’s cost ($) of training one accountant, 
financial adviser, and consultant, respectively. So, the given information may be written as 








Ba+12f+8c2294100, 9a+7f+3c2270100, 3a+6f 2122700. 
Take 3x + -8x 2 +19x 2 to get 
3 (5a + 12f +8c) -8(9a+7f +3c) +19(3a+6f) =3 x 294100 — 8 x 270 100 + 19 x 122 700 
or, 94 f = 1052800 or f = 11200. 


Hence, the cost of training one financial adviser last year was $11 200. 
(ii) From 3 a+ f = (122700 - 3f) /3 = 40900 - 11 200 = 29 700. 


Hence, the answer is the greatest integer smaller than or equal to 200000/29 700, which is 
6. 


d 
A884 (8865 N2018/4)(i) Compute — =1-2e)**, 
x 
d _ 
Any stationary points are given by = = 0 or 1—2e'** = 0 or In0.5 = 1-2% or = = 
X '!x=x 


0.5-0.51n0.5 = 0.5(1+In2). 


d d 
Observe that el 0 for x < z and 7 > 0 for x > x, so that z is a strict extremum and 
z 


5 
hence, also a turning point. 
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(ii) 











d 
(iii) At r=2, y=2+e'?% =2+e%. Compute 7 
x 





a 1-2e'?? = 1-2e%. Hence, the 
requested tangent line is y — (2 + e3) = (1 - 2e3) (x - 2) or 
y = (1-2e%) (w-2)+2+e%=(1-2e3)x+2+e%-2(1-2e%) =(1-2c%)r+5e3 x 
0.900x + 0.249. 


| 
(iv) [ ydz = [ e+e" dz = 5 - + D (where D is the constant of integration). 


! a ‘(11 1 1 
The requested area is " ate! ** dr = E - =| = (; — 507)-(0 — 5°] =~ (1 aed e). 
0 0 











2 2 2 

i 60 000 
A885 (8865 N2018/5)(i) C = 6x +1200 x 200 + 0 ** 9 - 6x + 2400004 _ (The 

aE 
number of orders made is 1200/2.) 

60 000 
(ii) es =6- a a, The stationary points are given by aS = 0 or 6- —— =0or 
dx ee da !n=% a 





x = +100 (reject negative value). 


dC dC 
Observe that — > 0 for all x € [0,Z) and ie 0 for all x € (%,00). Hence, Z is a strict 
ae Ha 
global minimum. 
(iii) My answer would be, “Maybe, it depends,” but this would have gotten you zero marks. 


The “correct” answer that would’ve gotten you one mark was probably something like, “No, 
this is not a reasonable model. The total cost is not constant but varies depending on such 
factors as the number of televisions bought and orders made.” 


(iv) Let Q be the number sold. Then Q = 2100-35 (5 >0). Hence, profit is 
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P =QS - Total Cost = (2100-3) S — 240000 = 21009 — 3S? — 240000. 


dP 7 
— =(hor i$ = 300. 
ds les = 

aP = dP = a 
Observe that dg <0 for all Se [0, 9) and as > 0 for all Se LS, 00). Hence, S is a strict 


global maximum. 


dP 
(v) Compute ae 2100-65. The stationary points are given by 
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140.5. Hi Maths 2017 Questions 


Answers for Probability and Statistics questions to be written. 
Exercise 893. (8865 N2017/1.) (Answer on p. 1534.) 


Find algebraically the set of values of k for which 


a? +(k-4)x2-(k-7)>0 


for all real values of x. [4] 

Exercise 894. (8865 N2017/2.) (Answer on p. 1534.) 
ll 

i) Differentiate — with respect to 2. 2 

() aS with resp 2 
ee (222-1)? 

(ii) Find [ a dx. [4] 
Exercise 895. (8865 N2017/3.) (Answer on p. 1534.) 
A D 

| 
B C 


The diagram shows a sign in the shape of a rectangle ABC’'D with two semicircles, one 
attached to AB and one attached to CD. The length of AB is 27 cm and the total perimeter 
of the sign is 10cm. 


(i) Show that the area of the sign is x (10 -7tx) cm’. [3] 
The area of the sign is to be as large as possible. 
(ii) Use a non-calculator method to find the maximum value of this area, giving your 


answer in terms of 7. Justify that this is the maximum value. [4] 


Exercise 896. (8865 N2017/4.) (Answer on p. 1534.) 


The equation of a curve is y = ln (4x —- 5). 


(i) Sketch the curve, stating the equations of any asymptotes. [2] 


(ii) Find the equation of the tangent to the curve at the point where x = 2.5, giving your 
answer in the form ax + by = c, where a and 0 are integers and c is in exact form. [4] 


This tangent meets the z-axis at P and the y-axis at Q. 
(iii) Find the length of PQ, giving your answer correct to 3 decimal places. [4] 
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Exercise 897. (8865 N2017/5.) (Answer on p. 1535.) 


A company produces three types of sports shoes: Supers, Runners and Walkers. The man- 
ufacturing cost of a pair of Supers is twice the manufacturing cost of a pair of Walkers. The 
total manufacturing cost of 10 pairs of Runners is $50 more than the total manufacturing 
cost of 7 pairs of Supers. The total manufacturing cost of 2 pairs of Supers, 6 pairs of 
Runners and 4 pairs of Walkers is $481. 


(i) By writing down three linear equations, find the manufacturing cost of a pair of 
Runners. [5] 
An economist advises the company on how to increase their profit. 
[Profit = selling price - manufacturing cost] 
In a simple model, the economist suggests that the selling price of all sports shoes should 
be $80 a pair. 
(ii) Find the profit from the sale of 100 pairs of each of Supers, Runners and Walkers. [2] 


The company is trialling a new type of sports shoes, Extremes. The economist predicts 
that the profit $P will be related to the manufacturing cost ($x) by the equation 


P=7/zr -0.92. 


(iii) Sketch the graph of P against x, stating the coordinates of the intersections with the 
X-axis. [2| 
(iv) Use your calculator to estimate the maximum value of P. State also the value of « 
for which this maximum value occurs. [2] 


(v) Given that the manufacturing cost of a pair of Extremes is $55, find the selling price. 
[1] 

(vi) If the manufacturing cost of a pair of Extremes increased to $65, would you advise 
the company to produce Extremes? Justify your answer. [1] 


Exercise 898. (8865 N2017/6.) (Answer on p. 1536.) 


As part of an assessment of the health of people in a particular country, the heights of 
a large number of adult males have been recorded. The results show that 20% of them 
have a height less than 1.6 mand 30% of them have a height greater than 1.75m. Assuming 
that the heights of adult males are normally distributed, find the mean and variance of the 
distribution. [4] 


Exercise 899. (8865 N2017/7.) (Answer on p. 1536.) 


Printers in a busy office produce large numbers of documents each week. The ink cartridges 
used in the printers often need replacing. The probability that an ink cartridge will last for 
one week or more is 0.7, independently of all other cartridges. The cartridges are supplied 
in boxes of 8. A box is selected at random. 


(i) Find the probability that exactly 5 of the cartridges in the box will last for one week 
or more. 1] 
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(ii) Find the probability that at least half of the cartridges will last for less than one week. 
[2] 

The office has 6 boxes of ink cartridges in stock. 

(iii) Find the probability that, for at most 2 of the boxes, at least half of the cartridges 


will last less than one week. [2] 


Exercise 900. (8865 N2017/8.) (Answer on p. 1536.) 


A code consists of 6 characters. The first 3 characters of the code consist of 3 digits chosen 
from {1,2,3,4,5,6}. The last 3 characters of the code consist of 3 letters chosen from 
{A, B, C, D, E, F, G, H}. 


(i) How many codes can be formed if repetitions are not allowed? [1] 
Now suppose that repetitions are allowed. 


(ii) Find the probability that a code chosen at random 


(a) contains the digit 5 exactly once and the letter H exactly twice, [3] 
(b) has 2 as its first character or H as its sixth character, but not both. [3] 
Exercise 901. (8865 N2017/9.) (Answer on p. 1536.) 


A computer manufacturing company employs a large number of workers on the production 
line. ‘The owner encourages his employees to stay with the company by giving them increases 
in their earnings to reward their length of service. The weekly earnings, y hundred dollars, 
of a random sample of 8 employees from the production line who have been with the 
company for x years are given in the following table. 





Employee} A| B|C | D|E|F|GI|A 
x 14 | 16) 11 | 24 | 36 | 28 | 22 | 40 
y 4.9 | 5.5 | 5.2 | 6.5 | 9.7 | 7.5 | 6.2 | 9.8 









































(i) Give a sketch of the scatter diagram of the data. [2] 
(ii) Find the product moment correlation coefficient and comment on its value in the 
context of the data. [2] 


(iii) Find the equation of the regression line of y on x in the form y = ax +b, giving the 
values of a and b correct to 3 significant figures. Sketch this line on your scatter 
diagram. [2] 


Sue has been employed by the company for 2 years and she earns 190 dollars per week. 


(iv) Use the equation of your regression line to calculate an estimate of the weekly earnings 
for employees on the production line who have been with the company for 2 years./1] 


Sue concludes that she should be earning more. 


(v) Give two reasons why her conclusion might not be justified. [2] 


Exercise 902. (8865 N2017/10.) (Answer on p. 1536.) 
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Bottles of a certain type of juice are said to contain 0.6 litres. A random sample of 50 
bottles is taken and the volumes of juice (in litres) in the bottles are measured. The 
unbiased estimates for the population mean and variance are 0.568 and 0.01528. The 
population mean volume is denoted by p. The null hypothesis = 0.6 is to be tested 
against the alternative hypothesis p < 0.6. 


(i) Find the p-value of the test and state the meaning of this p-value in this context. [2] 


(ii) State, giving a reason, whether it is necessary to assume a normal distribution for this 
test to be valid. [1] 


A second random sample of bottles of this juice is taken. The sample size is 110 and the 
volumes, y, are summarised by 


Yy=70.4, > y? = 49.42. 


(iii) Find unbiased estimates for the population mean and variance using this second sam- 


ple. [3] 
(iv) Using this second sample, test at the 5% significance level, whether there is evidence 
that the population mean volume of juice differs from 0.6 litres. [4] 
Exercise 903. (8865 N2017/11.) (Answer on p. 1536.) 
Marketing Economics 
Finance 


Marketing, Economics and Finance are three subjects offered at a business college. The 
numbers of students studying different combinations of these subjects are shown in the 
above Venn diagram. Every student studies at least one of these subjects. The number 
who study all three subjects is x. One of the students is chosen at random. 

e M is the event that the student studies Marketing. 

e« F is the event that the student studies Economics. 


e Fis the event that the student studies Finance. 


(i) Write down expressions for P (WV) and P(£) in terms of z. [2] 
(ii) Given that events M and F are independent, find the value of x. [3] 
(iii) Find P(Mu F’). [1] 
(iv) Explain, in the context of this question, what is meant by P (F'|/), and find its value. 

3] 
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Three students are chosen at random, without replacement. 


(v) Find the probability that each studies exactly two of these three subjects. [3] 


Exercise 904. (8865 N2017/12.) (Answer on p. 1536.) 


There are bus and train services between the towns of Ayton and Beeton. The journey 
times, in minutes, by bus and by train have independent normal distributions. The means 
and standard deviations of these distributions are shown in the following table. 





Mean | Standard deviation 
Bus 45 
Train 42 3 























(i) Find the probability that a randomly chosen bus journey takes less than 48 minutes. 
[1] 

(ii) Find the probability that two randomly chosen bus journeys each take more than 48 
minutes. [2] 


(iii) The probability that the total time for two randomly chosen bus journeys is more 
than 96 minutes is denoted by p. Without calculating its value, explain why p will be 
greater than your answer to part (ii). [1] 


Lan lives in Ayton and works in Beeton. Three days a week he travels from home to work 
by bus and two days a week he travels from home to work by train. 


(iv) Find the probability that for 3 randomly chosen bus journeys and 2 randomly chosen 
train journeys, Lan’s total journey time is more than 210 minutes. [4] 


Journeys are charged by the time taken. For bus journeys the charge is $0.12 per minute 
and for train journeys the charge is $0.15 per minute. 


Let B represent the cost of one journey from Ayton to Beeton by bus. 


Let T' represent the cost of one journey from Ayton to Beeton by train. 


(v) Find P(3B -2T <3) and explain, in the context of this question, what your answer 
represents. [5] 
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140.6. H1 Maths 2017 Answers 


A893 (8865 N2017/1). The coefficient on x? is 1 > 0, so the given quadratic is a U-shaped 
curve. 


Hence, the given inequality holds for all real x if and only if the determinant is negative: 
0>(k-4)° -4(1) [-(k-7)] =k? - 4k - 12 = (k-6) (k +2). 


This latter inequality in turn holds if and only if k € (-2,6). 





d 1 5 
A894 (8865 N2017/2)(i) — ee nee, ® 
( ; )(i) dz \/5x —2 2(5x -2)°/ 
2x2 —1)? Art — 4x2 +1 1 A 1 
Gy ee oe ee as a ® 
x? x? x2 3 x 


A895 (8865 N2017/3)(i) The total perimeter of the sign is 10 = 2;|AD|+7(2z). Hence, 
|AD| = 5-72. 


The area of the rectangle ABCD is 2x (5 — mx) = 10x — 272”. 
The area of the two semicircles is 7tx?. 


Hence, the area of the sign is y = 10a — 27x? + mx? = 10x — mx? = x (10-712). 





_=0or 10-2nr =0 


THX 


d d 
(ii) Compute 7 = 10-272. The stationary points are given by 7 
as 2 
5 
or T=-—. 
T 


Observe that a > 0 for all x € [0,z) and a > 0 for all x € (%,00). Hence, Z is a strict 
7 2 


global maximum. 


Asodi’sh6# N2017/4) 


y = In (4x -5) 








A vertical asymptote is 
4x -5 =O or v=5/4. 
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_ 4 
a25 40-5 


d 
(ii) At x = 2.5, y=In5. Compute at 


4 
7 = —. Hence, the requested tangent 
ae 


r=2.5 O 











line is 


a 


4A 
y—-In5==(#-2.5) or y— px = In5 2. 


5 
5 
(iii) Plug x = 0 and y =0 into + to get, respectively, y =In5-2 and x =2.5- i In5. So, 
2 4) : 
IPO) =4/ (ind -2)?4 (2.5 -" In} ws 0.625. 


A897 (8865 N2017/5)(i) Let s, r, and w be the manufacturing cost of a pair of Supers, 
Runners, and Walkers, respectively. The given information may be written as 


s+2w, 10r27s+50, 2s+6r+4w 248i. 


Plug + into 2 to get 10r = 14w + 50 or Sr - 7w = 95. 
Plug + into 2 to get 6r + 8w 2 481. 

Now take 8x 4+7x 2 to get 82r = 3567 or r £ 43.5. 

The manufacturing cost of a pair of Runners is $43.50. % 
(ii) Plug 2 into 2 to get 435 = 7s +50 or 5 £55. 

Plug = into + to get 55 = 2w or w = 27.5. 

So, the answer is 100[3 x $80 -$(s+r+w)] = $11 400. 

(iii) 7/c -0.92 =0 — > 7-0.9/r=0 — > 70/9=/ct — r= (70/9). 


PA 
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(iv) Pw 136 ates 15.1, %& 


(v) P(55) = 755 - 0.9(55) » 2.41. Hence, using the economist’s model, the selling price 
is approximately $55 + $2.41 = $55.71. 


(vi) No, 65 > (70/9)° and profit would be negative (according to the economist’s model). 
A898 (8865 N2017/6). XXX 

A899 (8865 N2017/7). XXX 

A900 (8865 N2017/8). XXX 

A901 (8865 N2017/9). XXX 

A902 (8865 N2017/10). XXX 

A903 (8865 N2017/11). XXX 

A904 (8865 N2017/12). XXX 
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140.7. Hi Maths 2016 Questions 


Answers for Probability and Statistics questions to be written. 


Exercise 905. (8864 N2016/1.) (Answer on p. 1541.) 
Differentiate 

(i) 2In ee [2] 

Z 

) ape 2 


Exercise 906. (8864 N2016/2.) (Answer on p. 1541.) Do not use a calculator in 
answering this question. 


Use the substitution u = e” to solve the inequality 2c” > 9 — 3e”, giving your answer in 








logarithmic form. [5] 
Exercise 907. (8864 N2016/3.) (Answer on p. 1541.) 
The curve C has equation y =e” - 2”. 
(i) Sketch the graph of C. [1] 
(ii) Find the numerical value of the gradient of C at the point where x = 0.5. [1] 
(iii) Find the equation of the normal to C' at the point where x = 0.5. Give your answer 
in the form y = ma +c, with m and c correct to 3 significant figures. [3] 
k 
(iv) Find [ (e" —a*) dx, where k > 0. Give your answer in terms of k. [3] 
0 
Exercise 908. (8864 N2016/4.) (Answer on p. 1541.) 
The curve C' has equation y = 1+ 6a - 32? - 42°. 
d 
(i) Find = Hence find the coordinates of the stationary points on the curve. [4] 
Z 
(ii) Use a non-calculator method to determine the nature of each of the stationary points. 
2] 
(iii) Sketch the graph of C,, stating the coordinates of any points where the curve crosses 
the a-axis. [2] 


(iv) Find the numerical value of the area under the curve C' between x = 0.5 and x = 1.[1] 


Exercise 909. (8864 N2016/5.) (Answer on p. 1542.) 
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A 


The diagram shows a V-shape which is formed by removing the equilateral triangle DEF, 
in which DE = ycm, from an equilateral triangle ABC’, in which AB = 27cm. The points 
E and F are on BC such that BE = FC. The area of the V-shape ABEDFCA is 2V3 cm”. 


(i) Show that 4x? - y? = 8. [3] 
(ii) Given that the perimeter of ABE DFCA is 10cm, find the values of x and y. [6] 
Exercise 910. (8864 N2016/6.) (Answer on p. 1543.) 


A music store manager intend to carry out a survey to investigate how much money students 
at a local college spend on music each year. There are 1260 male students and 1140 female 
students at the college. 


(i) Describe how to obtain a stratified sample of 80 students to take part in the survey. 


[2] 
(ii) State, in this context, one advantage that stratified sampling has compared to simple 
random sampling. [1] 


The amount of money, in hundreds of dollars, spent by a student in a year is X. For a 
simple random sample of 80 students, it is found that Ss G=312 and > ag? = 1328. 


(iii) Calculate unbiased estimates for the population mean and variance of X. [3] 


Exercise 911. (8864 N2016/7.) (Answer on p. 1543.) 
The events A and B are such that P(A) = 0.6, P(B) =0.25 and P(An B) = 0.05. 


(i) Draw a Venn diagram to represent this situation, showing the probability in each of 
the four regions. [3] 


(ii) Find the probability that 


(a) at least one of A and B occurs, [1] 
(b) exactly one of A and B occurs. [1] 
(iii) Find P(A|B’). [2] 
Exercise 912. (8864 N2016/8.) (Answer on p. 1543.) 
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Two boxes, A and B, contain balls of different colours. Box A contains 5 blue balls, 3 red 
balls and 2 green balls. Box B contains 4 blue balls and 2 green balls. One of the boxes is 
selected at random. Two balls are then chosen at random, without replacement, from the 
selected box. Find the probability that 


(i) both balls are red, [2] 


(ii) the two balls are of different colours, [4] 
(iii) both balls are red, given that they are the same colour. [3] 
Exercise 913. (8864 N2016/9.) (Answer on p. 1543.) 


Watch batteries are supplied to a shop in packs of 8. The probability that any randomly 
chosen battery has a lifetime of less than two years is 0.6, independently of all other bat- 
teries. 


(i) For a single pack of batteries, find the probability that 


(a) all of the batteries have a lifetime of less than two years, [1] 
(b) at least half of the batteries have a lifetime of less than two years. [2] 


(ii) For any 4 packs of batteries, find the probability that, for no more than 2 of the packs, 
at least half of the batteries have a lifetime of less than two years. [2] 


(iii) A customer buys 10 packs of these batteries. Use a suitable approximation to estimate 
the probability that at least 40 of these batteries have a lifetime of less than two years. 
State the mean and variance of the distribution that you use. [4] 


Exercise 914. (8864 N2016/10.) (Answer on p. 1543.) 


A scientist claims that the mean top speed of cheetahs, in km/h, is 95. The top speed of 
each cheetah in a random sample of 40 cheetahs is recorded and the mean is found to be 
96.3. It is known that the top speeds of cheetahs are normally distributed with standard 
deviation 4.1. 


(i) Test the scientist’s claim at the 5% significance level. [4] 


The scientist now decides to test the claim that the mean top speed of cheetahs, in km/h, 
is greater than 95. He takes a second random sample of 40 cheetahs and records their top 
speeds. Using a 5% significance level, he finds that the mean top speed of cheetahs is not 
greater than 95. 


(ii) Find the set of values within which the mean top speed of this second sample must 
lie. [5] 
Exercise 915. (8864 N2016/11.) (Answer on p. 1543.) 


Members of an athletics club are training for a ‘Swim-Run’ charity event, in which each 
athlete has to complete a 1000-metre swim followed by a 1000-metre run. The times, x 
minutes, to swim 1000 metres and the times, y minutes, to run 1000 metres, for a random 
sample of 8 members of the club, are given in the following table. 
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Athlete | A B C D B PB G H 
x 15.0 | 16.1 | 18.2 | 16.3 | 17.2 | 18.1 | 15.6 | 16.3 
y 2.0 | 2.7 | 3.3 | 3.0 | 3.5 | 3.4 | 2.7 | 2.8 









































(i) Give a sketch of the scatter diagram for the data, as shown on your calculator. — [2] 
(ii) Find the product moment correlation coefficient. [1] 


(iii) Find the equation of the regression line of y on x in the form y = mz +c, giving the 
values of m and ¢ correct to 3 significant figures. [1] 


(iv) Calculate an estimate of the time taken to run 1000 metres by an athletes who swims 
1000 metres in 16.9 minutes. State two reasons why you would expect this to be a 
reliable estimate. [3] 


The time taken by a new member of the club to swim 1000 metres and run 1000 metres 
are 18.4 minutes and 2.6 minutes respectively. 


(v) Calculate the new product moment correlation coefficient when the times for the new 
member are included. [1] 


(vi) State, with a reason, which of your answers to parts (ii) and (v) is more likely to 
represent the correlation between swimming and running times for all members of the 


club. [1] 


Exercise 916. (8864 N2016/12.) (Answer on p. 1543.) 


Shortbread biscuits of a certain brand are sold in boxes containing 12 biscuits. The masses, 
in grams, of the individual biscuits and of the empty boxes have independent normal dis- 
tributions with means and standard deviations as shown in the following table. 





Mean | Standard deviation 
Individual biscuit 20 dell 
Empty box 5 0.8 























(i) Find the probability that the mass of an individual biscuit is less than 19 grams. [2] 
(ii) Find the probability that the total mass of a box containing 12 biscuits is more than 
248 grams. State the mean and variance of the distribution that you use. [4] 


The cost of producing biscuits if 0.6 cents per gram and the cost of producing empty boxes 
is 0.2 cents per gram. 


(iii) Find the probability that the total cost of producing a box containing 12 biscuits is 
between 142 cents and 149 cents. State the mean and variance of the distribution that 
you use. [5] 
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140.8. H1 Maths 2016 Answers 


62 _ 122 
32244 32244" 





A905 (8864 N2016/1)(i) “2m (3g? +4)=2 ® 


Gy 41. ae: 
dro(=acy° O1=3e) C=3z) 


A906 (8864 N2016/2). 2c?” >9-3e7 <>» 2u?>9-3u <> 0 < 2u?4+3u-9 = 
(2u-3)(uw+3) <> we (-~«,-3]U[1.5,00) <— > e*€[1.5,00) — > re[In1.5, 0). 


(i) yA 


® 











A907 (8864 N2016/3) 


=e) 1 #=1,61, 
«=0.5 


+ dy 7 dy 
ia em) Lo ad 
(ii) Compute ao x. So, a 





(iii) At 2 = 0.5, y=e °° — 0.5%. So, the equation of the normal is approximately 


y - (e-? - 0.57) = (c-0.5) or  y=0.622r + 0.453. 


—] 
—e—9.5 or | 


k 5 3 


(iv) ie (e* - =} dx = J-e - =|, = (-<" ~ =) ~(-1-0)=1-e*- = 


d 
A908 (8864 N2016/4) (i) 7 = 6 - 6x - 1227. 
x 








1 
aie 0 = 6-62-1277 = 277+ Z-1 = (22-1) (4+1) — t=-I1,5. The 
X 'x=2 


corresponding y-coordinates are 





146-1) =3Cly-4Ciy==4 and 1+6(2)-3(2) -a(2) =o 
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i | 
Hence, the two stationary points are (-1,-4) and (=. =). 
(ii) ra 6-62 - 122° is n-shaped. Hence, -1 is a strict minimum point, while 5 isa strict 
a 


maximum point. 


= + _ 
=i 1 
2 
(iii) A 





y =1+62 - 327 - 423 





(-0.157, 0) 











ae 


1 3 1 1 15 
)- ® 


1 
(iv) f (1+ 80-32? - dn dz = [2 +30? 2° -a4],,-2-(543-5-— a7 


A909 (8864 N2016/5)(i) The heights of the triangles ABC and DEF are, respectively, 
\V (20)° - 2? = V3x and \/y? - (y/2)° = By, 

1 1 
Hence, their areas are, respectively, . (V3x) (27) = 32? and 5 (S0) Cae By , 


3 
Thus, their difference is V 32? - V3.2 = 2\/3 or 42? - y" £8. 


(ii) The perimeter is 10 = |AB|+|AC|+|BE]| +|F'C|+|DE|+|DF| = 27+2x+2x - yt+y+y = 6r+y. 
Rearrange to get y 2 10-6z and plug 2 into = to get 


4a? — (10 - 6x)" =8 or 0 = -3227 — 108 + 120xr = 8x? - 30x + 27 = (42 - 9) (2x - 3) 
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NW | co 


or i 
= 


9 3 £ 
Plug in # = is into 2 to get the corresponding values for y: y = (reject) and y = 1. 


Hence, x = 3/2 and y= 1. 

A910 (8864 N2016/6). XXX 
A911 (8864 N2016/7). XXX 
A912 (8864 N2016/8). XXX 
A913 (8864 N2016/9). XXX 
A914 (8864 N2016/10). XXX 
A915 (8864 N2016/11). XXX 
A916 (8864 N2016/12). XXX 
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Part VIII. 
Appendices 


The main text above has not always been complete, precise, or rigorous. In these appen- 
dices, I go some way towards filling in these gaps. In particular, I give formal definitions, 
statements of claims, and proofs of claims. 


Where there is a trade-off between generality of a result and the simplicity of its proof, I 
usually favour the latter. 


I’ve written these appendices mostly for the sake of completeness (and also to persuade 
myself that I probably haven’t screwed up too much). Unlike the main text, I would not 
recommend that the student (or anyone) read these appendices from top to bottom. They 
are written rather tersely with few (and often zero) examples. 
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141. Appendices for Part 0. A Few Basics 


141.1. Division 


Definition 262. (Euclidean division) Let x and d be non-zero integers. If x,d < 0, 
then let q be the smallest integer that satisfies dq < x; otherwise, let q be the largest 
integer that satisfies dg < x. Then let r=x-dq>0. 


In the equation x = dq+r, we call x the dividend, d the divisor, q the quotient, and r the 
remainder. 


And if r =0, then we say that x divided by d leaves no remainder and that d is a factor 
of (or divides) x. 


Theorem 50. (The Euclidean Division Theorem) Let x and d be non-zero integers. 
Then there exists a unique pair of integers (q,r) such that 





edger and Wera a. 


Proof. Let q and r be as constructed in Definition 262. This proves existence. 


For uniqueness, we consider only the case where x, d > 0 (the other cases are similarly dealt 
with). 


Let # and p be integers that satisfy 7 = d@+p and 0< p<d. Then 
O=x-a2=(dq+r)-(d6+p)=d(q-0)+r-p. 


Rearranging, r- p=d(0-q). 
If 0 =q, then r-p=0 and p=r, so that (q,r) is indeed unique. 


If 0 <q, then 0-q <-1, so that r-p<-dandr< p-d<0, contradicting our assumption 
that O<r. 


If 8 >q, then @-q>1,s0 that r-p>dandr>p+d2d, contradicting our assumption that 
r< dd, 
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141.2. Logic 


Fact 1. Suppose P and Q are statements. Then 


NOT-(PANDQ) << #NOT-PORNOT-Q. 





Proof. We want to show that NOT-(P AND Q) and (NOT-P OR NOT-Q) always have 
the same truth value. That is, we want to show that 

1. Whenever NOT- (P AND Q) is true, (NOT-P OR NOT-Q) is also true; and 

2. Whenever NOT- (P AND Q) is false, (NOT-P OR NOT-Q) is also false. 


We now do so: 


1. Suppose NOT- (P AND Q) is true. Then P AND Q is false. So, either P or Q is false. 
So, either NOT-P or NOT-@Q is true. So, NOT-—P OR NOT-Q is true. 

2. Next, suppose NOT-(P AND Q) is false. Then P AND Q is true. So, both P and Q 
are true. So, both NOT-P and NOT-@ are false. So, NOT-P OR NOT-Q is false. 


Since NOT- (P AND Q) and as (NOT-P OR NOT-Q) always have the same truth value, 
they are equivalent. 














Letting “1” = true and “0” = false, we can also construct this truth table where there are 
four possible cases depending on whether P and Q are true or false: 














P|Q|NOT-P | NOT-Q | PANDQ | NOT-(P AND Q) | NOT-P OR NOT-Q 
1/1 0 0 1 0 0 
1] 0 0 1 0 1 1 
0} 1 1 0 0 1 1 
0} 0 1 1 0 1 1 




















This truth table is a compact way to prove Fact 1. From the table, we can quickly see that 
in each of the four possible cases, both 


NOT- (P AND Q) and (NOT-P OR NOT-Q) 


always have the same truth value. Thus, the two statements are equivalent. 


By the way, we could also have defined equivalence using truth tables—two statements are 
equivalent if their truth “columns” are identical. We’d have to take care though to ensure 
that our truth table lists all possibilities. 
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Fact 2. Suppose P and Q are statements. Then 


NOT-(PORQ) <— NOT-PANDNOT-Q. 





Proof. Construct this truth table: 














P|Q|NOT-P | NOT-Q| PORQ | NOT-(PORQ) | NOT-P AND NOT-Q 
1/1 0 0 1 0 0 
1/0 0 1 1 0 0 
0] 1 1 0 1 0 0 
0} 0 1 1 0 1 1 




















Since NOT- (P OR Q) and NOT-P AND NOT-@ always have the same truth value, they 
are equivalent. 














Bact: 6, CP >= QO AND) SP) = (P=). 


Proof. We show P <> Q always has the same truth value as P => QANDQ => P: 
1. Suppose P = > Q is true. Then 


(a) Whenever P is true, Q is also true—so, P => Q is always true. 
(b) Whenever P is false, Q is also false—so, Q => P is always true. 


Altogether, P => QANDQ ==> P is true. 
2. Suppose P <=> (Q is false. Then there exists some instance where either 


(a) P is true while Q is false, so that P => Q is false; or 
(b) P is false while Q is true, so that Q ==> P is false. 


Altogether, P => QANDQ ==> P is false. 
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141.3. The Four Categorical Statements 


Before Ch. 3.13, we were discussing only propositional logic. 


Ch. 3.13 began discussing predicate (or first-order) logic. The terms all and some 
correspond to the for every (V) and there is at least one (4) quantifiers: 


Definition 263. The All Yes statement, “All S are P,” means, “For every z, if x is S, 
then x is P.” 


The All No statement, “No S is P,” (or “All S is NOT-P”) means, “For every 2, if x is 
S, then x is NOT-P.” 


The Some Yes statement, “Some S is P,” means, “There is at least one x such that x is 
S AND z is P” 


The Some No statement, “Some S is NOT-P,” means, “There is at least one x such that 
xis S AND z is NOT-P.” 





Note that thus defined, the All Yes and All No statements inherit from the implication the 
property that if the hypothesis (in the implication) is false, then the statement as a whole 
is automatically true: 


Example 1565. The All Yes statement, “All unicorns are green,” means, 
“For every x, if x is a unicorn, then x is green.” 
Since unicorns don’t exist, the statement “x is a unicorn” is always false. So, the im- 


plication “if 2 is a unicorn, then x is green” is always true. Hence, the above All Yes 
statement is also always true. 


Example 1566. The All No statement, “No unicorns is green,” means, 


“For every x, if x is a unicorn, then x is not green.” 


Since unicorns don’t exist, the statement “x is a unicorn” is always false. So, the impli- 
cation “if x is a unicorn, then x is not green” is always true. Hence, the above All No 
statement is also always true. 





Note that we are following modern logic. Which contrasts with traditional, pre-19th- 
century, Aristotelian logic, where the All Yes and All No statements in the last two examples 
would instead have been considered false. See e.g. Parsons (2017), “The Traditional Square 
of Opposition”. 
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To negate the categorical statements, we’ll use this axiom:°°° 


Axiom 1. The negation of “For every x, P” is “There is at least one x such that 
NOT-P.” 


The negation of “For every x, NOT-P” ts “There is at least one x such that P.” 


Fact 7. (a) The negation of an All Yes statement is the corresponding Some No state- 
ment. 





(b) The negation of an All No statement is the corresponding Some Yes statement. 


Proof. (a) By Definition 263, the All Yes statement, “All S are P,” means, “For every , 
xis S = > «xis P”. By Axiom 1 and Fact 3, this statement’s negation is, “There is at 
least one x such that x2 is S AND x is NOT-P,” which by Definition 263 is equivalent to 
the Some No statement, “Some S is NOT-P.” 


(b) Similar. 

















°°8This could also be a result derived from more basic definitions and axioms. But to keep things short 
and simple, Ill simply assert this as an axiom. 
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141.4. Sets 


The set is what mathematicians call a primitive notion. That is, sets are left undefined 
(though they do have to satisfy certain axioms). But having summoned out of the void this 
single undefined object called the set, we can then go on to define every other mathematical 
object based on the set. The set is thus the single Lego block out of which all of mathematics 
is built. Every mathematical object can be defined solely in terms of sets. The idea is to 
have just one undefined object, then define everything else based on this single undefined 
object. 


And so for example, in conventional set theory, we first define the number 0 to be the 
empty set. We then define the number 1 as the set that contains 0; the number 2 as the 
set that contains 0 and 1; the number 3 as the set that contains 0, 1, and 2; etc. 


Ost} =. 

feq0; =a = {2}. 

2=40,1) Sats ={2,{@}}. 
3={0,1,2F= {tb tb td SSE = {2,12} bbe. 


As another example, perhaps surprisingly the function is also defined to be a set. We’ll 
see this shortly in Ch. 142.10 below. 


In the main text (Ch. 4.9), we made this claim: 


Proposition 25. A real number is rational <= > It eventually recurs. 


Proof. We'll formally prove this only below (see p. 1552). For now, we'll give only this 
proof sketch: 


(<=) Consider for example x = 8.344571 93 = 8.344571 935 719357 193... where the digits 
97193 eventually recur. Now consider 99999 000x. We have 


99 999 000z = 100 000 000z — 1 000z 
= 834457 193.571 935 719 3--- — 8 344.571 935 719 357193... 
= 834457 193 — 8344 = 834 448 849. 


We’ve just shown that x = 834448 849/99 999 000—-= is the ratio of two integers and is thus 
rational. 


( == ) Consider for example 9/7 = 1.285 714 = 1.285 714285 714.... 
Long division (the remainder at each step is highlighted in blue): 
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Line 1 L2é& & 7 1 4 
2 7|9 Explanation 
3 (ij Leff] 7 
4 2-0 9-7=2 
5 1 4 ax7=14 
6 6 0 20-14=6 
7 ao 8x 7=56 
8 4 0 60-56 =4 
9 Be Oxf Sau 
10 5 0 40-35=5 
11 4 9 42 7=49 
12 1 0 50-49 =1 
13 — Llx{=7 
14 3 0 10-7=3 
15 28 ix 7 =26 
16 2 30 — 28 = 2 


At line 16, the division isn’t complete. But observe that the remainder at line 16 is the 
same as in line 4—namely, 2. And so clearly, the process will simply repeat. Lines 16 
through 28 of the long division will look exactly the same as lines 4 through 16. Lines 28 
through 40 will again look the same. Etc. The digits 2857714 will thus recur. 


The key insight here is that there are only finitely many possible remainder values (namely, 
0,1,...,9) And thus, the remainder must eventually repeat (or hit zero). This completes 
the proof sketch. 














To formally prove Proposition 25, let’s first work towards a formal definition of what the 
term eventually recurs means. 


Definition 264. Let x ¢ R. We say that x has a decimal representation if there exist 
unique Xp € Z and x; € {0,1,2,...,9} for each i € Z* such that 


co 
L=2%9+ ee ine 
i=l 


Theorem 51. Every real number has a decimal representation. 


Proof. Omitted. 

















Definition 265. Let x be a real number with decimal representation x = %.11%9%3.... 
We say that x eventually recurs if there exist s,t ¢ Z* such that for every née Z", 


Ls = Lsint 





We are now ready to formally prove Proposition 25 (reproduced for convenience): 
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Proposition 25. A real number is rational <> It eventually recurs. 


Proof. Let x be a real number with decimal representation x = 2 + » x, 107. 
i=l 


(<—) Suppose x eventually recurs. Then there exist s,t ¢ Z* such that for every ne€ Z", 


Ls = Lsint- 
s-l lore) ; t-1 
And hence, c=20+ >) 2,10" + (> ww") (>: 10). 
=| i=l i=0 
s-l : fore) : t-1 : 
Now, observe that 10° *z = }* x10°'* + 10°? (5: 10") (5: 10.) 
i=0 i=] i=0 
s-l lore) ; t-1 
So, 10 ey a0 4 10 1 (5: ww") (5: 10) 
i=0 i=l i=0 


sl 0° t-1 t-1 
=10" > eis 107" (5: 0") (5: 10". + 10°" (5: 10) 


i=0 i=1 i=1 i=0 


s-l t-1 
So, 10° '*¥2 - 10%! 2 = (10' — 1) > e100 (5: 10.) = A, which is an integer. 
i=0 i=0 


Hence, x = A/ lo - 10°) is the ratio of two integers and thus rational. ¥ 


( ==> ) Let x = a/b, where a and 0 are positive integers. (The other cases are dealt with 
similarly. ) 


We'll make use of the Euclidean division algorithm (Definition 1): 
Let go be the largest integer such that a = q9b+7ro and ro is a non-negative integer. 


For each 7 € Z", let g; be the largest integer such that 10r;_, = q;b+r; and r; is a non-negative 
integer. Hence, 


a = = ~ 
z= = q+ 10 ‘gi + 1077p + 10-F gg + +++ = Go.G1 gags - -- 


Note that since r; € {0,1,2,...b-1}, there must exist s,t€ Z* rs =1s+t. 


And since r, = 154;, by definition of q;, 


Qs+t+1 = Us+1) Us+t+2 = Us+2) -++> Ys+(n41)t = Wt- 
We also have Ts =Vsat = Ts42t = Ts+38ts = +--+ 
+ 
So for every n€ Z", Qstnt+1 =Qs+1, Ystnt+2 = Qst2 +++) Wst(n+1)t = U- 














Hence, x eventually recurs. 
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To prove De Morgan’s Laws in set theory, we can actually reuse our earlier proofs (from 
logic). But here as an exercise, let’s prove these two laws using the set theory notation 
we've just learnt. 


Fact 9. Suppose P and Q are sets. Then 


(a) CP iO) = i)” and. (by Ce UI) =P .@" 

















Proof. (a) re(PnQ)’ (b) xe(PuQ)’ 
—= «rctPnQ — rtPuQ 
— rtPORrEQ — «rctPANDréeQ 
<=> «reP’ORz.eQ <— «r¢P'ANDzeQ 
—> rePud’. —= rePnQ. 


1553, Contents www.EconsPhDTutor.com 


142. Appendices for Part I. Functions and Graphs 


142.1. Ordered Pairs and Ordered n-tuples 


As mentioned on p. 1550, every mathematical object can be defined in terms of sets. The 
ordered pair is directly defined as a set. 


Definition 266. Let x and y be objects. The ordered pair (x,y) is defined by 


(vy) = teh te, yt 





There’s actually more than one way we can define an ordered pair.°°? What’s important 
is that our definition correctly captures the idea that (a,b) = (2,y) <> a= and b=y. 
This is accomplished by the above definition: 


Fact 24. Suppose a, b, x, andy are objects; and (a,b) and (x,y) are ordered pairs. Then 


(a,b) = (ay) — a=2 AND. b= 4%. 





Proof. (<= ) la=a AND b=y, then (a,b) ={{a},{a,0}} = {{z} {a g}} =(ay): 

(==> ) We'll prove the contrapositive: If a# 72 OR b#y, then (a,b) # (2, y). 

Suppose a # x. Then {a} # {z}. 

Case 1. If {a} # {x,y}, then {a} is an element of (a,b) but not of (x,y). So, (a,b) # (a, y). 


Case 2. If {a} = {x,y}, then x # y (otherwise {a} = {x,y} = {x}, a contradiction). So 
(a,b) = {{a}, {a, b}} = {{2x, y}, {x, y, b}} does not contain {x} and thus (a,b) # (x,y). 


Now suppose a = x and b#y. Then (a,b) = {{a},{a,b}} = {{x}, {x,b}} does not contain 
{x,y} and therefore (a,b) # (x,y). 














We then define the ordered triple (z,y,z) to be the ordered pair ((2,y),z). Similarly, 
we define the ordered quadruple (17, 22,273,274) to be the ordered pair ((x1, 22,23) , 4). 
Etc. More generally, 


Definition 267. Let n > 3 and 2, %, ..., ®_ be objects. The ordered n-tuple 
(%1,%2,...,%y) is defined recursively to be the ordered pair whose first coordinate is 
the ordered (n - 1)-tuple (11, %2,...,2@n-1) and second coordinate is x,,. That is, 


Ci ta ae oe) = Gainer a0 nea) on) 8 


Given the ordered n-tuple (x1, %2,...,2n), we call each x; its ith coordinate. 


We can easily prove the analogue of Fact 24 for ordered n-tuples: 








°°°The above definition by Kuratowski (1921, p. 171) is today the one that’s usually used. 
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Proof. (<==) Trivial. 


(==> ) Let n>3. By Definition 267, 


(i, 3; sae la) = (41, Y2, tee a) — CGameury . tnt) yin) = ((y1, Ye; oe Una) is 
By Fact 24, this last equation is true if and only if 
(2, cots) = is. Se) AND Pp Ss 


Applying the above recursively, we find that for all 7 > 3, x; = y;. We also find that 


(21,22) = (41, y2), whereupon Fact 24 tells us that x, = y, and x2 = yo. 
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142.2. The Cartesian Plane 


Definition 268. The cartesian product of two sets A and B, denoted A x B, is defined 
by 


Ax B={(a,y):ceA,ye B}. 


Definition 269. Given a set A, we will also write the cartesian product A x A more 
simply as A?. 


Definition 270. Given a set A and n > 3, we define A” recursively to be the cartesian 
product A”? and A. That is, 


ACA Ay 


Definition 271. An n-dimensional space is a subset of R”. 


Definition 272. Given 71,22,...,2n € R, the ordered n-tuple (21, %2,...,%p) is called a 
point in n-dimensional space. 








We reproduce from Ch. 7.2 this definition of the cartesian plane: 


Definition 36. The cartesian plane is the set of all points: 


1(2,y) = 2,y eR}. 





In light of the preceding definitions, we could also have defined the cartesian plane more 
simply as this set: 


R? =RxR= {(z,y):2,y€R} = {(2,y) € R’}. 
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142.3. Lines 
This subchapter restricts attention to the cartesian plane R?. 


Fact 25. Suppose ax + by+c=0 describes a line. Then 


(a) The line is horizontal —> a=0. 
(bb) Thetine is) verical ~<——— 6-0). 





Proof. (a) Let (71,41) and (2, y2) be points on the line with x # x2. Since both are points 
on the line, we have 


ax; + by; +c =0 and ax + byo+c=0. 
( <= ) Suppose a = 0 (so that b #0). Then by; +c =0 and by2+c=0, so that y, = yo = —c/b. 
Hence, the line is horizontal. J 
( == ) Suppose a # 0. 


If b =0, then the line also contains the point (21, y,; +1), so that it contains two points with 
different y-coordinates and hence is not horizontal. 


ax, +C avg +C 
b b 


But since 7; # Z2 and a #0, it follows that y, # yo, so that again the line is not horizontal. 


Jv 
The proof of (b) is similar and thus omitted. 


Fact 250. Two distinct points can be contained by at most one line. 


Proof. Suppose the lines az + by+c=0 and dx+ey+ f =0 contain the distinct points (p,q) 
and (r,s). Then 


If b #0, then w= 








and Yy2 = 














ap +bq+c=0, ar +bs+c20, dp +eq+ f 20, and dr +es+f <0. 


From = — 2, we get a(p-r) 2b(s-q). 

Similarly, from 2 - 2, we get d(p-r) £e(s-4q). 

Case 1. Suppose p=r. Then s # q (because (p,q) # (r, s)). Since p—r=0 and s-q#0, 2 
and 2 imply that b =e =0. Our two lines are both vertical and are simply x = p and x =r. 
But since p =r, the two lines are identical. J 


Case 2. Suppose p#r. Then we can rewrite 2 and 2 as 


S- S- 
| and dee d 


azb ; 
p-r p-rT 





Since at least one of a or b must be non-zero, £ implies that both a and 6 must be non-zero. 
Similarly, since at least one of d or e must be non-zero, 3 implies that both e and e must 
be non-zero. Now use = and ® to rewrite = and 2 as: 
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S- Cc 
awe eee) and 
p-r b p-r e 








And now, = - 2 yields c/b = f/e. 

We now show that a point (t,u) is in the line az + by +c = 0 if and only if it is also in the 
line dx + ey+ f = 0. Equivalently, (t,u) satisfies ax + by + c= 0 if and only if it also satisfies 
dx +ey+ f =0. We will thus have shown that the two lines are identical: 

















s- s- 'é 
eee ee a ee ee Oe hips 
p-r p-r b 
Btu st e(tisur 2) Sdreeusy. 
p-r e p-r e 


Fact 251. Given two distinct points (p,q) and (r,s), the unique line that contains both 


points is (q—-s)x+(r—p)y+ps—qr=02° 





Proof. We need merely plug in and verify that the given line contains (p,q) and (r,s): 


(q-s)p+(r-p)q+ps—-qr=0. ¥ (q-s)r+(r-p)s+ps-qr=0. o 














By Fact 250 then, this is the unique line that contains both points. 


Fact 26. The unique line that contains both of the distinct points (21, y1,) and (x2, y2) is 


(2-21) (y- 41) = (Ye - 1) (@- 21). 





Proof. By Fact 251, the unique line that contains both points is (y, — y2) x + (%2- 271) y+ 
L1Y2 — yi®2 = 0. Rearranging, (%2 - 71) (y-y1) = (ya-y1) (@ - 21). 














Definition 45. Two lines are perpendicular if 


(a) Their gradients are negative reciprocals of each other; or 


(b) One line is vertical while the other is horizontal. 





If two lines / and m are perpendicular, then we will also write / 1 m. 
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142.4. Distance 


Definition 274. Let A = (aj, q2,...,@,) and B = (b1,bo,...,bn) be two points in R” 
(n€Z"*). The distance between A and B is denoted |AB| and is defined as this number: 


IAB|=4 | > (a - bi). 





So, for n= 1 (R! = R, the real number line), the distance between two points A and B (A 
and B are simply real numbers) is simply 


|AB|=\/ (A- B)’ =|A- Bl =|B- Al. 


(This corresponds to Definition 46 in the main text.) 


For 7-= 2 (R?, the cartesian plane), the distance between two points A = (a,,a2) and 


b= (b1, bz) is 
|AB] = \/ (ay - b1)* + (a2 - bp)”. 


(This corresponds to Definition 48 in the main text.) 


For n = 3 (R®), the distance between two points A = (a1, a2,a3) and B = (by, by, bs) is 


|AB| =V (ay = by)” + (a2 = by)? oF (a3 = by 


Etc. 


Given a point a and ¢ > 0,°" the e-neighbourhood (or sometimes ball) of a point a is 
the set of points each of whose distance from a is less than ¢. (So, this captures the idea of 
“nearby” points.) A bit more formally and precisely, 


Definition 275. Let A be a point in R"” (n€ Z*) and e > 0. The e-neighbourhood of A, 
denoted N; (A), is the set of points in R” defined by 


N. (A) = {B eR”: |AB| <e}. 





So, for n = 1 (R' = R, the real number line), the e-neighbourhood of the point A (A is 
simply a real number) is simply the open interval centred on A and with width 2e: 


N.(A) ={B¢€R:|AB|<e}=(A-e,Ate). 





*“l'The symbol ¢ is the Greek lower-case letter epsilon. 
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Figure to be 


inserted here. 





For 7=2 (R?, the cartesian plane), the ¢-neighbourhood of the point A = (a1, a2) is 


N-(A) = {Be R?:|AB| <e} = {B= Gi BYR 4) Gah) +a <ef. 


For n = 3 (R®), the e-neighbourhood of the point A = (a1, a2, a3) is 
3 2 / 2 2 2 
N-(A)={BeR :|AB| <2} = {B= (bisba,bs) €R : (a; — by) + (a2 — be) + (a3 — bs) <e}. 


Informally, an isolated point in a set S' is one that isn’t “close” to any other point in S. 
Formally, 


Definition 276. Let c¢.S CR” (néZ*). We call x an isolated point of S if there exists 
some € > 0 such that 


NeGelns =fak. 
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142.5. Maximum and Minimum Points 


In Ch. 142.4, we learnt that given a point P ¢ R? and ¢ > 0, the symbol N, (P) denotes the 
e-neighbourhood of P (i.e. the set of points each of whose distance from P is less than ¢). 


Definition 277. Let P = (a,b) be a point in the graph G ¢ R? (which may be of a 
function f). We say that P is 


. A global maximum (point) of G (and f) if for all (x,y) € G, we have b> y. 
. The strict global maximum (point) of G (and f) if for all (2, y) € G, we have b > y. 
. A local maximum (point) of G (and f) if there exists ¢ > 0 such that 


For all (x,y) € N-(P) NG, we have b> y. 
_ A strict local maximum (point) of G (and f) if there exists ¢ > 0 such that 


For all (x,y) € N- (P) OG, we have 6 > y. 


. A global minimum (point) of G (and f) if for all (x,y) € G, we have b< y. 


. The strict global minimum (point) of G (and f) if for all (x,y) € G, we have b < y. 
. A local minimum (point) of G (and f) if there exists ¢ > 0 such that 


For all (x,y) € N- (P) OG, we have b< y. 
._ A strict local minimum (point) of G (and f) if there exists « > 0 such that 


For all (x,y) € N-(P)nG, we have b< y. 





Note: A graph can have at most one strict global maximum and at most one strict global 
minimum (hence the use of the definite article the). 


In contrast, a graph can have more than one of each of the other six types of extreme points 
(hence the use of the indefinite article a). 
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142.6. The Distance between a Line and a Point 
This subchapter restricts attention to the cartesian plane R?. 


Proposition 3. The unique point on the line ax + by+c=0 that is closest to the point 
(p,q) is 


B=( ee | 
PO aR” a2 +B 





Proof. Case 1. Suppose b= 0. Then the line may be rewritten as x = -c/a. 
The distance between (p,q) and an arbitrary point (—c/a,y) on the line is 


(p- c/a) +(q-y) = Vp? + 2/a? - Qcfa+ P + y? - 2ay, 
which by Fact 34(d) is minimised at y = -(-2q) /(2x 1) =¢. 
So, the unique closest point is simply (—c/a,q), which, as claimed, is equal to 


7 ap +bq+e 7 pee) -( _ wre ) 
(» az+b? ” +e ) WOO ge 8): 





Case 2. Suppose b #0. Then the line may be rewritten as y = (-a/b) x - c/b. 
The distance between an arbitrary point (x,y) on the line and the point (p,q) is 


(ep) + (ya? =y/ (e-n)? + (-Sr-£-0) 









2 


2 
x? + p?-2px + a + 7 +q ae ~ 2a + id 





which by Fact 34(d) is minimised at 


2(%+%-p) &n-ac-abq ap +bq+c 
CE SS SS SS — Q@—_———_ 
2(1+ 4) a? + b? 7 a? + b? 


Plug this x-coordinate into = to get 


a 7 ( C a abp + © (ap +bq+c) - Sf -be 
oe GN eae b oma: 


4 (a + 0) ep abp+ § (p+ Mir —be_ pape dg te 


a2 + b? a2 + b? 














=qt 
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Corollary 3. The distance between a point (p,q) and a line ax + by+c=0 is 


lap + bq + c| 


Var+h 
Fact 35. Let A be a point that is not on the line 1. Suppose B is a point onl. Then 


B is the point on | that is closest to A a 11 AB. 





Proof. We already proved <— in the main text. We now prove =>: 
(= = ) Suppose B is the point on / that is closest to A. By Proposition 3, 


B ( ap +bq+e 2 
— — Q(*————. 9g —- >————wuX~ 
: e+e 4 a? + b? 


By Fact 251, line AB is given by 


ap+bq+e “pees ap+bq+e ap+bq+e 
(24+ ET +e Je+( 7 +e ares aad Pad (1-' a? +b? )-af OPP )=0. 
If a = 0, then by Fact 25, / is horizontal and AB is vertical, so that by Definition 45, / 1 AB. 


If instead 6 = 0, then again by Fact 25, / is vertical and AB is horizontal, so that again by 
Definition 45, / 1 AB. 


So suppose a#0#b6. Then by Fact 27, the gradient of AB is 





) 


peor e(- pt). b 
a? + b? a2+bB } a 


which is the negative reciprocal of the gradient -—a/b of the line |. So, by Definition 45, 
11 AB. 














In Ch. 144.13, we’ll prove the above results again using the language of vectors. 
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142.7. Stationary and Turning Points of a Graph 


So far, we’ve defined only stationary points of functions. We have not defined sta- 
tionary points of graphs. 


Example 1567. Define f :R\ {0} > R by f (x) = x”. Then f has the stationary point 
(0,0). 


Now consider instead the graph of y = x?—this is simply the set {(x, y) € R?:y= Z|, We 


would quite sensibly like to say that this graph also has the stationary point (0,0). The 
problem though is that we haven’t formally defined what a stationary point of a graph 
is. So, right now, we can’t actually say that this graph has any stationary point. 





It turns out that writing down general definitions of stationary points of graphs is tricky, 
because a graph is simply any set of points (in R?). 


Example 1568. Below is a graph G. Intuitively and visually, we want to be able to say 
that P is a stationary point of G. 


Figure to be 
inserted here. 


But it’s tricky to write down a general definition of stationary points where this is so. 





Here’s one possible definition of stationary points for graphs: 


Definition 278. Let G ¢ R? be a graph. We call P = (a,b) €G a turning point of G if 
there exist ¢ > 0 and a function f whose graph is GN N-(P) and has turning point (a,b). 





Having defined a graph’s stationary points, we can also easily define its turning points: 


Definition 279. A turning point of a graph is any point that is both a stationary point 


and a strict local maximum or minimum of that graph. 





This subchapter will be paralleled by Ch. 146.5 (“Asymptotes of a Graph”). 
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142.8. Reflections 
This subchapter restricts attention to the cartesian plane R?. 


Proposition 26. Suppose P and Q be distinct points. Then there exists a unique point 
R#P that satisfies these two properties: 


(a) |PO|=|OR| (b) R is on the line PQ. 





Proof. In Exercise 93 (main text), we already proved existence—specifically, we proved that 
the point R = (2c-— a, 2d -— b) satisfies properties (a) and (b). 


We now prove uniqueness. To do so, we’ll prove that there are at most two points S that 
satisfy properties (a) and (b)—one is P and the other is R. We will thus have shown that 
R unique. 


Suppose S = (5;,5,) is a point that satisfies properties (a) and (b). 

Let P = (a,b) and Q = (c,d). 

So, |PQ| = V (c-a)?+(d-5)" = |QS| = V/(S2-c) +(S,-d)” or (c-a)? + (d-b)" 4 
(S20) (6,20) 

By Fact 26, the line PQ is (c- a) (y- b) = (d- 6) (w@- a). So, (c- a) (S, — 6) 2 (d-b)(S,-a). 
If c=a, then S,=a=c, and S,=d+\/(c- a)” + (d—b)°—these are the two points S that 
satisfy properties (a) and (b). 


d—b) (Sz - . 
If c#a, then S, 3 ( ) a) +6. Plugging 3 into S, we have a quadratic equation in 

















c-a 
S,, so that by the Fundamental Theorem of Algebra, S, has at most two real roots. 
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Fact 252. The graph of x? + y? =1 is symmetric in every line through the origin. 


Proof. Let | be the line y = kx (where k € R) (so that / is some line through the origin). 
Let G be the graph of x? + y? = 1. Pick any point A = (a, +V1- a”) eG. 
By Fact 41, the reflection of A in / is 








. ak? -a+2kV1—-a2 4V1—a24+2akF k?V1- a2 
7 1+k2 14+ k2 


which we now verify satisfies the equation x? + y? =1 (and so Be G): 


(“ 2_ qt 2kVJV1- ey (eae) 


1+k? 1+ k? 


a 6 acy [we nat ete (Led) 20 shal T=eEdald Te 


7 wy | La” + 4a? + kt (L-?) sak T— a? - 2h? 1-0”) sda VT- 0? ] 
+ 
— 2k? +14kt 
(1+k2)° 
We’ve just shown that the reflection B of any A é€G in / is also in G. So, I is a line of 
symmetry for G. 
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142.9. The Quadratic Equation 


We reproduce Fact 34 (from Ch. 14): 


Fact 34. Consider the graph of the quadratic equation y = ax? + bx +c. 
(a) The only y-intercept is (0,c). 
(b) There are two, one, or zero x-intercepts, depending on the sign of b? — 4ac: 
(i) If b?-4ac> 0, then there are two x-intercepts: 
ae ~b+ Vb? = dac 
2a 





2a 2a 


And as? +beve=a(e- 


(ii) If b? - 4ac = 0, then there is one x-intercept x = —b/2a (where the graph just 
touches the x-axis). 


2 
And aa? +bn+c-a(2+=) . 
2a 
(iii) If b? -4ac <0, then there are no x-intercepts. There is also no way to factorise 


the quadratic polynomial ax? + bx + ¢ (unless we use complex numbers). 


(c) It is symmetric in the vertical line x = —b/2a. 


(d) The only turning point is (-b/2a, Sy c), which is a strict global (i) minimum if 
a>0O; or (ii) maximum if a <0. 





Proof. (a) If x = 0, then y=a-07+b-0+c=c. 
(b) On p. 168, we already showed that the two roots of the quadratic equation are 


ae —b+ Vb? - 4ac 
7 2a 


If b? — 4ac > 0, then both roots are real. 
If b? — 4ac = 0, then there is only one real root, —b/2a. 
If b? — dac < 0, then there are no real roots (but there are two complex roots). 


(Proof continues below ...) 
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(... Proof continued from above.) 
b 
(c) Pick any point (p, ap’ +bp+c) in the graph. Its reflection in the line x = “55 is 
a 


b 
(-2 —p, ap’ + bp+ c}. This reflection point is also in the graph, as we now verify: 
a 


b \ b b? 2b.\ PB 
a(-2-p) +b(- -p] +e=a(5 +p +—p] -—-bp+c=ap*+bpte. J 
a a a a a 
(d) Differentiate?” the quadratic equation y = ax* + br +c with respect to x to get 
y' (x) = 2ax +b. 


The sole stationary point (and hence also the only possible turning point) is given by 
I =e 
y (2) =0 oF 


fa d y(z) (32) +03) +< e — a 

x= —an x)=al— — =—-——+c¢C=-— +6, 
20 : 2a 2a 4a 2a Aa 

(i) Suppose a > 0. Then y’ (x) < 0 everywhere to the left of Z and y’ («) > 0 everywhere to 

the right of x. Hence, by the First Derivative Test for Extrema, % is a strict local minimum 

and thus, by Definition 58, also a turning point. 


(ii) Suppose a < 0. Then y’ (a) > 0 everywhere to the left of Z and y’ (x) < 0 everywhere to 
the right of x. Hence, by the First Derivative Test for Extrema, 7 is a strict local maximum 
and thus, by Definition 58, also a turning point. 














It was easy to prove that y = ax* + br +c is symmetric in the vertical line x = —b/2a (Fact 
34(c)). What’s harder to prove is that this is the unique line of symmetry: 


Proposition 27. There is only line of symmetry for y = ax? + bx +c. 


Proof. Let G be the graph of y = ax? + ba +c and | be a line of symmetry for G. 
Let dx + ey + f = 0 be the equation for J. 
b 
Since a +0, we can rewrite G as y = 27+ gx +h, where g = — and h= “a 
a a 
Case 1. Suppose / is vertical (or e = 0). Then d # 0 and we can rewrite the equation of | 
as x-—=0. 
d 


Let P, = (21,27 + 92, +h) €G. Its reflection in / is the point 
1 
an i 9 
P, = 27, F121 + ge +h : 


The line / is a line of symmetry for G if and only if the above reflection point is also in G, 
Lee 





°”2Here we rely on results given only later on in Part V (Calculus). 
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2 
e+gn+h=(24-m) rg(25-m)+h 


: (omitted algebra) 


— 0 = (dx, — f)(dg+2f) 
b 
The above equation holds for all x, ¢ R. Hence, dg + 2f =0 or : = -5 =—37° 
a 


We’ve just shown that there is exactly one vertical line of symmetry for G, namely x = a 
a 
This completes our examination of Case 1. 


Case 2. Suppose / is not vertical (i.e. e # 0). (We will show that this produces a 
contradiction. We will hence have shown that no non-vertical line can be a line of symmetry 


b 
for G and thus that x = aoe is the only line of symmetry for G.) 
a 


d 
Then we can rewrite the equation of / as 7x + y+ 7=0, where 7 = — and 7 = f 
e e 


Claim 1. G andl intersect at most twice. 


Proof. Plug the equation for G into that for I to get 
O=int+yt+jrintartgrths+ j, 


which is a quadratic equation in x and hence has at most two solutions. Oo 
Claim 2. There exist infinitely many points in G each of whose reflection in 1 is not itself. 


Proof. By Claim 1, G and / intersect at most twice. These are also the only two points 
each of whose reflection in / is itself. Since G has infinitely many points, the claim holds.o 


Pick any point P» = (xo, v2 + gt + h) ¢ G whose reflection in / is not itself. By Fact 41,°° 


its reflection in I is the point 


. 2 . 
e 1X9 +45 4+ 9%o+h+ 
P= (0-21 : a J kt gay t+h-2 
12 + 








ig +03 + gua th+ j 0 
+1 


Observe that if tx + x2 + 9% 2+h+j7=0, then Py = Py, contradicting our choice of P». So, 
ing + 22+ gto+h+ 7 #0. 


Now, 1 is a line of reflection for G if and only if Py €G, ice. 





573 ap+bq+c to 
— 2a————_., q - 2h -————_ 
(» Oa 8 a? + b? 
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ao : 
jag + 23+ gtgth+j ( yee vale 9 tet 83 
——————————————————————— = = SS...» 2— ——__ = 


2 
a aa ae a4 


: (omitted algebra) 


) 
> 0 = (ing +0} + g0y+h+j)[1— Qiang + — (ing +o} + 
i 
© 242 
—> 0. = 1 2iey + (ira +03 + gra +h+ j) +98 
i 











This last equation in x2 has at most two solutions for x2, contradicting Claim 2. 
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142.10. Functions 


Definition 280. Let A and B be sets. A function f with domain A and codomain B— 
more simply denoted f : A + B—is the ordered triple (A, B,G), where G is any subset 
of Ax B that satisfies this property: 


For every x € A, there is exactly one y € B such that (r,y) €G. 


We call G the graph of f. 


For each (x,y) € G, we write f (x)= y or f:2+ y. We refer to the full specification of = 
as the mapping rule of f. 


Fact 45. (Vertical Line Test) Suppose G is a graph (i.e. any set of points in the 
cartesian plane). Then G is the graph of a nice function <> No vertical line intersects 
G more than once. 





Proof. Let D = {x: x is the first coordinate of a point in G}. 


No vertical line intersects G more than once = > For any a€R, there exists at most one 
béR such that (a,b) ¢G <=> G is the graph of the function f : D > R defined by f (2) 
equals the second coordinate of the point in G whose first coordinate is x. 














142.11. When a Function Is Increasing or Decreasing at a Point 


Definition 73 defined what it means for a function to be increasing or decreasing on a set. 
But in the main text, we did not define what it means for a function to be increasing or 
decreasing at a point. 


Nonetheless, Definition 73 does specify what it means for a function to be increasing or 
decreasing on a set that contains exactly one point. ‘This is not very interesting: 


Fact 253. Let DCR, f: D>R, and SCD. If|S|=1, then f is increasing, decreasing, 
strictly increasing, and strictly decreasing on S. 





Proof. Suppose S contains only one point. Then there do not exist a,bée S with a <b. So, 
the conditions in Definition 73(a), (b), (c), and (d) are vacuously true. 














Here’s a possible definition of when a function is increasing or decreasing at a point: 


Definition 281. Let D,C CR, f: D>C, andae D. We say that f is increasing at a if 
there exists « > 0 such that f is increasing on (a-¢,at+e)nD. 


To obtain three more definitions, replace the two instances of the word increasing in the 
last sentence with decreasing, strictly increasing, or strictly decreasing. 
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Example 1569. Define f :R—> R by f (x) = 2”. 


At 0, f is not increasing, strictly increasing, decreasing, or strictly decreasing because 


for all e > 0, f is not increasing, strictly increasing, decreasing, or strictly decreasing on 
(0O-€,0+¢€)N R= (-<e,¢). 





The above definition implies that if a is an isolated point of D, then f is increasing, 
decreasing, strictly increasing, and strictly decreasing at a: 


Example 1570. Define f : {1,2} > R by f(a) =1. 


At 1, f is increasing, strictly increasing, decreasing, and strictly decreasing because there 
exists ¢ > 0 (e.g. €¢ = 0.5) such that f is increasing, strictly increasing, decreasing, and 
strictly decreasing on (1 —¢,1+¢)n {1,2} = (0.5, 1.5) n {1,2} = {1} (Fact 253). 





Here’s a second possible definition of when a function is increasing or decreasing at a point: 


Definition 282. Let D,C CR, f: D>C, and ae D. We say" that f is increasing at 
a if there exists ¢ > 0 such that for every s € (a—¢,a)N D and every t € (a,at+e)nD, we 
have f(s) < f (a) < f(t). 

To obtain three more definitions, in the last sentence, replace (a) increasing with de- 
creasing, strictly increasing, or strictly decreasing; and (b) the two instances of < with >, 
SOL >: 





It turns out that this last definition is weaker than (or implied) by the previous definition: 


f 
Example 1571. Define?” f :R—R by f (x) = - fet Q, 
2m. for x €Q. 


At 0, f is increasing under Definition 282. but not under Definition 281. 


Note that f is not differentiable and is, under Definition 282, increasing only at 0 and 
not increasing, decreasing, strictly increasing, or strictly decreasing at any other point. 








°This definition (or similar) is given by Spivak (2006), Did (2013), and Sohrab (2014). 
°This example was stolen from Did (2013). 


1573, Contents www.EconsPhDTutor.com 


142.12. One-to-One Functions 


Fact 47. (Horizontal Line Test) A function f is one-to-one if and only if no horizontal 





line intersects the graph of f more than once. 


Proof. Let cé R and define g: R= R by g(x) =c. Let F and G be the graphs of f and g. 
Observe that 


F and G intersect at least twice 














— There exist a,b with a # b such that (a,c),(b,c)e FAG 

—> There exist a,b with a # b such that f (a) =c and f(b) =c 

—> There exists y € Codomain f (namely, y = c) such that there are 
more than one xz € Domain f (namely, x = a,b) with f (x) = y. 

—> f is not one-to-one. 


Lemma 2. Let DER and f: D>R be a function. 


(a) If f (21) < f (2) < f (x3) for any 21 < tq < 23, then f is strictly increasing. 
(b) If f (v1) > f (v2) > f (v3) for any 41 < 42 < 23, then f is strictly decreasing. 





Proof. (a) Let a,b¢ D with a <b. Pick any cé (a,b). By assumption, f (a) < f (c) < f (0). 
So, f (a) < f(b) and by Definition 73, f is strictly increasing. 


(b) Similar, omitted. 














Definition 283. Given a set S CR, the largest and smallest numbers in S (if they exist) 
are denoted max S and min S. 


Example 1572. Let S = {1,2,3}. Then max S =3 and minS = 1. 


Example 1573. Let T = [0,1]. Then maxT = 1 and minT =0. 





Example 1574. Let U = (0,1). Then neither maxU nor minU exists. 


Lemma 3. Let D © R and f : D > R be a function. If f ts neither strictly in- 
creasing nor strictly decreasing, then there exist x1 < x2 < x3 such that either f (x2) < 





min { f (1), f (w3)} or f (v2) 2 max {f (a1), f (wa). 


Proof. Suppose f is neither strictly increasing nor strictly decreasing. 


Then by Lemma 2, there exist 1, %,73 € D with x71 < £9 < ©3 such that 
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NOT-“f (21) < f (v2) < f(v3)” OR  NOT-*f (x1) > f (v2) > f (@3)” 
f (@2) < f (1) OR f (x2) > f (73) AND ff (x2) > f (a1) OR f (x2) < f (#3) 
f (a2) < f (a1) AND f (x2) 2 f (21) OR — f (tz) s f (41) AND f (2) < f (3) 


rtd) 


f (v2) s min {f (a1), f (z3)} OR f (v2) > max {f (21), f (a3)}.- 














Proposition 4. Suppose a nice function is continuous and its domain is an interval. If 


this function is one-to-one, then it is also either strictly increasing or strictly decreasing. 





Proof. Let D be an interval and f : D > R be a continuous function. 


If D is empty or contains only one point, then the claim is vacuously true. So, assume D 
contains more than one point (and hence, infinitely many points). 


Suppose for contradiction that f is neither strictly increasing nor strictly decreasing on D. 
Then by Lemma 3, there exist x, < rq < x3 such that either f (x2) < min{f (21), f (x3)} 
or f (v2) > max {f (21), f (a3)}. Since f is one-to-one, these last two weak inequalities 
may be replaced by strict ones—i.e. either f (x2) < min{f (7), f(73)} = a or f (x2) > 
max {f (1), f (3) } = 6. 

If f (v2) <a, then pick any y € (f (2) ,a). By the Intermediate Value Theorem (Theorem 
12), there exists x4 € (41, £2) such that f (x1) = y, so that f is not one-to-one (contradiction). 
Similarly, if f (v2) > 6, then pick any y € (b, f (v2)). By the Intermediate Value Theorem 


(Theorem 12), there exists x5 € (1, %2) such that f (#5) = y and f (25) =y, so that f is not 
one-to-one (contradiction). 














1575, Contents www.EconsPhDTutor.com 


f(we)=f(a1) OR — f(x) <min{f(7),f(v3)} OR f(a) > max{fi 


142.13. Inverse Functions 


Fact 50. Suppose f: A> B is a function with range C. Then f is one-to-one if and 
only if there exists a function g: C > A where for every ye C, we have 


y=f (2) = g(y) ==. 


(Moreover, if g exists, then by Definition 79, it is the inverse of f.) 





Proof. (==> ) Suppose f is one-to-one. Then for each y € C, there exists a unique x € A 
such that f(a) = y. And so, we can construct the function g : C > A, where for every 
x €C, we have 


y= f(z) — g (y) = x. 
(<= ) Suppose the function g exists. Let a,b¢ A with f (a) = f (0). 


Then g(f(a)) =a and g(f (b)) =b. By =, 9(f (a)) = 9(f (b)) and a=. 
We've just shown that f(a) = f (b) implies a = b. So, by Fact 48(b), f is one-to-one. 


Fact 254. Every inverse is both (a) one-to-one; and (b) onto. 


Proof. Let f: A> B be a function with range C and inverse f~!:C > A. 
(a) Pick any y1,%2 € C with y, # yo.°”° Then there exist 71,22 € A with x, # x2 such that 
f (a1) =y1 and f (x) = yp. Hence, f! (yi) = 21 #29 = f-* (yo) f° is one-to-one, 

(b) For each x ¢ A, there exists y ¢ C such that f~'(y) =x. Hence, f~! “hits” every element 
in A = Codomain f7! and is onto. 


























Fact 51. Suppose the function f has inverse f-!. Then 


y= f(z) — fe Gea, 





Proof. (==> ) By Definition 79 (of inverses). 

(<= ) Suppose f~! (y) 42. Then x € Domain f. 

Let z 2 f(x). By Definition 79, f-! (z) 22. 

By Fact 254(b), f+ is one-to-one. Hence, by + and Z z=y. Thus, y 2 f (2). 














Fact 57. Suppose the function f has inverse f~! and Ga is the inverse of f-!. Then 


f is onto. 








"STF no such yy, yo exist, then C' is either empty or contains exactly one element—in either case, f~ is 
one-to-one. 
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Proof. Below, we'll prove that 
(a) Domain f = Domain (fy. 
(b) f(z) = (fy (x) for all x € Domain f. 


(c) Codomain f = Codomain Cae <=> f is onto. 


We’ll thus have proven that f = Cane <> f is onto. 


(a) By Definition 79 (of the inverse), Codomain z Domain f and Domain (fay 2 
Range f”?. 

But by Fact 254, f7! is onto, ie. Range f7! 3 Codomain ae 

By 3 and 2 Range f~! = Domain f. By * and 2 get Domain (fy = Domain f. 

(b) Let x € Domain f = Domain tae atid Y= F (2). 

Then f~'(y) =2. So, (oe (x) = y. Hence, (ae (aya 7 2): 

(c) Observe that Range f = Domain f = Codomain (ey 











So, Codomain f = Codomain (fy <= Codomain f = Range f <= f is onto. 





= 
Fact 255. Let f be a function and A & Domain f. Suppose fl, has inverse fl, . Then 


zi = 
(a) The composite function f o fl, exists and is defined by (7 ° ‘il )(@) =%. 


i 
(b) The composite function f F of may not exist. 


=i 
(c) Nonetheless, for all xe A, we have al Ce) ae. 





eff ai 
Proof. (a) Since Range fl, = Domain fl, = AC Domain f, the composite function f o fl, 
exists. 


dine Donan |. ser eae | a ee Ge 
a LE omain f| ,ify=/|, (az), then x = f (y)—an o,(fos|,)()=F (4, (v)) - 


(b) Define f : {0,1} + {2,3} by f (0) =2 and f (1) =3. Let A= {0}. 


=f Si 
Then Range f = {2,3} ¢ {2} = Range fl, = Domain fl, , so that fl, o f does not exist. 


| -1 
(c) Suppose x € A. Since Domain fl, = Range fl. we have f (x) € Domain fl, 














1 -1 -l 
For all x ¢ Domain fl, , if y= f (x), then # = fl, (y)—and so, uP (7 Ge) =a: 
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142.14. Standard Definitions of the Identity and Inclusion 
Functions 


In the main text, Definition 101 defined an identity function. But as mentioned in n. 246, 
that was not quite the standard definition of an identity function. The standard definition 
of an identity function is this: 


Definition 284. If S is a set, then the identity function on S is the function Igy: S > S 


defined by Ig (x) = =. 





The main text’s Definition 101 (of an identity function) is actually the standard definition 
of the inclusion function from S to T: 


Definition 285. Let S and T be sets. If S ¢ 7’, then the inclusion function from S' to T 





is the function ig: S > T defined by ig (x) = 2. 


The inclusion function (from S to 7’) differs from the identity function (on S) only in that 
its domain S' might be a strict subset of the codomain T. 
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142.15. Left, Right, and “Full” Inverses 


In the main text, Definition 79 gave a definition of the inverse function (or more simply 
inverse). But as noted in n. 178, that was not quite the standard definition. It turns out 
that the standard definition of an inverse (we might call this the “full” inverse) is that it’s 
both a left and right inverse: 


Definition 286. Let A and B be sets, f: A > B be a function, and J, and Ip be the 
identity functions on A and B, respectively. The function g is 


(a) A left inverse of f if go f =Iy; 


(b) A right inverse of f if fog =Ip; 


(c) An inverse of f if it is both a left inverse and right inverse of f. 





Equivalently, the function g is an inverse of the function f: A> Aif fog=gof=Ty. 


Recall that a function is (a) a surjection (or surjective or onto) if every element in its 
codomain is “hit”; and (b) an injection (injective or one-to-one) if each element in its 
codomain is “hit” at most once. We call a function a bijection if every element in its 
codomain is “hit” exactly once—or equivalently, 


Proposition 28. A function has 


(a) A left inverse <> It is injective. 


(b) A right inverse <> It is surjective. 


(c) An inverse <> It is bijective. 





Proof. Let f: A> B be a function with range C. 
(a) ( = — ) Suppose f is not injective. Then there exist 71,72 € A and y € B such that 
tye ts but y= Ff (a1) = f (a). 


So, for any function g for which the composite function gof is defined, we have (go f) (21) = 


(9) (Ff (t1)) = 9 (y) and (90 f) (2) = g(y). Hence, (go f) (v1) = 2 and (go f) (@2) = x2 
cannot both be true. Thus, g cannot be a left inverse of f. 


( <— ) Suppose f is injective. Then for each y € C, there exists exactly one x € A such 
that. 7 (2) = 

Define g: C > A by g(y) 2 x. So, the composite function go f : A > A exists and is defined 
by (@o7 (ayaa (2) =a) +x. Hence, go f =I, and g is a left inverse of f. 


(b) ( = = ) Suppose f is not surjective. Then there exists y € B such that for all x € A, 
f(x) # y. So, if h is a function for which the composite function f oh is defined, then 
(foh)(y) =f (h(y)) #y and h cannot be a right inverse of f. 


( <— ) Suppose f is surjective. Then for each x € B, there exists some y € A such that 


f(y) 22. 
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Define h: B > A by h(a) 2 y. So, the composite function foh:B > B exists and is 
defined by (f 0h) (x) = f (h(x)) 2 f (y) 2x. Hence, foh = Ig and h is a right inverse of f. 
(c) follows from (a), (b), and Definition 286(c). 














Example 1575. Define f : {0} > {1,2} by f(0) =1 and g: {1,2} > {0} by g(1) = 
g (2) =0. 
Observe that f is injective (but not surjective). So, f has a left inverse (but not a right 
inverse). 


In contrast, g is surjective (but not injective). So, g has a right inverse (but not a left 
inverse). 


Indeed, g is a left inverse of f and f is a right inverse of g, as we now show: 


Since Range f = {1} ¢ Domaing, the composite function go f : {0} > {0} exists and is 
defined by (gf) (0) = 9 (f (0)) = g(1) =0. Hence, go f = I). 


Proposition 29. |Let f: A> B be a function with range C and g be a function. 


(a) Suppose f is injective. Then g is a left inverse of f < > g satisfies these three 
conditions hold: 
(i) Cc Domain g; 
(ii) Codomain g = A; and 
(iii) g maps each x €C to the unique y€ A for which f (y) =«. 
(b) Suppose f is surjective. Then g is a right inverse of f <> g satisfies these three 
conditions hold: 
(i) Domain g = B; 
(ii) A Codomain g; and 
(iii) g maps each x € B to some y€ A for which f (y) =z. 
(c) Suppose f is bijective. Then g is an inverse of f <=> g satisfies these three condi- 
tions hold: 
(i) Domain g = B; 
(ii) Codomain g = A; and 
(iii) g maps each x € B to the unique y€ A for which f (y) =a. 


Moreover, f has only one inverse. (Uniqueness of Inverse) 





Proof. (a) ( <= ) Since Range f = C ¢ Domain g, the composite function (go f): A> A 
exists and is defined by (go f)(y) = g(f (y)) = g(x) =y. So, go f = I, and g is a left 
inverse of f. v 


( = = ) We show that if any of the three conditions is violated, then go f # 4 and g is not 
be a left inverse of f: 


(i) If C¢ Domain g, then go f is not defined. 
(ii) If Codomain g # A, then Codomain (go f) = Codomain g ¢ A. 
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(iii) Suppose that for some xz € C, y is the unique element in A such that f(y) = x, but 
g(a) #y. Then (go f) (y) = 9 (f(y) =9 (x) #y- 


(b) (<=) Since Rangeg ¢ A = Domain f, the composite function (fog): B= B exists. 
Moreover, for each x € B, we have (f og)(x) =f (g(x)) =f(y) =a. So, fog=TIp and g is 
a right inverse of f. J 


( == ) We show that if any of the three conditions is violated, then f og # Ig and g is not 
a right inverse of f: 
(i) If Domain g + B, then Domain (f ° g) = Domain g # B. 
(ii) If Range f = A ¢ Codomain g, then f og is not defined. 
(iii) Suppose that for some ze B, g(x) =y but f(y) #2. Then (fog) (2) = f(g(z)) = 
fy) # a. 


(c) ( <= ) Suppose the three conditions hold. Then by (a) and (b), g is a left and right 
inverse of f. And hence, by Definition 286(c), g is an inverse of f. 


( == ) Suppose any of the three conditions is violated. Then by (a) and (b), g is either 
not a left inverse or not a right inverse of f. And hence, by Definition 286(c), g is not a 
inverse of f. 


There is only one function that satisfies the three conditions, so that the inverse is indeed 
unique. 














It turns out that there is an asymmetry between left and right inverses. We can meaningfully 
single out a left inverse and call it the Left Inverse: 


Definition 288. Let f: A > B bea function with range C’. Suppose f is injective. Then 


the Left Inverse of f is the function g: C' > A that maps each x € C to the unique y € A 
for which f (y) =2. 





The above definition of the Left Inverse is what we’ve been using as our definition of 
“the” inverse throughout the main text of this textbook (and, in particular, corresponds to 
Definition 79). 


In contrast, we cannot meaningfully single out a right inverse and call it the Right Inverse: 


Example 1576. Define f : [0,1] > {2} by f(a) = 2. Then the following functions are 
right inverses of /f: 

e g: {2} + [0,1] defined by g(x) =0; 

e h: {2} > [0,1] defined by h(a) = 0.5; and more generally, 


e 4: {2} > [0,1] defined by i(x) =k, for any k € [0,1]. 


There’s no meaningful criterion by which we can call one of these functions g, h, or 7 the 
Right Inverse. 
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142.16. When Do f and [7! Intersect? 


Fact 54. Let D be an interval and f : DR be a continuous function with inverse fi. 


If f and f intersect at least once, then at least one of their intersection points is on 
the line y= 2. 





Proof. Pick any intersection point (a,b). 


If (a,b) is on the line y = x, then we are done. So, suppose it is not, i.e. a # b, so that either 
a>bora<b. 


Since (a,b) is in the graph of f-', by Fact 52, (b,a) is in the graph of f. 

Now define g(x) = f (x)- ax. By Theorem 26, g is also continuous on [a, }]. 

¢ Ifa<b, then g(a) = f (a)-a=b-a>0, while g(b) = f (b) -b=a-b<0. 

¢ Ifa>b, then g(a) = f (a) -a=b-a<0, while g(b) = f (b) -b=a-b>0. 

Either way, by the Intermediate Value Theorem (Theorem 12), there exists c € (a,b) such 
that g(c) = 0 or equivalently, f (c) =c. 


Hence, (c,c) is in the graph of f and, by Fact 52, is also in the graph of f~'.Thus, f and 
f~ also intersect at (c,c), which is on the line y = x. 














Fact 55. Let D be an interval and f : D > R be a continuous function with inverse f-'. 


If f and f intersect at an even number of points, then all of their intersection points 
are om the line y= a. 





Proof. Suppose (for contradiction) that there exists (a,b) with a # b that is in the graphs 
of f and ft. 


Then by Fact 52, (b,a) is in the graphs of f~! and f. 
Hence, any shared intersection points that aren’t on the line y = x must come in pairs. 


Since the total number of intersection points is even, any shared intersection points that 
are on the line y = x must also come in pairs. 


By Fact 54, at least one of f and f7!’s intersection points is on the line y = x. Thus, there 
exist at least two points on the line y = x at which f and f7! intersect. Call them (c,c) 
and (d,d) with c< d. 


Now, since f is continuous on an interval, by Corollary 7, it is either strictly increasing or 
strictly decreasing. 


Since c<d and f (c) =c< f (d) =d, f is strictly increasing. © 
Since a + b, we have either a> b or a< b. 

¢ Ifa>b, then f(a) =b< f(b) =a, contradicting ©. 

¢ Ifa<b, then f(a) =b> f(b) =a, contradicting ©. 
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142.17. Transformations 


We first formally define what a translation, a stretch, and a compression are. 


Definition 289. Let a>0. We say that the graph 7 is the graph G ... 


(a) Translated a units ... 
(i) Downwards if (p,q)eG <> (p,q-a)€ FH; 
(ii) Upwards if (p,q)€G <= > (p,qt+a)¢H; 
(iii) Rightwards if (p,q)eG <—> (pt+a,q)¢ HF; 
(iv)  Leftwards if (p,q)eG <> (p-a,q)€ FH; 
(b) Stretched by a factor of a, outwards from the ... 
(i) x-axis if (p,q)eG — > (p,aq)¢H; 
(ii) y-axis if (p,q)eG <> (ap,q)¢ HF; 
(c) Compressed by a factor of a, inwards towards the ... 
(i) «x-axis if (p,.q)eG <= (p,q/a)€ FA; 
(ii) y-axis if (p,q)eG — (p/a,q)¢H. 


Fact 58. Let a,b>0, andc,d¢R. Suppose f is a nice function. Then to get from the 
graph of f to the graph of y = af (bx +c) +d, follow these steps: 


1. To get (the graph of) y = f (x+c), translate leftwards by c units. 


. To get y = f (bx +c), compress inwards towards y-axis by a factor of b. 


2 
3. To get y=af (bx +c), stretch outwards from x-axis by a factor of a. 
4 


. To get y=af (bx +c) +d, translate upwards by d units. 





Proof. Let Go be the graph of f; G, of y = f(x+c); Go of y = f (br +c); G3 of y = 
af (bx +c); and Gy of y=af (br+c) +d. 


1. Observe that q=f(p) <=> q=f (p-c+tc). So, (p,q)€Go <> (p-c¢)€G. Hence, 
by Definition 289(a)(iv), G1 is Go translated leftwards by c units. 

2. Observe that q = f(p+c) — > q= f(bp/b+c). So, (p,q) € Gi => (p/b,q) € Go. 
Hence, by Definition 289(c)(ii), Gg is G; compressed by a factor of b, inwards towards 
the y-axis. 

3. Observe that g = f(bp+c) <— > aq=af(bp+c). So, (p,q) ¢€ Go <— (p,aq) € Gs. 
Hence, by Definition 289(b)(i), G3 is Gp stretched by a factor of a, outwards from the 
L-axis. 

4. Observe that q = af (bp+c) +d <> qt+d=af(bp+c)+d. So, (pg) ¢« Gs; <— 
(p,q+d)€G4. Hence, by Definition 289(a)(i), G4 is G3 translated upwards by d units. 
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Fact 256. Let dé R and f be a nice function. Suppose the graph of f has x-intercept 
(p,0), y-intercept (0,q), line of symmetry ax+by+c=0, turning point (r,s), and asymp- 
tote ax+ By+y=0. Then 


(a) The graph of y = f («)+d has y-intercept (0,q +d), line of symmetry ax+b(y-d)+c = 
0, turning point (r,s +d), and asymptote ax + B(y+d)+7=0. 


(b) The graph of y = f (2 +d) has x-intercept (p-d,0), line of symmetry a (a — d)+by+c = 
0, turning point (r-—d,s), and asymptote a(x -—d)+by+7=0. 


(c) The graph of y = df (x) has y-intercept (0,dq), line of symmetry ax + by/d+c = 0, 
turning point (r,ds), and asymptote ax + by/d+y=0. 

(d) The graph of y = f (dx) has x-intercept (p/d,0), line of symmetry ax/d + by +c =0, 
turning point (r/d,s), and asymptote ax/d+ By+y=0. 





Proof. We prove only (a)—the proofs of (b), (c), and (d) are similar and thus omitted. 
(a) Let F' be the graph of f and G be the graph of y = f (x) +d. 

Intercept. Since (0,q) € F, we have q = f (0) or q+ d= f (0) +d. Hence, (0,q+d) is in G 
(and is a y-intercept of G). J 


Line of symmetry. Pick any point P, = (u,v) in G. By Fact 41, the reflection of P, in 
the line az + b(y-d)+c=0 is the point 


au+bu+c-—bd are 
az+b2 a? +b? 


P=(u-2~ v — 2b 


It suffices to show that P,; €G. 


Since P, € G, we have P, = (u,u-d)« F. And the reflection of P, in the line of symmetry 
ax + by+c=0 is the point 


au+b(vu-d)+c 


ae ,(v-d) - 2b 


P= (u-2a ee en 


a? + b? 
Since P) € F, the translation of P) upward by d units is in G. But this point is P;—so, 


we’ve shown that P, € G. J 


Turning point. Define g: Domain f > Codomain f by g(x) = f (x) +d. Then the graph 
of g is G. 


We are given that r is (i) a stationary point; and (ii) a strict local extremum of f. We 
want to show that r is, likewise, also a stationary point and a strict local extremum of g. 


By (i), f(r) = 0 and hence g’(r) = f’(r) + <a = 0+0,°” so that r is also a stationary 
s 
point of f. J 


By (ii), there exists ¢ > 0 such that for all 2 € Domain f n(r-¢,r+e), either f(r) > f (x) 
or f(r) < f(x). Hence, it is also true that for all c ¢ Domaing n (r-e¢,r+e), either 
g(r) =f(r)+d>g(x) =f (x)+dor g(r) = f(r) +d< g(a) =f (x) +d. Thus, r is also a 
strict local extremum of g. 





°7Here we are using results from Part V (Calculus) or the accompanying Appendices. 
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Asymptote. We’ll deal with two cases: 
Case 1. ax + By+v7y =0 is vertical (so 6 =0 and the line may be rewritten as x = -y/a). 
By Definition 322,°” the left- or right-hand limit of f at —y/a is +00. 


Suppose the left-hand limit of f at -y/a is oo (the other three cases are dealt with similarly). 
Then by Definition 321, for every NV € R, there exists 6 > 0 such that x € Domain f nN; (a) 
implies f(x) > N - d—and hence, also that g(x) = f(a) +d> N. Thus, the left-hand 
limit of g at -y/a is oo. And by Definition 322, the line x = -y/a or ax + Gy +7 =0 is an 
asymptote for g. 


Case 2. ax + Py+~y = 0 is not vertical (so 6 # 0 and the line may be rewritten as y = 


—(a/8) 2-4/8). 
By Definition 326, either lim f (cx) =-(a/B)x- 4/8 or jim. f (x) 2 -(a/B)x- 4/8. 


Suppose é (2 is dealt with similarly). Then by Definition 325, for every ¢ > 0, there exists 
N é€R such that Domain f n (N,0o) # @ and for every x € Domain fn (N, 00), we have 
f(z) € Ne (-(a@/8) x - y/8)—and hence, also g(x) = f(z) +d € N-(-(a/8)x-7/8+d). 
And so by Definition 326, y = -(a/8)x-7/8+d or ax + (y+d)+7=0 is an asymptote 
for g. 














Fact 59. Let aéeR and f be a nice function. 


(a) Suppose f (x) +a for all x € Domain f. Then 


(i) f is symmetric in the horizontal line y =a; and 
(ii) f is not symmetric in any other horizontal line. 


(b) Suppose f is not constant. Then f is not symmetric in any horizontal line. 





Proof. We already proved (a)(i) in the main text. 
Let F' be the graph of f. 


(a)(ii) Let be R with b ¢ a. Pick any c € Domain f. 

Consider the point (c, f (c)) € F. Its reflection in x = b is R = (c,2b- f (c)). 

Recall that as a function, f must map c to exactly one object. So, Re F <— > f(c) = 
2b — f (b) or, rearranging, b = f (c) + a, which contradicts , Hence, R¢ F. Thus, f is not 
symmetric in x = 6. 

(b) Let be R. Since f is not constant, there exists some c € Domain f such that f (c) # 519 
Consider the point (c, f (c)) € F. Its reflection in x = b is R = (c,2b- f (c)). 

Recall that as a function, f must map c to exactly one object. So, Re F <— > f(c) = 


3 
2b — f(b) or, rearranging, b = f (c), which contradicts +. Hence, R ¢ F. Thus, f is not 
symmetric in x = 6. 

















°’8Here we are using results from Part V (Calculus) or the accompanying Appendices. 
°“9Tf no such ¢ exists, then f is constant with f (a) =b for all  ¢ Domain f—contradicting our assumption 
that f is constant. 
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Fact 60. Let ae R and f be a nice, one-to-one function (whose domain is not empty). 
(a) Suppose Domain f = {a}. Then 


(i) f is symmetric in the vertical line x =a; and 
(ii) f is not symmetric in any other vertical line. 


(b) Suppose Domain f contains more than one point. Then f is not symmetric in any 
vertical line. 





Proof. We already proved (a) in the main text. 
Let F' be the graph of f. 
(b) Let ce R. We'll show that f is not symmetric in x =c. 


Since Domain f contains more than one point, pick any distinct d,e ¢ Domain f with cor- 
responding distinct points (d, f (d)),(e, f (e)) € F. 

By Fact 39, the reflections of (d, f(d)) and (e,f(e)) in x = c are (2c-d,f(d)) and 
(2c —e, f (e)). 

Suppose for contradiction f is symmetric in x = c. Then both (2c - d, f (d)) and (2c-— e, f (e)) 
are also in F’. 


Then since f is one-to-one, we must have 2c- d = d and 2c-— e = e—or, c=d and c =e, 
which implies d= e. But this contradicts our assumption that d and e are distinct. So, f 
is not symmetric in x =. 


Fact 61. Suppose f is a nice function. Then f is even <= > f is symmetric in the 
Y-axis. 


Proof. Let G be the graph of f. Then f is symmetric in the y-axis <> For each (x,a) €G, 
we have (-z,a)¢G <= For each x€ Domain f, f(z) =a=f(-x) <=> f is even. 


Fact 62. Suppose f is a nice function. Then f is odd =< > f is symmetric about the 
origin. 


Proof. Let G be the graph of f. Then f is symmetric about the origin <=> For each 
(x,a) € G, we have (-x,-a)€G <=> For each x € Domain f, f (4) =a=-f(-r) <> f 
is odd. 
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142.18. Trigonometry 


We reproduce from Ch. 102.3 this textbook’s official definitions of the sine and cosine 
functions: 


Definition 219. The sine function sin: R > R is defined by 


or) ntl 


SGD (2n+1)! 


Definition 220. The cosine function cos: IR > R is defined by 


2 4 6 fore) 
eee Ses (ae 
6! n=0 


yen 
Qa 


2! Al 


Definition 290. Let a,b¢€ R, r>0, and p,q¢ [0,27]. Let P = (rcosp+a,rsinp+b) and 
Q = (rcosqta,rsing+b), so that P and Q are points on the circle of radius r centred 
on (a,b). We define the arc PQ to be this set of points: 


PQ = {(rcost+a,rsint+b):te[p,q)}, On pr <-G, 
{(rcost+a,rsint+b):te[p,q+2z]}, for p> q, 





We reproduce from Ch. 56 the definition of the angle between two vectors: 


Definition 145. The angle between two non-zero vectors u and v is this number: 


= u:vV 





fully] 


Much of the following assumes knowledge of what’s covered in Part IV (Vectors). 


Definition 291. Let AB be an arc of a circle with centre O. The angle subtended by 
AB is the angle between the vectors OA and OB. 
Fact 257. (“Z” Rule) The angle between u and v equals the angle between —u and -v. 
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.580 











Definition 292. A triangle is any set T = {A,B,C} ¢ R? three distinct points that do 
not lie on the same line.?*! We call A, B, and C the triangle’s vertices (singular: verter). 
Its three sides are the line segments AB, BC, and AC. Each of the triangle’s three angles 
a, 0, and ¥y is defined as the angle between AB and AC BA and BC: and CA and CB 
(respectively). Moreover, we call each of a, 6, and y the angle facing or opposite to the 
side BC’, AC, and AB (respectively). 





So, by Definition 145, 
, AB- AC | BA-BC 4 CALCB 


a= ss malleel , and y = cos * ——. 
 paacy Taal] eallon 


Also, given that Range cos! = [0,7] and A, B, and C can’t line on the same line (see 
footnote), we have 


a, B;e (0,7): 


Fact 258. (Law of Cosines) Let {A,B,C} be a triangle. Suppose y is the angle facing 
AB. Then 
Bey, 


ee eee 
(a) |AB| = |BC| +|AC| -2|BC| |AC| cos +; 


ee aT eee) ee) a ee 
(b) [AB] =|BC| +|AC] -2BC’- AC; 


Be + [ac] - |B) 
(c) 0 = = 





2|BC| |AC| 





°80Which was informally stated as Fact 64 in the main text. 
581.4 B, and C lie on the same line <> The angle a, {, or ¥ is 0 or 7. 
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Proof. (a) and (b): 
[4B] =|Ac- BC] = (AC - BC). (AC - BC) = |BC| + |AC] -20A-OB 
CA-CB - [BC] 


BC| + [acy PAC ares + [AC] -2[BC|[AC| cos 4 


CA-CB - [BC] 


=|B oil + [ae] -2|Be| [a0] + [ae] ~2BC- AC. 
calles 











(c) Simply rearrange (a). 





Plug 7 = 71/2 into Fact 258 to get Pythagoras’ Theorem: 


Corollary 51. (Pythagoras’ Theorem) Let {A,B,C} be a triangle. Suppose y = 7/2 
is the angle facing AB. Then 


af = Bef + [ACh 





Fact 65. The sum of any triangle’s three angles is 7. 


Proof. Let {A,B,C} be a triangle; a = BC, b= CA, and c = AB: and a = |al, b = |b|, and 
c=|cl. 
So, atb+c=0,c+-(a+b), and 202+? +2a-b. 


Let a, 3, and y be the angles facing the sides BC’, AC’, and AB. So, 





_,-c:b _,-c:a _,-b-a 
a = cos’ ——., 8 = cost ——, and y= cos"! ’ 
cb ca ba 




















Now, 
cos (a+ 3) =cosacos@ -sinasin 3 
_ =6 ~ —c:a —C: Ey fy -c: =) 
cb ca 
(c-b)(c-a) _ \/ 2c? - (c-b)*\/a2e2 - (c-a) 
7 abc? abc? 
But, 


(c-b)(c-a) 2[(a+b)-b][(a+b)-a] 
=(a- b+b")(a@ +a: bis (a-b)° +070? +(a?+0’?)a-b 
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And, 
Pe -(c-b) 2 a2b? +b! + 2Ba-b-[(a+b)-b]’ = 020? +b! + 2a-b-(a-b+b) 
=a°b? +4 +2ba-b-(a-b) -b4 - 20?a-b = 02h? -(a-b) 


Similarly, 
ac? —(c-a) Pat+@ b? + 2a’a-b-[(a+b)-a]° =a! +a2b? + 2a2a-b- (a? +a- b) 


=a*+a°b? + 2a’a-b-a!-(a-b) - 2a?2a-b =a7b? -(a-b)’. 


Altogether, 
(a-b)? +a2b? + (a2 + ?)a-b-—a2b?+(a-b) 
ab (a? + b? + 2a-b) 
_ 2(a-b)?+(a2+R)a-b 
ab (a? + b? + 2a-b) 
_a-b[2a-b+ (a? +b?)] 
~ ab(a2 +b? + 2a-b) ab 


cos (a+ 6) = 





I]eo 
it 


Below, we show that a+ €[0,7t]. And so, we can apply Fact 99(b) to 3. cos7! (cos (a+ B)) = 


-b 
xp ses. 
ab 


_p- 
Thus, since y = cos a by Fact 101(c), a+ B+ 7=7. 
a 


is ee os 


We now show that a+ 3 €([0,7]. In fact, we’ll show that a+ (6 € (0,7]. 
First, note that since a, 3 € (0,71), a+ G € (0,271). Next, compute 








-c:b\?-c-a -c-a\’?-c-b 
sin(a+ (6) =sinacos § -sin Ssina = 1 — 1- (=) =* 





cb ca ca cb 
_V ab? — (a- b)” —c: a. \/ a2b? — (a- b) —C: b_ \/ ah? — (a- b) Cc: a c:-b 
b2c2 ca a2c? cb abc? “(- b >) 


Now, \/ a2b2 - (a-b)* > 0 and abc? > 0. Moreover, 


c-a c:by (a+b)-a, (atb)-b_a@+b-a_ abt+b) atta blatb) +b _ (a+b) 





b a b a b a 7 ab 
at+b at+b 9 4at+b 
= = —— (a?- ab+b? +a-b) = ——[(a-6) +ab+a-b] 2 ——[(a a-b)° +ab- ab] = a (a: 


4 
where at >, we use Fact 133 (Cauchy’s Inequality). 





So, sin(a + 3) > 0, which implies a + 6 ¢ (7,271), which implies a + 6 € (0, 7]. 
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Fact 67. Two triangles are similar << > Suppose one triangle’s sides have lengths a, b, 


and c. Then there exists k >0 such that the other’s sides’ lengths’ are ka, kb, and kc. 





Proof. Let the first triangle be {A, B,C}, with angles a, 3, and y facing BC, AC, and AB, 
respectively; also, |BC| =0. |AC| = 6, and |AB| =, 


Let the second triangle {D, E, F'} with angles 6, ¢, and ¢ facing EF, DF, and DE, respec- 
tively. 


( <— ) Suppose there exists k > 0 such that EF =ka, IDF = kb, and IDE| = ke. 
By the Law of Cosines, 











a ee 
a=cos! ee and 
2 2 2 ae a2 TP — 
5. (kb) + (kc) - (ka) eee (b? + c? - a?) en: FOTO 3, 
2 (kc) (kb) k?2cb 2cb 


We can similarly show that ¢ = 6 and ¢ = 7. 
(== ) Suppose 6 = a, ¢ = 6, and ¢ =. Then 
AB- AC | DE-DF 


COS & Ss) 1: 


“[agjac] [oe A 
Similarly, 


BA- BC » ED-EF 
cos 3 = —— = cose = ——_.. 
AB] |BC| JED||EF 








Plug BC = AC - AB and EF = DF - DE into 2: 
BA-BG BA-(AC-4B) BA.aé |AB) EB.ER ED-(DF-DE), BB.DF 


4B] [ED] ¢ [ED] a FF 
a 





Now plug = into 2 to cel =, = =, OF or /ED 
IBC] |EF| ¢ |EF 


ac] _[DFl oe _ Pel 





By repeating steps similar to the above, we can also show that > 


Sy or — = = 
|Bc| |EF| a |EF 


or IDF 2 PA, 


ayo 
And of course, we also have EF = a 
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From es 2 and g we see that there exists k > 0 (namely k = EF /a) such that the second 











triangle’s sides have lengths ka, kb, and kc. 








Proof. Let AABC have angle a facing BC. 








Suppose ADE F also has angle a facing EF’ and that 5a = oF 
en (DE|2 aa |AB| and |DF|2 aa ACI. 
Also, a = cos"! ———— AB AC = cos” | DE-DF so that AB-AC DE-DE Next, 
AB||Ac] [BE |[DF| |B |[Ac| [Dz ||DF| 


IEF[ _|DE| +|DFP-2DE-DF _ (Fat ABI) + (Rat |Ac|) - 2(/241 4B)) - (124 \act) 











IBC |ABP' +|AC/?- 2AB- AC |ABP +|ACP -24B- AC 
[DF|’ [DEP 9 DEI 
_ ac? JAB)’ + age ACh = 2ACP |AB|-|AC| _|DEP 
IAB? +|ACP -2A4B- AC _ ACP 
3 |DF| 
Hence, |EF| 2 AC] |BC|. Thus, by 4, 2, 2, and Fact 67 (<=), AABC and ADEF are 
similar. 














In primary school, we learnt that the area of a triangle is 
1 
ma Base x Height. 
Let’s now write this down as a formal definition: 


Definition 293. Let {A,B,C} be a triangle. Let D be the foot of the perpendicular?” 
from C to AB. The area of the triangle is this number: 


5B) 


Fact 259. Let {A,B,C} be a triangle. If a is the angle facing BC, then the triangle’s 
area 18 


; AB |AC| sin a. 








582See Ch. 62. 
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Proof. Let D be the foot of the perpendicular from C' to AB. Then the area of the triangle 
is 0.5|AB [CD]. 


So, it suffices to show that cD = |AC| sina. 
AD. AC ). (4B. ae) 


Blac] 5+ Next, 


Now, wo etaotani-( SAS ~~, 
4 AD] [ac 


(4B.70)' [AB.(aB+ De) [AB-aB+4B.e}’ [[aB) +0] st 
nr 
ge He 

AD-AC a ee, 

: a ae 


—>|2 —> 
So, |AC| sin? a = |AC| - 


a ee, eee 
By Pythagoras’ Theorem, cD = |AC| - |AD 


Hence, AC] sin? = oe Thus, |AC|sin a =+ ICD. (The negative value can be dis- 
carded since |AC| sina, ICD > 0.) 














Fact 73. Suppose A, Be R. Then 


Addition and Subtraction Formulae for Sine and Cosine 
(a) sin(A+B) =sin AcosB+cos Asin B 
(b) cos(A+B) =cosAcosB ¥sin Asin B 


Double Angle Formulae for Sine and Cosine 
(c) sin2A =2sin Acos A 
(d) cos2A = cos? A-sin? A = 2cos? A-1=1-2sin? A 





Proof. (We already proved (c) and (d) in Exercise 140.) 
(a) and (b) We first prove the Addition Formulae for Sine and Cosine: 
Fix BeR. Define f: R> Rand g:R-R by 


f (A) = sin (A+ B) -[sin Acos B + cos Asin B] and 
g(A) 2 cos(A + B) - [cos Acos B - sin Asin B]. 


Then f and g are differentiable, with 


f’ (A) =cos(A+ B) - [cos Acos B - sin Asin B] = g (A) and 
g' (A) =-sin(A+ B) -[-sin Acos B - cos Asin B] = —f (A). 


Moreover, (f \ 4 (9) is differentiable, with 
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((f) + (g)’)' (A) = 2F (A) f(A) + 29 (A) 9 (A) = 2f (A) 9 (A) - 29 (A) f(A) = 0. 


By Proposition 8 (Derivative Is Zero Function Implies Constant Function), there exists 
some c € R such that (Gy + (g)’) (A) =c for all AeR. But 


((f)° + (g)°) (0) = 
{sin (0+ B) - [sin0cos B + cosOsin B]}” + {cos (0 + B) - [cosO0cos B - sin0sin B]}” = 0. 


So, ((f)° + (g)?) (A) 20 for all AeR. 


Since for all A€ R, (f)’ (A) > 0 and (g)" (A) > 0, 2 implies that (f)” (A) = 0 and (g)” (A) = 
0—and hence, f (A) =0 and g(A) =0. And so, by + and 2 we have, for all Ae R, 


sin (A+ B) =sin Acos B + cos Asin B and cos (A + B) = cos Acos B - sin Asin B. 


Since B was arbitrary, these last two equations also apply for any B ¢R. This completes 
the proof of the Addition Formulae for Sine and Cosine. 











The proof of the Subtraction Formulae for Sine and Cosine is similar and omitted. 





Fact 79. The lines of symmetry for 


(a) sin are x = kn/2, for odd integers k; 





(b) cos are x =k, for integers k. 


Proof. (a) We will first show that no line of symmetry for sin can be (i) horizontal; or (ii) 
oblique. We will then show that (iii) sin is symmetric in (I) x = km/2, for odd integers k; 
but (IT) in no other vertical lines. 


(a)(i) Since sin is not constant, by Fact 59, it is not symmetric in any horizontal line. 


(a) (ii) Let a #0, b #0. Consider the oblique line az + by+c=0 or y = ae 


b b 
2b-c¢ 





,-2). This point is the point 
a 
on the line that is closest to some point @ in the graph of sin. 


Let R be the reflection of Q in the line, so that R is also the reflection of Q in P. But since 
the y-coordinate of Q is no lower than —1, that of R must be lower than -2. Hence, R is 
not in the graph of sin and this line cannot be a line of symmetry for sin. 


Suppose 7 > 0. Then the line contains the point P = ( 


The case where 5 <0 is similarly handled and thus omitted. 


(a) (iii) By Fact 42, the vertical line x =e is a line of symmetry for sin if and only if for all 
x, sinz = sin (2e-z). 


(a) (iii) (I) Suppose e is an odd integer multiple of 7/2. Then 
sin (2e — x) = sin 2ecos x — cos 2esinx = Ocosxz — (-1)sinz =sinz. 


So, x =e isa line of symmetry for sin. 
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(a)(iii)(II) Suppose e is not an odd integer multiple of 7/2. Then cose # 0. Moreover, 
there exists x € R such that sin(e- x) #0 and hence 


sin (2e —- x) - sin x = sin 2ecos x — cos 2esinx — sinx 
= 2sinecosecos x — (2cos” e - 1)sinz -sinzx 
= 2cose (sine cos x — cosesin x) 


= 2cosesin(e- x) #0. 


So, there exists « € R such that = is false. Hence, x = e is not a line of symmetry for sin. 
(b) Similar, omitted. 


Fact 92. The graph of tan has no lines of symmetry. 


Proof. We will show that tan is not symmetric in any (a) horizontal line; (b) vertical line; 
or (c) oblique line. 














(a) Since tan is not constant, by Fact 59, it is not symmetric in any horizontal line. 


(b) By Fact 42, the vertical line x =d is a line of symmetry for tan if and only if for all x 
(in Domain tan), 


tang = tan (2d-2). 
Consider the points 0,7t/4 in the graph of tan. Suppose 
tan 0 = tan (2d —- 0) and tan 7/4 = tan (2d - 7/4), 


1 tan2d-tan7/4 » tan2d-1 

or 0 = tan 2d and = = ——_.. 
1+tan2dtan7z/4 1+tan2d 

Plug + into 2 to get 1 = -1, a contradiction. Hence, no vertical line x = d is a line of 

symmetry for tan. 

(c) The reflection of the vertical asymptote x = 7/2 in any oblique line is a non-vertical 

(asymptote). So, the reflection of tan in any oblique line cannot be tan. Hence, no oblique 

line can be a line of symmetry for tan. 














Fact 93. The graph of tan| ae has no lines of symmetry. 
—7/2,7 











Proof. Same as proof of Fact 92. 





Definition 294. Let ACR and f: A> B bea function. We say that f is periodic with 
period p if for any aj, a9 € A, 


a, — a2 = kp for some ke Z f (ai) = fF (as). 
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Fact 98. (a) sin(sin"!x) =a for all x ¢ Domain sin"! 


(b) cos(cos"!x) = x for all z € Domain cos“! = [-1, 1]. 


(c) tan(tan"!x) = 2 for all x ¢ Domain tan”! = R. 








Proof. Apply Fact 255. 











Fact 99. (a) sin! (sinz) =2 


(b) cos? (cosz) =% 


(el tame (tan) = 





Proof. (a) (== ) Ifa ¢ [-7/2, 2/2], then sin’ (sinx) # x because Range sin! = [-7/2, 7/2]. 
(<=) If re [-n/2,7/2], then by Fact 99, sin“! (sinz) = x. 
The proofs of (b) and (c) are similar and thus omitted. 














Fact 101. (Addition Formulae for Arcsine and Arccosine) Let x € [-1,1]. 


(a) costa + sin's =n/2. 


(b) sinha + sin"! (-x) =0. 


(c) cos! + cos! (-2) =7. 





Proof. (a) Let x ¢[-1,1] and A £ sin! x € [-7/2,7/2], so that x = sin A = cos(-A+ 71/2). 
Since —A + 11/2 € [0, 1], by Fact 99(b), cos7! x = cos"! (cos (-A + 71/2)) 2 -A+ 71/2. 

Taking £42 yields the result. 

(b) By the Addition Formula for Sine, Fact 98(a), and Fact 100(b), 


1e+sin? (-x)) = sin (sin x) cos (sin (-x)) + Cos (sin x) sin (sin (-2)) 


=x\/1-(-2)° + V1- 22 (-2x) =0. 


Since Range sin”! = [-7/2, 7/2], we have sin! x + sin"! (-x) € [-7,7]. And so, sin7 x + 
sin’! (-x) must equal —7r, 0, or 71. 
1 


sin (sin7 


But sin! 2 + sin™! (—x) = +7 is not possible for any x. So, sin) a +sin7! (-«) = 0. 
(c) Let x €[-1,1] and A + cos! x [0,7], so that x =cos A = —cos(m-— A). 


Since 2-A¢ [0,7], by Fact 99(b), cos! (—a) = cos! (cos (1 — A)) 2n- A. 











Taking £42 yields the result. 
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Fact 260. The composite functions sin osin, cos"! ocos, and tan”! otan exist and their 
domain, codomain, and mapping rules are as follows: 


Domain Codomain 
x — 2k, 
—x + (2k +1) 


os : 7 77 a : 
sin‘ osin R |-7 | sin! (sinx) = 


pe) 
-—x + 2k71, 
cos”! 0 cos R [0, 7] @osn (C087) = 47 ofa 
—x + (2k + 2) 
tame ortam | IR \(2a-r la) 2 ae 7) (- tame (tan) = = hat. tor fon 





1 1 


Proof. (a) Since Range sin = [-1,1] ¢ [-1,1] = Domain sin™, sin™ osin exists and has 


domain Domain sin = R and codomain Codomain sin“! = [-7/2, 71/2]. 
For each x € R, define t € [-7/2, 37/2] and ke Z by x =t + 2kn.? 
Then sin! (sina) = sin“! (sin (¢ + 2k7t)) = sin”! (sint). 


If t € [-7/2, 2/2], then by Fact 99, 
sin! (sint) =¢ =a - 2k. 


If instead t € [7/2, 37/2], then m-t € [-7/2,7/2]. Hence, by the Difference Formula for 
Sine and Fact 99, 


sin‘ (sin t) = sin‘ (sin(a-t)) =m-t=-2+(2k+1)7. 


1 


(b) Since Range cos = [-1,1] ¢ [-1,1] = Domain cos™!, cos! ocos exists and has domain 


Domain cos = R and codomain Codomain cos“! = [0,7]. 

Again, for each x € R, define t € [-71/2, 37/2] and ke Z by x =t + 2kr. 
Then cos"! (cos) = cos”! (cos (t + 2k7t)) = cos (cost). 

If t € [0,7], then by Fact 99, 


cos! (cost) = # = x — 2kr. 


If instead t € [-7t/2,0], then t+ 7 € [0,7]. Hence, by the Difference Formula for Cosine, 
Fact 101(d), and Fact 99, 


cos! (cost) = cos! (— cos (¢ + 71)) = 2 — cos! (cos (t+ ™)) = m- (t+ 7) = -t = -—2 + Qk. 


And if instead t € [7t, 37/2], then t-7t € [0,7]. Hence, by the Difference Formula for Cosine, 
Fact 101(d), and Fact 99, 





°83The existence of such ¢ and k are given by the Euclidean Division Algorithm (Theorem 50). 
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ap] 


cos”! (cost) = cos! (— cos (t — )) = m-cos™! (cos (t — m)) = m-(t — 1) = 2n-t = —2+(2k + 2) zr. 


(c) Since Range tan = R € R = Domain tan“’, tan“! o tan exists and has domain Domain tan = 
R\ {(2a+ 1) 7/2: a¢ Z} and codomain Codomain tan“! = (-2, =). 
For each x € R, define s € (-71/2,7/2) and ke Z by v=s+kn. 


Now, tan”! (tan z) = tan”! (tan(s + k7)) = tan”! (tans) = s=2-kn. 














Fact 100. (a) sin(cos"!x) = V1- 2? for all x € [-1,1]. 
(b) cos(sin! x) = V1-2? for all x €[-1,1]. 


st for allxeR. 
© 


Vive 


il 
—— forallxeR. 
V1+2? 

Z = for all x € (-1,1). 


V1l-2 


(f) tan (cos"! x — i for all x €[-1,1] \ {0}. 


(c) sin(tan™ x) = 


(d) cos (tan! x) = 


(e) tan(sin™' x) = 





Proof. (a) Let y=cos’!a. Then x = cosy and 1-2? = 1-cos*y = sin’ y. 
Hence, sin (cos x) =siny=+V1-2?. 

For « € [-1,1], we have cos! x € [0,7t] and hence, sin (cos x) > 0.50, 

For all x € [-1, 1], sin(cos! x) = V1-a?, 

(b) Let y=sin"'a. Then x =siny and 1-2? =1-sin? y = cos” y. 

Hence, cos (sin x) =cosy=4V1-2?. 

For « ¢ [-1,1], we have sin™! x ¢ [-7/2, 2/2] and hence, cos (sin x) > 0. So, 
For all x € [-1, 1], cos (sin!) = V1- 2°, 


(c) Let y=tan ‘+2. Then x =tany and 1+2?=1+tan?y=sec”y. So, 








re ee ae 1 
a coszy 1-sin?y 1-sin?y 4_ [sin (tante)] 
And, l+a? = —EEE 
L= [sin (tan! x)| 
awe Jee. ie 
Rearranging, [sin (tan x)| = 1 io eae eee 
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j « Iz 
Hence, in(tan +a) =+ =4 ; 
ence sin (tan! x) eee = 


For x > 0, we have tan! a € [0,7t/2) and hence, sin (tan! 2) >0. So, 


jz] sik 
Vl+xe2 Vita? 


For x < 0, we have tan! x € (-7t/2,0) and hence, sin (tan! at) <0)).50, 





For all x > 0, sin (tan! x) zZ 











For all x <0 sin (tan! x) =— [a See . : 
V1+2x? Vi¢+a2? V1+2? 
Altogether, for all x € R, sin (tan! x) 


Vi+ a2 


(d) Let y=tan' 2. Then x = tany and 1+2?=1+tan?y=sec?y. So, 








1 
sec? y = = ; 
cos? y [cos (tan”! r)] 
And, l+a27= — 
[cos (tan”! x)| 
Rearranging, cos (tan! x) =+ : 


V1+22 
For « € R, we have tan! x € (—7/2, 7/2) and hence, cos (tan! zc) >0. So, 


1 


1l+2 





For all xe R, cos (tan! x) = : 
2 


(e) By (b) (and also Fact 98(a)), tan (sin! «) 2 eae) 
e and also Fac a)), tan (sin = —_____ = ———— for all re (-1,1). 
. COs (sin x) V1-2? 


= waa 
(f) By (a) (and also Fact 98(a)), tan (cos! x) = —— —) 7 *" for all a [-1,1]\< 


{0}. O 
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Fact 261. The composite functions sin-tocos and cos-!osin exist and their domain, 
codomain, and mapping rules are as follows: 


Domain Codomain Mapping rule 
x + (0.5 -2k) 7, 
—x + (2k +0.5) 7, 
—x + (2k +0.5) 7, 
(b) cos losin R a (sine a= (2k +0.5) 7 
—9 + (2k + 2.5) 7, 


(a) sin-'ocos R R sin! (cosz) = 





1 1 


Proof. (a) Since Range cos = [-1,1] ¢ [-1,1] = Domain sin™, sin™*ocos exists and has 


domain Domain cos = R and codomain Codomain sin™! = [-7t/2, 71/2]. 


Let x ¢ IR. We have sin (z + ) = cosz. 
Hence, sin! (cos) = sin”! (sin (z + =): And by Fact 260(a), 
2=2 fe 2€|2kn—71/2,2 2 
sin (sin(x+2)) = x+7/ kr, or + 7/2 € [2kn- 71/2, 2k + 1/2] , 
-~-7/2+(2k+1)a, forx+7/2¢€ [2ku+ 7/2, 2k + 37/2]. 


The result follows. 


-1 


(b) Since Range sin = [-1,1] ¢ [-1,1] = Domain cos™', cos" osin exists and has domain 


Domain sin = R and codomain Codomain cos“! = [0, 71]. 
Let x ¢ IR. We have cos (z - =| = sing. 


Hence, cos (sin) = cos”! (cos (x - =). And by Fact 260(b), 


—x + 7/2 + 2k, for eee [Qk — 1/2, 2k] , 
cos! (cos (« - =) ={2-7/2-2kn for x - e [2k7, 2k + 7], 
—7+7/2+ (2k +2) 7, for a- > ¢ [Qk + 1, 2hot + 37/2]. 











The result follows. 
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Fact 102. (Addition Formulae for Arctangent) Let x<R. 


(a) tan a2+tan?!(-x) =0. 


TU) 2. jor 2 > 0, 
—ik)/ 2 Ore, 0): 


Il 
(Sin tene oe tane == 
a 


+ 
eee tor cy We 


1-ay’ 


+ 
(c) tan‘+e+tanty={tan” — = for ry>1 AND cz >0, 


+ 
ee jor vy > IAND 3 <0. 
ay 





Proof. (a) Since Range tan™! = (-4 > 5) we have tan™! # + tan™! (-2) é (-7, 71). 
By the Addition Formula for Sine and Fact 100(c) and (d), 


sin (tan! 2 + tan”! (-7)) = sin (tan x) cos (tan7! (-x)) + cos (tan «) sin (tan7! (-z)) 


7 + = =0 
V1+27, [1+ (-2x)° vita? ./14(-2)? 


: 1 = 2 
Given ¢, we must have tan7' x + tan! (—x) = 0. 





il 

(b) Since Range tan™! = (-2 a 5), we have tan“! z+ tan! — € (-7, 71). 
a 

By the Addition Formula for Sine and Fact 100(c) and (d), 


1 1 1 
sin (tan x+tan7! - | = sin (tan! a) cos (tan - | + cos (tan! x) sin (tan - | 
He £ £ 
le 


We 1 
M14 A fay? VI+2 14 sy 
ear! 























a+] el 

Ray “Te 
_ r+] 

|r| eat oe r+1- |a| 

ie+l] c¢ 2+l a. 2 


1 2 1 
So, if x > 0, then sin (‘an z+tan! - | = 1 and hence, by €, tan} 2 +tan + — = To 
x x 
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1 1 
While if x > 0, then sin (tan x+tan? - | = -1 and hence, by é, tan et tan —==a/2. 
“a x 


(c) We continue with the proof in the main text. 

Case 2. zy>1 AND x >0. 

Then by Lemma 1(c), tan’ x+ tan! y € (7/2,71). 

So, by Fact 260(c), tant (tan (tan 2+tan? y)) =tanl2+tanly-—m7. Thus, 


-1 u+ry 
n ———. 


=tan ‘2+ tan Urn, 
L=7y 


ta 


Case 3. sy>1 AND «<0. 
Then by Lemma 1(d), tan7' x + tan! y € (-7, -7/2). 
So, by Fact 260(c), tan (tan (tan! a+tan? y)) =tanta+tanly+7. Thus, 


-1 U+rYy 
n ——__——. 


=tan/2s+tan (eens 
Ly 


ta 














Lemma 1. Suppose x,y¢R. Then 


(a) tan ’a+tan ly =47/2 7s eal 


(b) € (-7/2, 7/2) cy < 
(c) € (1/2, 7) zy >1 AND x>0; 
(d) € (-7, -71/2) Zu > ANE <0: 





Proof. (a) (<=) If zy =1, then y = 1/z and by Fact 102(b), 


4 7 1 jn/2, for x > 0, 
tan x+tan y=tan  «x+tan = 


& ‘|nj2;,. “sore <0. 


(a) (== ) Suppose tan"! a+tan!y 2 +7/2. It cannot be that x = 0 or y = 0 (because then 
tan! y= 47/2 or tan! x = +7/2). 


Suppose x > 0. If y + 1/x, then (because tan™! is strictly increasing), tan"'x + tan7ly # 
tan’) 2+tan (1/xr) = 2/2, a contradiction. So, y = 1/z or xy = 1. 


Similarly, if x <0, then again ry = 1. 


(b) If z = 0 or y = 0, then tan ‘2+ tan ‘ly = tan ly € (-7/2,7/2) or tan x+tan+y = 
tan! & € (-7/2,7/2) and ry =0< 1. So, suppose x + 0, y + 0. 


( <= ) Suppose zy < 1. 


Case 1. Suppose x > 0. Since y < 1/z and tan! is strictly increasing, 


it 
tan e2t+tany<tan 2 +tan’— = 7/2. 
r 
Since tan™! x > 0 and tan! y > -71/2, 
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tan }a+tan ly >-mn/2. 


Case 2. Suppose x < 0. Since y > 1/z and tan“! is strictly increasing, 
ak i ait a 
tan a2+tany>tan~ x2+tan~ — =-7/2. 
x 


Since tan7! x <0 and tan! y < 1/2, 
tan e+tan ly <n/2. 


(— > ) Suppose -7/2 < tant a2+tant y < 7/2. 

Case 1. Suppose x > 0. 

Then tan‘ a2+tan/y<7/2=tan 7/2 +tan7 (1/2). 

Since tan is strictly increasing, y < 1 /x or equivalently, ry < 1. 

Case 2. Suppose x < 0. 

Then tan‘ 2+tan 7 y > —-7/2 = tan +2 +tan! (1/2). 

Since tan is strictly increasing, y > 1 /x or equivalently, ry < 1. 

(c) & (d) Since Range tan“! = (-7/2,7/2), tan” +tan' y € (-7,7). And so, by (b), 


tan} e+tan lye (—-7,-7/2) u (1/2, 7) <> ry > 1. 
Now, tant at+tan ly (1/2,7) 
=> tan ta>0OR tan ly>0 
— x>O0ORy>Q0. 


Given that ry > 1> 0, this last condition is equivalent to x >0 AND y>0. 


Similarly, tan} e+tan lye (—7,—-7/2) 
=> tana <0 OR tan ly <0 
— x<0ORy<0. 











Given that ry > 1 > 0, this last condition is equivalent to x <0 AND y <0. 





Definition 295. Quadrants I, IT, III, and IV are the subsets of the cartesian plane 
defined by 


Quadrant I = {(z,y):c¢€R*,yeR*}, 
Quadrant II = {(z,y):reR ,yeR*}, 
Quadrant III = {(7,y):ceR ,yeR }, 
Quadrant IV = {(z,y):r¢€R*,yeR}. 


1 
We say that an angle A is in Quadrant I, IT, II, or IV if A« (26r: (26+ )7) Ae 


((2x +5) n, (2k +1)rt), Ae ((2k + 1), (24+ 5)7) or A€ ((2% + 5)r (2k +2) for 


some integer k. 
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Fact 262. For all x €R, sin (cos! x) 





Proof. Let 6 = cos" x € [0,7]. Then cos@ = x and sin@ > 0. 


Now, from the identity sin? 6+cos? 6 = 1, we have sin@ = +V1 - cos? = +V/1 - x. Since sin 6 > 
0, we can discard the negative value. Thus, sin (cos! x) = sin? = V1-72, as desired. 
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142.19. Factorising Polynomials 


Fact 263. Suppose two polynomials have degrees a and b. Then their 


(a) Sum is a polynomial of degree max {a,b}; and 


(b) Product is a polynomial of degree ab. 





Proof. (a) Omitted.°™* 
(b) Omitted.?* 














Theorem 8. (Euclidean Division Theorem for Polynomials.) Let p(x) and d(x) 
be P- and D-degree polynomials in x with D< P. Then there exists a unique polynomial 


q(x) of degree P—D such that r(x) = p(x) -d(ax)q(2) is a polynomial with degree less 
than D. 





P D P-D 
Proof. Let p(x) = )\ piv’ and d(x) = )idja'. Let q(x) = >> giz’ be a (P- D)-degree 
i=0 i=0 


1= 
polynomial. We have 


d(x) q(x) = qp_pdpx" + (qp_pdp-1 + ¢p_p-1dp) x’ | + (qp_pdp_2 + qp_-p-1dp-1 + dp_p-2dp) x? 
++ (qp_pdo + dp_p-1di +++ Mdp-1+q@dp) 2? +r (x), 


where r (2) is a polynomial of degree less than D. 
Now for i € {0,1,2,..., P-D}, pick q so that 


Pp = qp_-pap, 


Dp-1 = Qp-pdp-1 + ¢p-p-1dp, 








Pp-2 = Yp-pdp-2 + dp-p-1dp-1 + Yp_p-2dp, 


Pp = dp-pdo + dp-p-1d1 +++: + qidp-1 + qodp. 


Then p(x) -d(x)q(x) =r(2). This completes the proof of existence. 


To prove uniqueness, suppose s (a) is a polynomial. Then by Fact 263, the polynomial 


p(x) -d(2)a (2) =p(#)-d(a)q(a)+ d(z) [a(x)-s(2)] 


is of degree less than D if and only if q() = s(#).Hence, the polynomial gq is unique. 

















84See e.g. ProofWiki. 
°85See e.g. ProofWiki. 


1605, Contents www.EconsPhDTutor.com 


We need Euclid’s Lemma to prove the Rational Root Theorem: 


Lemma 4. (Euclid’s Lemma) Let a,b,d¢ Z. Suppose a and d share no common factors 


greater than one. If d divides ab, then d also divides b. 














Proof. Omitted. (See e.g. XXX) 





Theorem 11. (Rational Root Theorem) Let p(x) = pypx”" + ppv" | + +++ + pix + po 
with po,P1,---,;Pn€Z and also N, De Z. Suppose N/D is fully reduced. 





iN n 
#r(5) = 0, then aa eZ. 


N N 
Proof. Suppose x — — is a factor of p(x). (The proof of the case where instead x + D 


factorises p(x) is similar and omitted.) 


N 
Then by the Factor Theorem (Theorem 10), (5) = (or 


(5) +0a(5) +t (Bp) mt 
Pn D Pn-1 D Pl D Po =U. 


1 
Rearrange = as 


mv 
Dn-1N 1 +--+ +p,ND"? + pd”! = a 
Since LHS is an integer, D divides p,N”. But since D doesn’t divide N, by repeatedly 
applying Lemma 4, we find that D divides py. 


a 1 
Next, similarly rearrange = as 


_PoD” 


DrN™? + Dn-1DN™? test p, D1 = N 


Since LHS is an integer, N divides p)D”. But since N doesn’t divide D, by repeatedly 
applying Lemma 4, we find that N divides po. 
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142.20. Conic Sections 


For a very brief introduction to conic sections, either watch this video”*® or read the fol- 
lowing. (Or both.) 


Take a vertical line and an oblique line. Rotate the oblique line about the vertical line to 
form an infinite double cone. We call the vertical line the axis and the oblique line the 
generator. The midpoint of the cone (or the point where the axis and generator meet) is 
called the vertex. 


Now take a two-dimensional cartesian plane and slice the double cone from all conceivable 
positions and at all conceivable angles. The intersection of the plane and the outer surface 
of the double cone then form curves which we call conic sections. 








Generator 








Parabola 








We have three types of conic sections: 

1. An ellipse if the plane is less steep than the generator. A special case is the circle 
which is obtained if the plane is perpendicular to the axis. 

2. A parabola if the plane is exactly as steep as the generator. 


3. A hyperbola if the plane is steeper than the generator. 





°86Not made by me and narrated by an female Indian robot, but awesome nonetheless. 
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The ellipse and the parabola are formed from only one half of the double cone. In contrast, 
the hyperbola is formed from both halves—it thus has two branches.°®” 


We shall not do so, but it is possible to prove that in general, a conic section is the graph 
of the equation 


Ax? + Bry + Cy? + Dr+ Ey+F =0. 


We call B? - 4AC the discriminant, because it is possible to show that 


1. If B? -4AC <0, then we have an ellipse. 


2. If B? -4AC = 0, then we have a parabola. (The quadratic equation is an example 
of a parabola.) 


3. If B? -4AC > 0, then we have a hyperbola. 


In general, for any k €R, y = k/x is symmetric in the lines y = x and y = —2: 


Lemma 5. For allk é€R, y=k/x is symmetric in the lines y= x and y=-«. 





Proof. Recall (Fact 37) the reflection of (p,q) in y = 2 is (q,p). But 


So, y= 2 isa line of symmetry for y = k/z. 


Similarly, recall (Fact 38) the reflection of (p,q) in y = -x is (-q,-p). But 


p= <—_ = 2 Ss 
qd —P 











So, y =—x is a line of symmetry for y = k/z. 








°87There are also four types of degenerate conic sections. In each case, the plane cuts through (or 
contains) the vertex. We have 
1. A point if the plane is less steep than the generator (this is the degenerate ellipse). 
2. A single straight line if the plane is exactly as steep as the generator (this is the degenerate 
parabola). 
3. A pair of intersecting lines if the plane is steeper than the generator (this is the degenerate 
hyperbola). 
Now, suppose the generator, which is usually oblique, is now instead parallel to the axis. Then we get 
a degenerate cone that is the cylinder. Now, any plane that is perpendicular to the cylinder’s base 
produces: 
4. A pair of vertical lines. (This is considered a degenerate parabola, because the plane is exactly as 
steep as the generator). 
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Fact 108. Let b,c,d,e¢ R with d#0 and cd-be +0. Consider the graph of 


br +e 
dx+e 


(a) Intercepts. If e # 0, then there is one y-intercept (0,c/e). (If e = 0, then there are 
no y-intercepts.) And if b #0, then there is one x-intercept (-c/b,0). (If b = 0, then 
there are no x-intercepts. ) 


(b) There are no turning points. 


(c) There is the horizontal asymptote y = b/d and the vertical asymptote x = —e/d. 
(The asymptotes are perpendicular and so, this is a rectangular hyperbola.) 


(d) The hyperbola’s centre is (-e/d, b/d). 
(e) The two lines of symmetry are y = +2 + (b+e) /d. 





Proof. We already proved (a), (c), and (d) in the main text. We now prove (b) and (e). 





(b) dy (5 cd-be 1 db + (= 1 \-* -1 


= + = + — . 
dx da\d d? xtefd}) dxrd dr\ ad -x+e/d a (r+efd) 
By assumption, cd - be = 0. So, dy/dx #0. Hence, by Definitions 278 and 279, this graph 
has no turning points. 
(e) By Lemma 5, the following graph is symmetric in y = x and y = -2: 
_cd—bel 
a ee 





Now shift this graph leftwards by e/d units to get the graph of: 


ed=be 1 
~  d2 x t+efd’ 





which, by Fact 58, has lines of symmetry y = x + e/d and y = -(x+e/d). 
Now shift this last graph upwards by b/d units to get the graph of: 


b  (cd—be) /d? _ bx +c 
d a+e/d  dxte’ 





which, by Fact 58, has the claimed lines of symmetry: 


ap Petits and = ce 

d : d 

It remains to be shown that these are the only two lines of symmetry. If there were a third 
distinct line of symmetry, then there would be more than two asymptotes. But this is not 
the case. Thus, there can be at most two distinct lines of symmetry. 
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Fact 264. Let a,d,ce #0. Consider the graph of 


ax? + br +c 
dz +e 


(a) Intercepts. If e +0, then there is one y-intercept (0,c/e). (Ife =0, then there are 
no y-intercepts. ) 


If b? — 4ac > 0, then the two x-intercepts are 


a ) 
2a a 





If b? — 4ac = 0, then there is only one x-intercept (—b/2a, 0). 
And if b? — 4ac < 0, then there are no x-intercepts. 


The two turning points are 


d2 


(== (ae? + cd? — bde) /a bd -2ae +2 ee 


If ad >0, then the turning point on the left is a strict local maximum and the one on 
the right is a strict local minimum. And if ad <0, then the one on the left is a strict 
local minimum and the one on the right is a strict local maximum. 


There are two asymptotes, one oblique and one vertical: 


and 


ote that since the asymptotes are not perpendicular, this is not a rectangular 

Note that si th tot t dicul this 1 t t l 
hyperbola. ) 
bd — 2 
(d) The hyperbola’s centre is (-5, “I. 
(e) The two lines of symmetry are 
a+tVa?+d* bd-aetevVa?+d? 
v= 7 x + Pp 








Proof on the next page: 
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Proof. We already proved (a), (c), and (d) above. Here we prove only (b) and (e). 





dy d (ete **) - (dx + e) (2ax +b) - (ax? +bx+c)d _ adx? + 2aex + be - cd 


b = 
) Gr dal deve (dx +e)? (dx +e)” 
Hence, dy/dx =0 <= > adx? +2aex+be-cd=0 <> 


_ —2ae + \/4a*e? —- dad (be-cd) -e + \/(ae? + cd? — bde) /a 
— 2ad 7 
So, if (ae? + cd? - bde) /a< 0, then there are no stationary points. 
If (ae? + cd” -bde) /a = 0, then dy/dz = 0 at x = -e/d. But there is no point in y = 
(ax? + br +c) / (dx +e) at which x = -e/d. And so here, there is no stationary point. 


If (ae? + cd? - bde) /a > 0, then there are two stationary points, given by ©. Plugging these 


values of x into y = (ax? +bx+ c) /(dx +e) and doing the algebra (omitted), we can find 
the y-values and thus conclude the two stationary points are 


pq. (Vee Ok bd — 2ae +2 eee) 
’ ~ d ’ ’ 


d2 





with Q being to the left of P. 


Observe the numerator of dy/dz is a quadratic expression with coefficient ad on the squared 
term. Hence, if ad > 0, then this quadratic is U-shaped, so that Q is a strict local maximum 
and P is a strict local minimum. Conversely, if ad < 0, then the quadratic is n-shaped, so 
that Q is a strict local minimum and P is a strict local maximum. 


(e) Let (p,q) be a point in the hyperbola, i.e. it satisfies y = (ax? + ba + c) /(dx+e). Use 
Fact 41 to write down the reflections of the point (p,q) in the lines: 


poe, eee 
7 d a2 


Through an insane amount of algebra (omitted), it is possible to show that these reflection 


points also satisfy y = (ax? + ba + c) /(dx+e), thus proving that this hyperbola is indeed 
588 








symmetric in the above lines. 


It remains to be shown that these are the only two lines of symmetry. If there were a third 
distinct line of symmetry, then there would be more than two asymptotes. But this is not 
the case. Thus, there can be at most two distinct lines of symmetry. 

















°88This painful, brute-force method is not exactly the “proper” or “usual” way to find the lines of symmetry 
(for which see my “forthcoming” H2 Further Mathematics Textbook), but does avoid having to use other 
facts about conic sections that we haven’t discussed. 
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142.21. Inequalities 


[Fact 14 


Proof. (a) (==> ) Suppose aj, a@2,...,@n #0. 
Then applying Fact ??(a)( = = ) to a, and ag, we have a,az # 0. 


Next, applying Fact ??(a)( == ) to aja2 and ag, we have a,aza3 # 0. 


Finally, applying Fact ??(a)( == ) to ajaga3...@n_1 and ap, we have P = a,a9qa3...dy, #0. 
( <— ) Suppose without loss of generality a; = 0. 
Then applying Fact ??(a)( <= ) to a; and ag, we have ajaz = 0. 


Next, applying Fact ??(a)( <= ) to aja2 and a3, we have ajaa3 = 0. 


Finally, applying Fact ??(a)( <= ) to ajaga3...@n-1 and ap, we have P = ajaga3...dy, = 0. 
(b) (==) If any of ay, ag, ..., and a, is zero, then P =0 and in particular P } 0. 


Suppose an odd number of aj, ag, ..., and a, are negative (and the rest are positive). Let 
P, be the product of all the positive terms, a be one of the negative terms, and P» be the 
product of all the negative terms except a. 

By repeated application of Fact ??(b), P, and P, are positive. So, by the same result, P; P, 
iS positive. 

Now, by Fact ??(c), P = P,Poa@ is negative and in particular P } 0. 

( <— ) Suppose an even number of aj, dg, ..., and a, are negative (and the rest are positive). 


Let P3; be the product of all the positive terms and FP; be the product of all the negative 
terms. 


By repeated application of Fact ??(b), P3 and P; are positive. So, by the same result, 
P = PsP, is positive. 


(c) Similar to (b) and omitted. 
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143. Appendices for Part II. Sequences and Series 


Informally, a sequence converges if its terms “eventually” get “arbitrarily” close to some 
limit LD ¢ R. Formally, 


Definition 296. Let (a,) be a (real and infinite) sequence. Let L ¢ R. Suppose that for 
all ¢ > 0, there exists N such that for all n > N, we have 


la, - Ll <e. 


Then we say that the sequence (a,) is convergent and that its limit exists; moreover, it 
converges to L and its limit is L. 


That the sequence (a,) converges to L may be written as 


An > L or lim ag =; 


n—oco 


If a sequence does not converge, then we say that it diverges or is divergent, and its limit 
does not exist. 


Definition 297. Given the series aj + a2 +a3+..., its nth partial sum is the finite series 
Qi + 9 es? + Ay. 





A convergent series is then simply one whose partial sums converge: 


Definition 298. Let a, +a2+a3+... be a series and its nth partial sum be s, = a) + a2 + 
-+++@,. Consider the sequence (s,,) = (51, 52, $3,...). 


If the sequence (s,,) converges to some real number L, then we say that the series a, + 
dg +a3+... is convergent and that its limit exists; moreover, it converges to L and its 
limit is L. 

If a series does not converge, then we say that it diverges or is divergent, and its limit 
does not exist. 

Example 1577. Recall Grandi’s series: 1-1+1-1+1-1+.... 


The corresponding sequence of partial sums is (s,,) = (1,0,1,0,1,0,...). 


Pick ¢ = 0.4. Suppose that for some k, we have |s; — L| é¢. Then 


1 
lSska1 — L| = |S - Spai + 8~ — L| > |5% - Se41| —|s, - ZL] = 1-|s,- L| >1-e =0.6 ><. 
We have just shown that (s,,) diverges. Hence, Grandi’s series diverges. 


Fact 266. (Reverse Triangle Inequality) Suppose x,y¢R. Then |x - y| > ||x| -|y]]. 





Proof. 
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ry < |a| yl 
—2xy > -2|z||y| 
x+y? —Qaey > 2” + |yl - 2 || [yl 
(x-y)” > (le| -|yl)” 
lz —y| > lla] -|yIl- 

















Proof. Let (a,) be a non-zero arithmetic series. Let d = aj - a, be the common difference 
n 


And Sy = >) a be the nth partial sum. Below, > denotes the use of the Reverse Triangle 
i=l 
Inequality. 


Case 1. Suppose d = 0 (so that a, = a2 =...). Since (a,,) is non-zero, we must have a, # 0. 
1 
Pick € = |a,|/2. Let L ¢ R. Suppose that for some k, we have |s;, — L| < ¢ = |a,|/2. Then 


r 
|Sr+t — [| = Sz + Ak+1 — [| = |S + Ay — [| = lay = (L - s,)| Pa ||a1| = |L — s;| 
fax] _ far} _ 


i 
= = ($= L||2 —|s, —L|> - 
Ilex] — |sx~ L|| 2 Jaa] — [sx - L] > |aal|- > = 


Case 2. Suppose d # 0. If d > 0 (or d < 0), then let 7 be the smallest integer such 
that a; is positive (or negative). Pick ¢ = |d|/2. Suppose that for some k > j, we have 
ls, — L| <e =|d|/2. Then |azs1| = |ax + dl] > |a; + d| > |d| and 


r 
lSko1 — D| = |8% + Qe — L] = lager — (L - 5x)| > |laxai| - |Z - 55| 


d d 
= Ilana | ~ Sx = Li|| 2 lax 7 [Sx -L| - |d| = a = a =. 














Fact 114. LetaeR. If |r| <1, then 


atartar?t+are+-.-= 


7 





Proof. If a=0 or r =0, then the result clearly holds. So suppose a #0 and r # 0. 


Let S,=at+artar?+are+--+¢ar't, 





Lor 
By Fact 113, S,=a i 
-r 



































Let ¢ > 0. Pick K so that — r*| =¢. Then for all k> Kk, 
—1T 
a a 1-r* a a 
fay oo" 17 a leap masa 
a 





ir ae =€ eel <E. 




















l-r 
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Fact 115. Leta+0. If |r|>1, thena+ar+ar?+ar°+... diverges. 


Proof. Let L € R and s, be the nth partial sum of the given series. Pick € = |a;|/2. Suppose 
that for some k, we have |s; — L| <¢. Then 


|Ska1 — L| = |S +ayr* L| = Jarr* -(L- sk)| S Jair | -|L- Sr| 


a a 
= far |= oy = LI > far = ox = 2] > aul = fon ~ Z] > [al t= IS =e 











where > uses the Reverse Triangle Inequality. 


Fact 267. Suppose (an) > a and (b,) > b. If there exists N such that ay > by, for every 
n>N, thena>b. 


Proof. Suppose for contradiction that b> a. 





Then there exists N such that, letting ¢ ib-a> 0, for every n> N, 


la, -a| <e/3 and lb, -— b| < €/3, 
which implies An —a<e/3 and b- by <e/3, 
or equivalently, ate/3 5 ce and by, 5 b-e/3. 











3 2 
Altogether, b, >b-e/3 tate- é/3 = a+ 2e/3 > dn, contradicting ay > bp. 
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144. Appendices for Part III. Vectors 


Note that much of the discussion in these Appendices can be handled more easily with the 
machinery and terminology of linear algebra. But in these Appendices, I shall avoid the 
explicit use of linear algebra because it isn’t in H2 Maths. 


144.1. Some General Definitions 


In the main text, we defined terms such as vector, length, and unit vector in R? or R°. 
We now give their general definitions in R”: 


Definition 299. A point in R” is any ordered n-tuple of real numbers. 
Definition 300. The point (0,0,...,0) in R” is called the origin and is denoted O. 


Definition 301. Let A = (a1,do,...,@,) and B = (bj, bo,...,b,) be points in R”. Then 
the vector from A to B, denoted AB, is the ordered n-tuple defined by?®® 


AB = (b, — a1, by - a9,..., bn — dm) € R”. 


Definition 302. Given the vector u = (w,U2,-..,Un) € R", its length (or norm or mag- 
nitude), denoted ju], is the number defined by 


Jul =\/ut+ue+---4+u2, 


Definition 303. Given u = (uy, u,..., Un) and c€ R, the vector cu is 
GU = (Clin Cig. eu, 


Definition 304. Given the points A = (a1,q@9,...,@,) and B = (bj, b2,...,b,), the differ- 
ence B- A is defined as the vector AB, i.e. 


Pes (eae. 


Definition 305. Given a point A = (a1, a9,...,a@,) and a vector v = (v1, V2,.--, Un), their 
sum A+v is this point: 


A+v = (a, + 01,02 + U9,...,An+Un)- 


Definition 306. Given a point B = (6), bo,...,b,) and a vector v = (vj, V2,...,Un), the 
difference B-v is this point: 


B-v = (db, — UY, bo -— v2,...,bn — Un)- 


























*8°Note that technically, the set R” that contains the points A and B is different from the set R” that 


contains the vector AB. The former is a Euclidean space, while the latter is a vector space. But 
this is beyond the scope of A-Level Maths and so we shan’t worry about it. 
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Definition 307. Given the vectors u = (u1, U2,...,Un) and v = (0, v2,...,Un), their sum, 
denoted u + v, is this vector: 


U+V = (U1 + U1, U2 + Vo,---,Un + Un)- 


Definition 308. Given the vector u = (uj, U2,..., Un), its additive inverse, denoted -u, 
is this vector: 


—u = (-U1, -U2,..-,-Un) - 


Definition 309. Given the vectors u = (ui, U2,...,Un) and v = (vj, V2,...,Un), the differ- 
ence u—v is defined as the sum of u and v. 








Fact 268. Jf u = (uj, U2,...,Un) and v = (v1, U2,..-,Un) are vectors, then 
U-V = (Uy — U1, U2 — Ve,---,;Un — Un)- 
Proof. By Definition 308, -v = (-v1,-v2,...,-Un). And now by Definition 309, 
u-v=u+ (-Vv) = (uy — U1, U2 — Vo,---, Un — Un) - 











Fact 269. Suppose A, B, and C be points. Then AB AG C8) 


Proof. Let A = (a1,42,.--,@n), B = (b1,bo,...,0,), and C = (c,¢9,...,C,). Then by 
Definition 304, AB = (b; - a1, b2-a2,...,bn-Gn), AC = (c1 - a1, C2 - G2,.--,Cnx—Gn), and 
CB = (b -c1,b2-©,...,6n - Cn). And now by Fact 268, 

ABS AC = (G5 byte Sd) SH Sos eee) 


Sy Sep Oyscesbs =) SOD. 














Definition 310. Given the non-zero vector u = (w1,U2,---,Un) € R”, the unit vector in 
its direction (or its unit vector), denoted U, is defined by 


oak! 
u = —Uu. 
ju 





So, given the vector u, its unit vector U is simply the vector that points in the same direction 
but has length 1. 
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144.2. Some Basic Results 


Fact 270. Let a = (a1,a9,...,a,) be a vector andcéR. Then |ca| = |c||al. 





Proof. By Definitions 303 and 302, 


eal = le(ay005,..7,0y)| = |(0aj, Gao, + ..,€a,)|= ss (ca;)” 


n nm 
Yiea) ee = |cly\} >. a? = |c|[al. 
i=1 1=1 i=1 


Fact 124. Suppose a and b be non-zero vectors. Then 














2. 


(a) <=> aandb _ point in the same direction; 


(b) 


a 
(c) <—= ab; 
=> 


aandb _ point in exact opposite directions; 


(d) a tb. 





Proof. (a) Suppose 4 = b. Then a/|a| = b/|b| or a = ({a|/|b|) b. Since a = kb for some 
k >0, by Definition 137, they point in the same direction. 


Now suppose instead that a and b point in the same direction. Then by Definition 137, 
there exists k > 0 such that a= kb. Thus, 


4 1 1 i “ 
a= —~a=—_kb=—_kb= _b=b. 
lal [Ab [Afb] [| 
(b) Similar, omitted. 


(c) and (d) follow from (a) and (b). 

















Fact 117. Suppose v is a vector. Then |v|>0. Moreover, |v|=0 <> v=0. 





Proof. Let v = (v1, 02,-..,Un). Then |v| = \/ > v? > 0, with |v] —> v=0. 
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The next result holds only in two-dimensional space: 


Fact 125. Leta, b, andc be vectors. Ifa tb, then there exist a,G €R such that 


c=aa+ Sb. 





Proof. Let a = (a1, a2) and b = (0), 62). Let ¢ = (c1,c2) be any vector. 
Suppose a, = 0. Then az # 0 (because a # 0) and b; #0 (because a | b). 


Now pick = bic2 = baci and = 
ayb, by 
0+¢ 
a vary hearin | 0 c(h). ee (Jee 
ayb, ag bi \ by gd aay C9 
ashy by 


The cases where a2 = 0, 6; = 0, or b2 = 0 are similarly handled. 


Now suppose 41, do, b;,b2 #0. Since a + b, we have a;/a2 # b;/bo and thus a,b - ab, + 0. 


boc, — 6 — 
Now pick 2 peel = OU. ancl eee 
a bo = ab, a,b _ ab, 
Then iaob= boc — byc2 [ ay " a1C2— A2C1 [| by 
aby — arb \ ay ayby —azb \ by 


01 b9C, —axb[Co+axb7C — adic} 

















a b2 — aby Cy 
= = = Cc. 
aabyCy — a2b1 C2 + a1 b2C2 =aeb5C, C2 
aybg — aad, 


Fact 126. Suppose v is a line’s direction vector. Then 


u is also that line’s direction vector 





Proof. Suppose | is the line described by R = P+ Av (A€R), where P € R” is some point. 
Let A and B be distinct points on /. Since A, B € 1, there are distinct real numbers a and 
8 such that A= P+av and B= P+ Bv. 

Thus, AB = (G-a)v, where G-a#0. We have just proven that any direction vector of I 
must be a non-zero scalar multiple of the vector v. 


We next prove that any non-zero scalar multiple of the vector v must be a direction vector 


of 1. Lett k #0. Let C=A+kv=P+(at+k)v. Observe that Cel. Thus, kv = AC is a 
direction vector of 1. 
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144.3. Scalar Product 


Definition 311. Let u = (u1,u2,...,Un) and v = (v1, V2,...,Un) be vectors. Then their 
scalar product u-v is the number defined by 


n 
u:-V= es 
i=1 


Fact 129. Suppose a, b, and c are vectors. Then 


(a) a-b=b.-a. (Commutative) 
(b) a-(b+c)=a-b+a-c. (Distributive over Addition) 





Proof, Let a= (a4, @9,..+,@,), b= (bt, ba,....50,), atid C= (Cy, Ca, 4.46%). 





Then sboy ae ab a 
i=l i=l 
And, a(ba ees ative) = ate yea eae 











i=1 i=1 i=1 


Fact 130. Suppose a and b be vectors and cé€ R be a scalar. Then 


(ca): b=c(a-b). 





Proof. Let a = (a1, a@9,...,@,) and b = (01, bo,...,b,). By Fact 116(b), we have 


(ca): b=)" (en,) by =e aih = e(a-b). 


A vector’s length is the square root of its scalar product with itself: 

















Fact 131. Suppose v be a vector. Then |v| =\/v-v and |v| =v-v. 





Proof. By Def. 302, |v| =\/ >> v?. By Def. 311, v-v = }° uv; = D> v7. Thus, |v| = /v-v. 











Fact 133. (Cauchy’s Inequality.) Suppose u and v are non-zero vectors. Then 





Equivalently, —jul|v|<u-v<|ulfy] or (u-v)’<|ul’ vp. 


Proof. Let x€R and § = |u+zv|’. Write 
S=|lut2v) =(u+cv)-(utzv) =u-ut+2(u-v)c+(v-v) 2? =|ul +2(u-v) 24 |vf 22. 


Observe that S is a quadratic expression in x. Moreover, S' > 0. So, its discriminant must 
be non-positive. That is, (2u-v)* - 4{v|’ Jul’ < 0. 
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2 
Sane <1. Now take square roots to get -1< a I, 
ful" |v| ful |v] 











Rearranging, 





Fact 135. Suppose u and v are non-zero vectors. Then 


(a) avi u and v point in the same direction; 
u| |v 


u-v Oe : : : 
u and v point in exact opposite directions; 


ful fy] 


u:v amit 
jul v| : 
u:v 


rae u and v point in different directions. 
ul |v 





Proof. We first prove <= of (a). If uand v point in the same direction, then by Definition 
137, there exists k > 0 such that u = kv and so, 


u-v_— (kv)-v_k(v-v)_ kly|iy} 


J 








fully] aviv] UAlivilvl lviiv] 


The proof of <= of (b) is very similar. If u and v point in the exact opposite directions, 
then by Definition 137, there exists k <0 such that u = kv and so, 


u-v (kv)-v_k(v-v)_ klyi|fv] 


lulfvl  lAviivlIAlIviIvl -&Ivi Iv 


—1 "A 








In the remainder of this proof, we prove ==> of (a) and (b). 


We first show that if u tv, then u-v # +|u||v|. We'll use the same idea that was used in 
the proof of Cauchy’s Inequality: 


If u + v, then u # xv for all ee R. So, u- rv #0 and ju- xv|>0. Thus, 
S=lu-av) =(u-2zv)-(u- xv) = jul’ -2(u-v) x4 |vf 2? >0. 


Observe that S is a quadratic expression in x, with positive coefficient on x”. And since 
S>0 for all x, its discriminant must be negative, 


(-2u-v)? —4|v|" Jul’ <0. 


Rearranging, we get (u-v)° < |ul’|v|” and thus u- v ¢ +|ul|v}. 

We’ve just shown that if u { v, then u-v # +|/ul|v|. And so, by the contrapositive, if 
u-v=+|ul|v|, then u || v. 

Now, if u-v # |u||v|, then by <— of (a), u and v do not point in the same direction. 
Thus, if u- v = —|uj|v|, then u and v must point in the exact opposite directions. J 


Similarly, if u-v # -|ul|v|, then by <— of (b), u and v do not point in the exact opposite 
directions. Thus, if u-v =|ul||v|, then u and v must point in the same direction. ¥ 
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144.4. Angles 


Definition 312. The standard basis vector in the ith direction (or ith standard basis 
vector), denoted e;, is the vector whose ith coordinate is 1 and other coordinates are 0. 


Definition 313. The ith-direction cosine of the vector v = (v1, V2,---, Un) is this number 
Ui 
lv] 

Fact 271. Suppose 0 is the angle between a vector v = (v1, V2,-.-,Un) and e;. Then 


U; 
cos = —. 


Iv| 





Ge l+ Va 0 j 
Proof. By Definition 145, cos6 = St = Dei Yi on 
Iv| le; [v|-1 [v| 

















The next Fact says that given two lines, we can choose any direction vector for each and 
the calculated angle between the two chosen direction vectors will, as expected, be fixed: 


Fact 272. If one line has direction vectors u, and v,, while another has ug and vo, then 


[uy ; Up| _ |v -Vo| 





juy|fus| vil {vel 





Proof. There exist non-zero real numbers and jz such that u, = Av; and up = ve. So, 











Corollary 23. Suppose 0 is the angle between two lines l, and ln. Then (a) 0=0 <=> 
ly || lo; (b) C= m2 ee 


Proof. Suppose the two lines have direction vectors u and v. 





2g EM ju-v| 








(a) 0=0 <= cos =0 <> ——=cos0=1 <> u-v=+|ul|v] — ullv <— 
jul {v| jul [v| 
ly || lo. 
(Oe ese Ee Ye es. dey eS 
2 jufv| 2 ful [v| 2 














ly df Io. 
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144.5. The Relationship Between Two Lines 


Fact 139. Suppose two lines are ... 
(a) Identical. Then they are also parallel. 


(b) Distinct and parallel. Then they do not intersect. 


(c) Distinct. Then they share at most one intersection point. 





Proof. (a) If two lines are identical, then they also share a direction vector, so that by 
Definition 150, they are parallel. 


(b) Suppose two lines are parallel. Then by Definition 150 and Fact 126, they share some 
direction vector u. 


Suppose also that they intersect at some point S. Then both lines can be described by 
— 

r= OS + Au and are identical. 

Thus, if two parallel lines are distinct, then they cannot intersect. 


(c) We already showed that two distinct and parallel lines do not intersect. We now show 
that two distinct and non-parallel lines share at most one intersection point. 

Suppose for contradiction that two distinct lines share two distinct intersection points P 
and Q. Then PQ is a direction vector of both lines. Thus, both lines can be described by 
r=OP+ \PO and must thus be identical. 

















Proof. Suppose two lines are described by 


Yr = (pi, p2) + A (U1, uz) and r= (qi, 42) + (U1, v2) (A, €R). 


By definition of a line, at least one of u; or ug must be non-zero. So, suppose without loss 
of generality that u,; #0. Now, 


e If v, =0, then va #0 and so uyvq -— Ugvy = Uy vq # 0. 
e Ifv, #0, then u2g/u1 # v2/v1 (because the two lines are not parallel) and so ujv2—- U2 # 0. 


Thus, uyv2 - uv, # 0. The reader can verify (through rather tedious algebra) that the two 
lines intersect at these parameter values: 


_U(me-®)t+wla-P)y gg 
U1 V2 — U2QVI U1 


> 


This intersection point is also unique because by Fact 139, two lines can share at most one 
intersection point. 
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144.6. Lines in 3D Space 


Fact 155. Suppose the line | is described by r = (pj, po, p3) + A(V1, v2, V3) (AER). 
(1) If v1, v2,v3 #0, then 1 can be described by 
saa I res aa 
U1 V2 U3 


(2) Ifv, =0 and v2, v3 #0, then l is perpendicular to the x-axis and can be described by 


(3) If v2=0 and v1, v3 #0, then l is perpendicular to the y-axis and can be described by 


L-~Pi, 2-Pp3 
Yy = po and = : 
V1 U3 


(4) If v3 =0 and vy, v2 #0, then | is perpendicular to the z-axis and can be described by 


t-Pi _Y—P2 
U1 U2 


z=) and 
(5) [fv ,v2 =0, then | is perpendicular to the x- and y-axes and can be described by 


t= pi Y = P2. 


(6) If v,,v3 =0, then l is perpendicular to the x- and z-axes and can be described by 


t= pi & = p3. 


(7) If v2,v3 =0, then | is perpendicular to the y- and z-axes and can be described by 


Y= Y = 2. 





1 2 3 
Proof. Write L=p,t+ Avy, y=po+Avg, and z=p3 + Avs. 


oe = 
Now, v1x = minus vpx = yields 
4 
ULY — U2 = V1 (po + Av) — V2 (Pit Avi) =Uip_-vpi or Ua (Z@— pr) = V1 (y- pa). 
seat a 2 ae -_ 
Similarly, vox = minus v3x = and v,x = minus v3x = yield 


v2(z—ps)=v3(y-po) and =u, (z= ps) $ v3 (w= pi). 


(Proof continues below ...) 
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(... Proof continued from above.) 


(1) If v1, v2, v3 #0, then divide = by vjv2 and 2 by vgv3 to get 








iP UP aul APs UP 
V1 V2 U3 v2 


(2) If v, =0 and v9, v3 # 0, then « becomes x = p, and divide 2 by vgv3 to get 


Z-p3_Y-P2 
U3 V2 


Since (0, v2, v3)-i=0, / is perpendicular to the x-axis. 


(3) If v2 =0 and 2, v3 #0, then 2 becomes y = po and divide 8 by vj v3 to get 





Since (v1, 0,v3)-j =0, 1 is perpendicular to the y-axis. 


(4) If vs =0 and 2, ve #0, then © becomes z = p3 and divide < by vj v2 to get 


Pi _Y-P2 
U1 V9 





Since (v1, v2,0)-k =0, / is perpendicular to the z-axis. 


(5) If v1, v2 =0, then = and 2 become 


r= pj and Y = po. 
Since (0,0,v3)-i=0 and (0,0,v3)-j=0, / is perpendicular to both the z- and y-axes. 


(6) If v1, v3 =0, then = and £ become 
L= py and Z>=Ds. 


Since (0, v2,0)-i=0 and (0,v2,0)-k =0, 1 is perpendicular to both the x- and z-axes. 


(7) If v2,03 =0, then 2 and 2 become 


y= fe and Zz = p3. 














Since (v,,0,0)-j =0 and (v,,0,0)-k =0, / is perpendicular to both the y- and z-axes. 
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144.7. Projection Vectors 


Fact 273. Jf u and v are non-zero vectors, then (u-¥)¥ is the unique vector such that 


(a) (u-¥)¥ | v; and (b) u-(u-¥) Lv. 





Proof. We first verify that (u-¥) ¥ satisfies (a) and (b): 
(a) Let @ be the angle between (u-¥) ¥ and v. Then 


[(u-v)V¥]-v (u-v)(W-v) uv ae. 





C0805 a Lee rn 
(uv) vfv]  fu-v|fy|fy] luv 


Thus, 0 =0 and (u-¥)¥ || v. 


(b) By Definition 146, u-(u-¥)¥ Lv <> [u-(u-t)¥]-v=0. But this last equation 
is true, as we now verify: 


[u-(u-¥v)V]-v=u-v-(u-v)¥-v -wv-(u-%)(*-v} 


i 


vy (u-v)|vP=0 Vv 


We now show that no other vector satisfies (a) and (b). 


Suppose w is also a vector that satisfies (a) and (b). That is, 
(a) w || v; and (b) u-wiv. 


Since w || v, there exists A # 0 such that w = Av. And since u- w 1 v, we have 














(u-Av)-v=0 or u-v-Av-v=0 or u-v=Av-v or ne 
Viv 
Thus, w=\v=——~v= (u-¥)¥. 
Viv 


Fact 274. (Lagrange’s Identity) Suppose aj, dg, a3, bi, b2, by € R. Then 
(a) (aj ot az) (b; at b3) = (a,b; st agby)” = (aybo = ayb,)°: 


(b) (a? + a3 + a3) (b7 + B3 + 2) — (ayby + agby + agbg)” 
= (ay be = Gab) oF (a3by = Giese oF (agb3 = Ge. 








Proof. We proved (a) in Exercise 242(e) and (b) in Exercise 268(e). 
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Fact 149. Suppose a and b are vectors. Then 


lrej,al = la x b| 





Proof. By Pythagoras’ Theorem (Theorem 17), 


: /,_ 2 = id 
Irejpal = la| — |proj,al’. 


We will prove this claim twice, once in the 2D case and again in the 3D case. In each case, 
we will use Lagrange’s Identity (LI). 


2D case. Let a = (a1, a2) and b = (6), 62). Then 


2 
ab 1 + Agbo 
iejpal = VlaP—[projyal? = | 2+ a3 - See aaba) 


b? + b3 


(a2 + a2) (b? + b8) - (a,b, + agb2)” 
b? + b3 
x bl. 


tt _ | (ayb2 — agb,)” _ |aib2 = azbi| — agb)| =|a 


~ b? + b3 \/b? + BB 





3D case. Let a = (a), a2,a3) and b = (bj, bo, b3). Then 


2 (a,b, + abe + a3b3)” 
lrej,al = \V/ lal - |proj,al = a? + a2 + a2 - 


b? + b2 + be 





(a? + a2 + a2) (b? + b2 + b2) — (aby + agby + 4363)” 





b? + b3 + b3 











\/ bt + b3 + 8 Ib] 


ur V (a2b3 - azby)” + (abi — a1b3)” + (ayb2 — agb1)° _jaxbl _ | P 
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144.8. The Vector Product 


Fact 146. Let a and b be non-zero vectors. [fax b=0, then a || b. 


Proof. Let 6 be the angle between a and b. If ax b = 0, then by Fact 148, Jax b] = 
|a| |b] sin 6 = 0. Since |a| + 0 and |b] #0, we have sin@ = 0 and thus a || b. 














Fact 157. Suppose a, b, andc are vectors, witha tb. Then 





c||axb —. Gila, be 


Proof. <= follows from Fact 275 (below). 


For = = , suppose c || ax b. Then there exists k # 0 such that c = k(axb). So, 
c:a=[k(axb)|]-a=k(axb)-a=0 and thus c 1a. We can similarly show that c-b = 0 
and thus c 1 b. 














Fact 275. Suppose a, b, c, and d are non-zero vectors witha + b. Ife 1 a,b and 


dia,b, then c || d. 





Proof. Let a = (a1, 42,43), b = (b1, bz, b3), € = (C1, 2,3), and d = (dj, ds, dz). 
Since c L a,b, we have c-at0ande-b20. Or, 
C1Q1 + C249 + C303 20 and C101 + Cabo + c3b3 2; 
Let x = agb3 - a3bo, y = a3b; — a,b3, and z = a,b2 — ab, so that ax b = (2, y, z). 
Now, b,x + minus a,x 2 yields 
0 = by (cga2 + C343) — ay (Cab2 + c3b3) = —C2z + Cay or Coz 3 C3Y. 
Similarly, box + minus 2X 2 and b3x + minus a3x 2 yield 
4 5 
Cea ete and Cry = Cot. 
Since d 1 a,b, we have, similarly, 
6 7 8 
doz = dgy, djz=d3x, and dyy=dox. 


We will break down the remainder of the proof into four cases, depending on whether any 
of x, y, and z are zero. We will show that wherever no contradiction arises, c can be written 
as a non-zero scalar multiple of d, so that c || d. 


(Proof continues below ...) 
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(... Proof continued from above.) 


Case 1. All of x, y, and z are zero. 


Then a x b=0, so that by Fact 146, a || b, contradicting our assumption that a } b. 


Case 2. Exactly two of x, y, and z are zero. 
6 


Suppose x = y=0 and z #0. Then from 5 . =, and £ we have c, = Cc) = 0 and d, = dy) = 0. 


And now, c can be written as a non-zero scalar multiple of d: 


Cl 
C3 
Cc = C2 = = 
ds 
C3 C3 3 


Case 3. Exactly one of x, y, and z is zero. 


Suppose x =0 and y,z #0. Then from 4 and im we have c; = 0 and d, = 0. 


Note that co # 0, because otherwise, from 3 we have c3 = 0 and now c = O, contradicting 


our assumption that c is non-zero. Similarly, c3, do, d3 # 0. 


And now 2 divided by 2 yields: oe 
dy dz 


And so again, c can be written as a non-zero scalar multiple of d: 


ds 


Ct 0 
C=] G@ [=] © dg dy 
C3 C3 3 
Case 4. None of x, y, and z is zero. 
Then 2 divided by £ yields: ae 
dy dz 
seed A sic 7 3 qj & 
Similarly, = divided by = yields: —=—. 
d, dz 
Cj C9 
Th —_— = — = —, 
us, a 


Cj dy 
C3 
C= C9 = -— 
ds 

C3 d3 
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144.9. Planes in General 
The definition of a plane, reproduced: 


Definition 175. A plane is any set of points that can be written as 


R:OR-n=d or {Rornwe=d}, 


where n is some non-zero vector and d¢€ R. 


Fact 158. Let A and B be distinct points and q be a plane. If q contains A and B, then 
it also contains all the points in the line AB. 





Proof. Suppose the points are A and B, so that the line AB may be described by r = 
OA+AAB (\€R). 
Suppose the plane can be described by r-n = d. 


We will prove that any point on the line AB is also on the plane qg. (We will thus have 
shown that q contains the line AB.) To do so, we need merely verify that the generic point 


r= OA+AB on the line AB satisfies q’s vector equation: 


r=(OA+)4B)-n -|OA+\(OB-OA)|-n 





204i Ob iis 10 nse hd 














Fact 159. Ifq= {R 1OR n= d} is a plane, thenn 1 q. 


Proof. Let v be any vector on gq. Then there are points A,B € q such that v = AB. Since 
A,Béq, we have OA-n=d and OB-n=d. 


So, n-v=n-AB=n-(OB-OA)=n-OB-n-OA=d-d=0. 


We’ve just proven that given any vector v on qg, we have n-v = 0 or equivalently n 1 v. 
Hence, by Definition 177, n is a normal vector of q (i.e. n 1 q). O 


Fact 160. Let q be a plane and n and m be vectors. Supposen tq. Then 





10) | ——— sero 


Proof. By Definition 177, n 1 qg <=> n-v=0 for every vector v on q. 
By Definition 138, m ||/n <—» m=kn for some k ¢ 0. 


And so, for any vector v on qg, we have 


m-v=(kn)-v=k(n-v)=k-0=0. 














Thus, by Definition 177, m 1 q. 
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Fact 163. Let q be a plane and P and R be points. Suppose Pe q. Then 


ay 
Req ihe wector PR 1s on @: 





Proof. Let n be a normal vector of g. Since OR = OP + PR, we have 





OR-n=(OP+PR)-n=OP-n+PR-n. © 
And now, Req 
=> OR-n=OP-n (Definition 175) 
= PR-n=0 (©) 
=> PRin (Definition 146) 
—> PR is on q. (Fact 162) 











...,2%) is a non-zero vector, then 
seal 
k 
v2 
: es =p. 
: i=l 


Uk 

















k 
Proof. By Definition 311, (21, 22,...,2%)- (m1, 72,---,7) = yee. 
i=l 





Here is the general Definition of a two-dimensional plane in R”: 
Definition 314. A two-dimensional plane in R” is any set that can be written as 


{R: OR =OA+Xu+ pv (A,peR)}, 


for some point A and some non-parallel vectors u and v. 
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Fact 277. In 3D space, Definitions 175 and 314 are equivalent. 


Proof. Corollary 33 shows that Definition 175 implies Definition 314. 
We now show the converse. Let g = {R :OR=OA+ ut uv (A, me R)}, for some point A 


and some non-parallel vectors u and v. 
Let n = ux v (which is non-zero because u } v), d= OA-n, and s={1R:OR-n=d}. We 


will show that (a) Peq == Pes; and then (b) Pes = > Peg. We will thus have 
shown that s =q. 


(a) Suppose P € q, ie. OP= OA+au+ bv for some a, 3 €R. Then 
OP -n=OA-n+au-n+ Bv-n=d+0+0=-d. 


So, Pes. 
(b) Now suppose P € g, i.e. OP -n=d. Rearranging, OP -n=OA-nor AP-n=0. And 
so by Corollary 32, AP is on g. By Theorem 19 then, AP can be written as the linear 


combination of u and v. That is, there exist real numbers a and ( such that AP = aut Bv. 
=_—> oc > 
Rearranging, OP = OA+au+ bv. So, Pe gq. 














Fact 162. Suppose q is a plane with normal vector n. Then 





Vv => Vv 1s On q. 


Proof. Let n = (n1,no,...,nN~) 1g and suppose v 1 n. Our goal is to show that v is on q. 


Suppose q is described by r-n +d. Since n# 0, pick any n; #0. Let P be the point whose 
ith coordinate is d/n; and other coordinates are 0. Then P € q because 


Next, let Q = P+v. Then we also have Q € q because 


OQ-n=(OP+v)-n=OP-n+v-n=d+04d, 














Since P,Q €q and v = PO, v is a vector on the plane. 


Fact 278. Let m be a vector and q be a plane. Suppose m1 q. Then there exists e € R 
=> 
such that OR-m =e for all Req. 


Proof. Let A, R€q and e=OA-m. On the one hand, AR-m = 0 (because m 1 q). On the 
other, AR-m = (OR-OA)-m=OR-m-OA-m. Thus, OR-m=OA-m =e. 
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Suppose n 1 gq. By Fact 160, m || n — > mig. The converse is also true:?”? 


Theorem 18. Let q be a plane and n and m be vectors. Suppose n 1. q. Then 


meg in || 1 





Proof. Let q = {R:OR-n= dh, n = (nj, 72,...,N%), and m = (m1, mMo,..., Mz). Suppose 
m iq. By Fact 278, there exists some e € R such that for all Re q, OR-mie. 


We will use Lemmata 6 and 7 to prove Theorem 18: 


Lemma 6.7;=0 <> m,; = 0. 


Proof of Lemma 6. Suppose n; = 0. Since n + 0, there exists some j for which n; #0. 


Let we R. Let S,, be the point whose ith coordinate is w, jth coordinate is d/n,;, and other 
coordinates are 0. Then 5S, € g because 


oe a 
OS a° t= wine > —ha + »: 0-n, =0+d+0=d. 
mg le{i.j} 


Te 1 
Since S,,, € gq, we have OS,,-m =e or 


—— d d d 2 
OSy:m=wm+—mj+ YL 0-m = wm + —m,; +0= wm; + —m, =e. 
My le{i,j} mG iy 


Since 2 holds for all w « R, it must be that m, = 0. 


The proof that m;=0 = > n; =0 is similar and thus omitted. 


Lemma 7. n;#0 ==> mjd/nj; =e. 


Proof of Lemma 7. Suppose n; # 0. Let Q be the point whose ith coordinate is d/n; and 
other coordinates are 0. Then Q € gq because 

















O6-n==n.+0-m=d+0=d. 


a l#i 














Since Qe g, OQ:m=e or OQ-m = —m; + 9>0-m = —m; + 0 = —m; =e. 
1h; I#i Thy; TY; 


We’ve completed our proofs of Lemmata 6 and 7. On the next page, we continue with our 
proof of Theorem 18. 


(Proof continues below ...) 





Note that this is really just a restatement of a fundamental result from linear algebra. The proof is 
rather long, but uses only material and language we’ve already covered in this textbook. 


1633, Contents www.EconsPhDTutor.com 


(... Proof continued from above.) 
We will show that whether (a) d #0; or (b) d=0, we have m || n. 


(a) Suppose d #0. By Lemma 6, if n; = 0, then m; = 0, so that m; =n; (e/d). 
And by Lemma 7, if n; #0, then m; =n; (e/d). 


Hence, for all i ¢ {1,2,...,k}, m;=n,;(e/d). Thus, we may write m £ >. 


Since m ¢ 0, it must be that e #0. So, = shows that m || n. 


m . 
(b) Suppose d= 0. Since n #0, there is some j for which nj #0. Write m; 2 —2n,. 
thy 


By Lemma 6, nj #0 => m,; #0. 

By Lemma 7, m,d/n,; =e. Since d= 0, we have e £9, 

By Lemma 6, for 2 such that n; = 0, we have m; =0. And so, for such i, ™; Z ny, 

Now consider any s # j such that n, #0. Let 7’ be the point whose sth sone ake is 1,, 
jth coordinate is —n,, and other coordinates are 0. Then T'€ g because 


—=> 
OF n= ngtiet+ (ns) Ty + ms O-np =njns-— Nn; +0 =0. 
lé{s,j} 


; a 1 6 
Since T’ € g, we have OT -m=e=0 or 
———_»> 8 
OT -m=njms+(-ns)mj+ Y) O-mp=njms—ngm; +0=0. 
le{i,j} 
8 oy; 
Thus, for any s #7 such that n, #0, we can rearrange = to write m, = —Ng. 
nj 
Altogether then, 2 zo and 2 show that for all ie {1,2,...,k}, we have m; = (m,/n;) nj. 
10 1; 
m= —n. 
Nj 


Hence, 














; 1 
Since m,; # 0, ~ proves that m || 1. 
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Fact 279. The unique plane that contains the point A and has normal vector n is 





{R:OR-n=OA-n}. 


—_ —_ 
Proof. The given plane contains A, because OA-n=OA-n. 


Let v be a vector on the plane. Then there exist points P and 1Q on the plane su such that 
v= PQ=O0Q-OP. Now, v-n = (OQ - OP) -n = OQ-n-OP-n=OA-n-OA-n=0. 
Thus, v 1 n. We’ve just shown that the given plane has normal vector n. 


We now prove uniqueness. Suppose the plane {R ; OR ‘m= d} contains A and has normal 
vector n. Then by Theorem 18, m = kn for some k # 0. So, d= OA-m = OA-(kn) = kOA-n. 
And now, {R:OR-m = d} = {R:OR-kn = kOA-n} = {R:OR-n=OA-n}. 














How to go back and forth between a (hyper)plane’s cartesian and parametric forms: 


Fact 280. Let x = (21,22,...,2,) € R® be a vector. Let n=(m4,n2,...,N~) be a non-zero 
vector. Without loss of generality, suppose n, #0. Let vy = (d/n,0,0,...,0). And for 
each i € {2,...,k}, let v; be the vector whose 1st coordinate is —n;/n,, ith coordinate is 1, 
and remaining coordinates are 0. Suppose 


k 
3={x Sonat = i} and P= {xix=vi+ Sd (ayn eR) 


4=2 


eis =e 





Proof. We will show that (a) ae S implies ae 7; and (b) ae T implies ae S. 


k k 
(a) Suppose a = (a1, d@2,...,a%) € S. Then yo nia = OF a; = («- Yan) fm, 


i=l i=2 
For each 7 = 2,3,...,k, let A; = a;. Then we have 


We’ve just shown that ae T’. 


(b) Now suppose a¢€ 7. Then a = (< - ee ee ee ae Aa} for some Ag,...,A, € R. 
Ny j=Q UY 


k k 
And now, Seniai=m (2-9/4 J Soma = d. 
1 1=2 


We’ve just shown that ae S. 














The results given in this subchapter were general. In contrast, the results in the next 
subchapter will apply only to planes in R?. 
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144.10. Planes in Three-Dimensional Space 


Fact 164. [fa and b are non-parallel vectors on a plane q, thenaxb 1 q. 


Proof. Let a = (a1, 42,43), b = (6), b2, 63), and n = (n1,n2,n3g) 1 g, so that a,b 1 n, or 
ain, + dang + a3n3 20 and bin, + bone + b3n3 20. 
Now, box + minus 2X e yields 
0 = by (ayn, + agn3) — ag (b1n1 + 63N3) = (a1b2 — agb) ny — (agb3 — a3b2) n3. 


— ics 2 ft 2 a 
Similarly, b)x = minus a,x = and b3x = minus a3x = yield 


(a3b, = abs) ng — (aybe = azb;) ng = 0 and (agb3 = a3b2) ny — (a3b, = abs) ny = 0. 
aob3 = a3b> Ny (a3by = abs) n3 — (a,be = ayb;) ng 0 
(axb)xn=] agbj —ayb3 |X] no |=] (abe - gb) ny — (agb3 — agb) ng | =| 0 | = 9. 
ab» = ab, N3 (azb3 = a3b) ny — (a3by = abs) Ny 0 











Since ax b #0, n #0, and (ax b) x n=0, by Fact 146, ax b || n. 





Theorem 19. Let q be a plane anda and b be non-parallel vectors on q. Suppose c is 
a non-zero vector. Then 


cis avectoronq << > There exist A, u€R such that c= rat pb. 





Proof. In the main text, we already proved <—. Here?! we prove —>. 


Let n = (n1,72,n3) be the plane’s normal vector. Let a = (a1,d2,a3), b = (b1, 62,53), 
c = (C1, C2,¢3), so that a-n=0, b-n=0, c-n=0, and also a # kb for all k #0. 


: 1 2 3 
Write A\N1 + dgng + agng = 0, bin, + bono + b3n3=0, and cn, + cong + ¢3N3 = 0. 


Since n + 0, suppose WLOG that ng #0. Now rewrite é, Z and 2 as 


1 ayn, + AQN2 29 bin, + bene 
a3 = —-—————_ b3; = -—————-,_and__ ¢ 
n n n 
3 3 3 


3. C{N1 + CoN? 


We will use Lemmata 8 and 9 to prove Theorem 19: 


Lemma 8. (a) a; and az are not both zero. (b) b, and by are not both zero. 


Proof of Lemma 8. (a) If a1,a2 = 0, then a3 # 0 (because a # 0) and ayn, + agng + agn3 = 














0+0+agn3 #0, contradicting +. The proof of (b) is similar. 


(Proof continues below ...) 





°!Note that again, this long proof is just a fundamental result from linear algebra (applied to the 3D 
case), but written using only material and language we’ve introduced in this textbook. 
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(... Proof continued from above.) 


Lemma 9. a )b2 - agb, + 0. 


Proof of Lemma 9. Suppose for contradiction that a bg - agb; - 0. We will show that 
whether (a) a; #0 or (b) a, =0, a contradiction arises and hence a,b. - ab; +# 0. 


(a) If a; # 0, then rearranging = we have by 2 agb,/a,. If bj = 0, then by = 0, but this 
contradicts Lemma 8. So, 5; # 0. 


We now show that a = kb for k #0, contradicting our assumption that a + b: 








ayn, + are) _ 4 (0 agby _ by + (agbi/a1) “2 ) 
by ) ) 


ay N3 
b b 
2 = (b1,b9, --™ ==) 2 ~ (by, be, bs) = “1. 
1 1 


= 1 
a= (a1, 42,43) = | 1, A2,— Fa 
3 


N3 b 











(b) If a, =0, then by Lemma 8, az #0 and the same contradictions as in (a) arise. 





The proofs of Lemmata 8 and 9 are complete. We now resume our proof of Theorem 19. 


6 G1C2 ~ A2C1 (cy - bi) /a, if a, #0 


Pick jl and rz 


ab — agby (co —-pibg) Jag if ay =O. 


We now verify that Aa + wb =c, or equivalently, that 
8 9 10 
Aa, + pb, = c1, Ado + pbo=co, and Aaz + pbs = cs. 


We now show that if 2 and 2 hold, then so too does = 


12 ayn, + aan bin, + bone 
Aa3 + bs = —’A——. - pw 
3 3 
(Aa, + wb 1) ny + (Aa + ub) Ng 89 CN, + C2N2 10 : 
SS 5555555558558 5855 SS | Ss = = C3. 


3 3 
It thus suffices to show that © and 2 hold. And we now do so, in each of two cases: 
(i) Suppose a, #0. Then 8 holds: Aa, + pb, f Cy — pby + pudy 8 c,. And so too does 2. 


< a2 — a2c] ab; — aby 6 4201 A102 — A2C1 9 
Adz + by = (cy — by) = + puby = <= + pp = —— + ——— 20. 
ay ay a1 ay ay 


(ii) Suppose a, = 0. Then 2 holds: Ady + pby £ C2 — [be + jibe 2 co. And so too does 8. 














a1 1C2 azb) — a,b, — aycg = a2, — A1C2 8 
Aa, + by = (C2 — pba) — + poby = —= + pw = — + — — =). 
a2 a2 a2 a2 a2 
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144.11. The Relationship Between a Line and a Plane 


Fact 171. Given a line and a plane, exactly one of the three following possibilities holds: 
The line and plane are 


(a) Parallel and do not intersect at all; or 


(b) Parallel and the line lies entirely on the plane; or 





(c) Non-parallel and intersect at exactly one point. 


Proof. Describe the line / and plane q by 
r-p+Av and r-nd. 
To find any points at which / and gq intersect, plug + into 2 to get 
(p+Av)-n=d or p:-nt+dAv-n=d or Av-n2d-p-n. 


Thus, the intersection points of 1 and qg correspond to the values of A for which 3 holds. 
Suppose I || g. Then by Fact 170, v-n =0 and 5 becomes p:ne=d. 

(a) If p-n#d, then / and q do not intersect at any value of 4. So, | and q do not intersect. 
(b) If p-n=d, then / and q intersect at all values of A. So, | lies completely on gq. 

(c) Now suppose instead that / ++ g. Then by Fact 170, v-n #0. 


d-p-n 
And so, we can rearrange 3 to get 5 oneal es 
ven 
This shows that there is only one value of A at which the line and plane intersect. And this 
unique intersection point is given by 














d- 
p+ Av =p +——-v. 
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144.12. The Relationship Between Two Planes 


Fact 173. Suppose q andr are planes with normal vectors u and v. Then 


(a) q|lr — uly; and (bg ey 





Proof. Let @ be the angle between gq and r. By Definitions 181 and Facts 172 and 135, 


-, [u-v| ju-v| 

















(a) q\|\r <> 6=cos > ——=0 — — =cos0=1 — ull v. 
jul [v| jul [v| 
(b) gir = ee ed => eM oe 0 —> u-v=0 — ulv. 
julfv] 2 jul [v| 2 


Fact 174. If two planes are parallel, then they are either identical or do not intersect. 


Proof. Suppose two planes are parallel. Then they share some normal vector n. 
Suppose they are described by OR-n = d, and OR-n = dy. If dy = dg, then they are identical. 


— 
So suppose d; # dy. If the point P is on the first plane, then OP-n = d, # dy, so that P is 
not on the second plane. Thus, the two planes do not intersect. 














Lemma 10. Let n = (n1,7o,...,n~) and m = (m ,mMy,...,Mx) be vectors. If n 4 m, 


then there aret and j such that nim; —njm; #0. 





Proof. Suppose for contradiction that nym; — n;m; + 0 for all ae 


Pick any s such that n, + 0. Then by é. we have nym, — n,m + 0 for all i. Rearranging, 














(m,/ns) n; =m; for all i. Thus, m =(m,/n,)n, contradicting n |} m. 
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Fact 175. If two planes are not parallel, then they must intersect. 


— — 

Proof. Let the two planes be described by OR-n = dand OR-m = e, where n = (nj, n2,.-., Nk), 
m = (m1,m2,..., ms), andn + m. By Lemma 10, there exist 7 and j such that nymj;—njm; # 
0. And since nym; —njm; #0, at least one of n; or n; must be non-zero. 


Suppose without loss of generality that n; #0. Let P = (p1,po,...,pr) be the point with 


en; — dm; a= Din; 
Dj = aa 


=, , and p,=0 for all/¢ {i,7}. 
NGM; — LM; Nj; 


We now verify that both planes contain the point P: 


——> 
OP-n=) pm =pinitpjnj+ >> pin 
le{i,j} 


_ d= pjnj 


nm +pjinj+0=d—-pjn; + pn; = d, J 
ry 


ane d-p;n; 
OP-m = Di pim = pimi t+ pjmy + DY) prim = ——*-*m; + pjmj + 0 


le{i,j} : 


_ dm; + pj (nimj-—njm;) — dm, + en; — dm; 











ac J 





ry; uz) 


Fact 176. Suppose two non-parallel planes have normal vectors n and m. Then their 
intersection is a line with direction vector n x m. 


Proof. Here (Appendices) we’ll actually go a little further by fully specifying the line along 
which the two planes intersect. 


Let the two planes q; and gq be described by OR: n=d and OR-m =e. Let P be the point 
constructed in the proof of Fact 175. 


Then, we claim, g; and q intersect at the line described by 
r=OP+\nxm (A €R). 


To prove this claim, we first verify that q; and q. contain the above line. To do so, plug 
the generic point of the above line into each plane’s vector equation: 


(OP + nx m)-n=OP-n+(Anxm)-n =d+0=d, Vv 
(OP + nx m)-m=OP-m+(Anxm)-m=d+0=d. "A 
Next, let S € q Nq2 with S # P. We will prove that S' is on the given line. 
Since P, S € q1 Nq2, we have PS in,m. And so by Fact 157, PS || a xm. That is, 


Po 2a torsonie. wo 














Rearranging, we have Os = OP +Anxm. Hence, S is on the given line. 
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144.13. Distances 


As mentioned in Remark 209, in 2D space, the hyperplane r-n = d describes a line. So, 
Fact 178, which applies generally to n-dimensional space, can actually also be applied to 2D 
space to prove the following two results, which are now reproduced from our Appendices 
for Part I (Functions and Graphs): 


Proposition 3. The unique point on the line ax + by+c=0 that is closest to the point 
(p,q) ts 


B ( ap+bq+e 2 
= 8 ee 
- e+e 4 a? + b? 





Proof. Replace n, d, and A in Fact 178 with (a,b), -c, and (p,q) to get 


_d-OA-n | —¢- (p,q)* (a, 6) | -c-ap-bq__aptbgte 
InP a2 + b? t+ a + BP 





k; 


So, by Facts 178 and ??, the point on the given line that’s closest to the given point is 


ap+bq+e 
B= A+kn=(p.q)- PIO (a,b) =(p-a 





ap+bq+e pe) 
e+e 4 a? + b? 











Corollary 3. The distance between a point (p,q) and a line ax + by+c=0 is 


lap + bq + c| 


Jae +h 





Proof. Continue with the above proof and apply Fact 178: 




















=). 7 _ap+bqt+e 7... lap + bq + c| 
[AB] = |x| In| = Pap VO +P = 
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144.14. Point-Plane Distance Using Calculus 


Example 1578. Let A= (1,2,3) be a point, g be the plane r-n=r-(1,1,1)=3, and B 
be the foot of the perpendicular from A to qg. Find B and |AB | (the distance between A 
and q). 

In Ch. 77, we already showed that B = (0,1,2) and |AB| = V3. We'll now show this 
again using calculus. 

First, write g into parametric form: R = (0,0,3) +. (1,-1,0) + w(0,1,-1) (A, u€R). 
The distance between A and any arbitrary point R €q is 


JAR Say Ova Ly (4 2) 4 ny = 0/2 2? ED a a 


The values of A and p that minimise |AR| correspond to the point B. Our goal then is 
to find these values. We’ll do so using calculus—this will be very similar to what we did 
in Chs. 62 and 69, one difference being that we'll take two derivatives w.r.t. A and [. 


Another difference is that these derivatives are partial derivatives. Loosely, when 
taking a partial derivative with respect to a variable, we treat any other variable as a 
constant. So, 


(2? + 2? + 2 — 4p — 2A +5) =4A+2-2y, 


(2? + 2? +2 —4ye— 2A +5) =4-4- 2). 
LL 


Setting these last two expressions equal to zero, we get these First Order Conditions: 


a 2 


4\+2-2n] 5,20 and 4y-4-2d) 5 = 0. 


Take 2 plus 2x £ to get \=0 and ji=1. Hence, 


B= (0.0.3)4) (1.0) + 20, P21) = (0,3) 0410) 4 10,11) 16,12) 


And, |AB| = / 202 + 2p? + 24 - 4fi - 2ft + 5 = V0+2+0-4-045 = V3. 


Happily, these results are the same as before. 





Note that this textbook does not explain why the above method works. We will merely 
note that the intuition for why it works is similar to that given for calculus (of a single 
variable) in in Part V (Calculus). 
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144.15. The Relationship Between Two Lines in 3D Space 


Fact 181. Suppose l, and lg are distinct lines described by R= P+au and R=Q+ Bv 
(a,G¢€R). Then exactly one of the following three possibilities holds: The two lines are 


(a) Parallel and do not intersect; moreover, the unique plane that contains both lines is 
Gz = {R: R= P+du+pPQ (Q.neR)}, or 


(b) Not parallel and share exactly one intersection point; moreover, the unique plane that 
contains both lines is described by m={R:R=P+utpv (A,peR)}; or 


(c) Skew (i.e. neither parallel nor intersect) and are not coplanar. 





Proof. (a) Suppose 1, || lz. Then by Fact 139, they do not intersect.??? So, u 4 PO.5% 
By Fact 169, gq is the unique plane that contains P, u, and PO. 
Set 4 = 0 to see that gq, contains |). 


Next, since u || v, /2 can also be described by R = Q+ Gu (3 €R). Now set p = 1 to see 
that gq also contains ly. 


In the remainder of this proof, we’ll suppose instead that 1, + lo. 


Then u + v and by Corollary 33, q is the unique plane that contains P, u, and v. So, q 
is the only possible plane that contains both /; and lo. 


Set 4 =0 to see that q contains 1}. 


By Fact 139, l, and ly intersect at most once.?”4 


(b) Suppose J; and ly share an intersection point S. Then PS is a direction vector of 1. 
—_ 
So, PS || v. Hence, 


gy = {R: R= P+d\utpv (A,#e€R)}={R: R= P+Xu4 piv + pePS (A,jn,H2€R)}. 


Set v2 =1 and A=0 to get R= P+ pvt Ps=53 jrv and see that q contains lp. 
Thus, q is the unique plane that contains both J; and ly. 


(c) Suppose q contains 2. Then there exist \ and fi such that 
Q=P+ \u + [iv. 
Now, consider the point T' = Q - fiv € ly. It is also in 1; because 
T=Q-fw=P+u+ fv-fv=P+du. 


We’ve just shown that if q contains lj, then J; and ly intersect. 














So, if J; and J, do not intersect (and are thus skew), then q does not contain Io. 





°®2This eliminates the possibility that the two lines are parallel and intersect. 
—d, _ _ 
3Tf u || PQ, then ku = PQ for some k € R and Q = P+ PQ = P+ku is on 1,, so that the two lines 
intersect. 
°°4This eliminates the possibility that the two lines are not parallel and intersect more than once. 
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144.16. A Necessary and Sufficient Condition for Skew Lines 


Fact 281. Suppose the lines l, and ly are described by r = OP + u andr = OO + Av 
(A€R). Then 





l, and ly are skew PQ -(uxv) #0. 


Proof. (\impliedby) If PO: (ux v) #0, then ux v #0, so that by Corollary 25, u + v and 
Ly + do. 

Suppose for contradiction that 1, and l, intersect at some point S. Then there are numbers 
a and 6 such that 


S=P+au=Q+ bv or PQ = au - bv. 
And so, PQ-(uxv) = au: (ux v)-Bv-(uxv)=0-0=0. 


But this contradicts PQ: (u xv) #0. So, 1; and ly do not intersect. 
Since J; and Jz are non-parallel and do not intersect, they are skew. 
(= = ) Now suppose PQ -(uxv) =0. 


If PO = 0, then P=Q, so that 1; and J, intersect and are not skew. And if ux v = 0, then 
by Corollary 25, 1; and ly are parallel and are again not skew. 


So, suppose PO, u xv #0. Then by Corollary 25, u } v. 


Also, PO 1 (uxv). By Corollary 32 then, PO lies on the same plane as u and v. 
And so by Theorem 19, there exist a and (@ such that 


PO =au+ bv. 
Now, let g be the plane described by 
{R:OR=OP +du+ pv} (A, ue€R). 
Clearly, q contains 1, (to see this, set = 0). It also contains the point Q, because 
OQ = OP + PO = OP + au + bv. 


Hence, q also contains Ip (to see this, set \ = a). 











We’ve just shown that /; and Jz are coplanar. And so by Corollary 36, they aren’t skew. 
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145. Appendices for Part IV. Complex Numbers 


Fact 282. Suppose ae R andb>0O. Then 


(a) The two square roots of a+ ib (i.e. solutions to x* = a+ ib) are 


2 y Vaz+b2+a+ti Var + a). 


(b) The two square roots of a-ib (i.e. solutions to x* = a—ib) are 


2 (Var VaP-i a-Va®=B). 





VFB + a-(VE+P a) +3 (Vit) (Vata) 
2a + iV? +P a?) = a+iVi? = a+ ib. 


fos VE a Varo 2iy/ (a+ VeP=B) (a- Vet F) 


[20 - ia? = (a? = 8)] =a-iVb? =a-ib. 


NE en | 


(b) 


Nl N|e ee 

















Fact 283. Suppose a,b€ R with b+0. Then the two square roots of a+ bi are 


2 (War Prarie Vita), 





WO (Ware Os Pravin Vitv ba) 


5 |\vee av+b+a- sarit-ded (Vora) (Vata) 
1 
2 


[| 
|b] = a +ib. 














(24 +215 VEPE) = avi 


Lemma 11. If p,q¢C, then (p+q) =p*+q* and (pq)* =p*q". 


Proof. Let p= (a,b) and q = (c,d). Then 
(p+q)* =(at+c,b+d) =(at+c,-b-d) = (a,-b) + (c,-d) = p* +q". 





(pq) * =[(a,b) (c,d)]* = (ac — bd, ad + bc)” = (ac — bd, -ad - bc) = (a, —-b) (c,-d) = p*q". 
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Theorem 21. (Complex Conjugate Root Theorem.) Suppose co,c1,..-,Cn € R. If 


n-2 





Z=a+ib solves C,u" + Cp10"* + Cn_ot™ 2 +--+ +14 + Cy = 0, then so does z* = a—-ib. 


Cr (a + ib)" =0. 


[Js 


n 
Proof. We are given that a+ ib solves >, c,x* = 0. Or equivalently: 


k=0 k=0 


Taking conjugates of both sides of a we have b cr (a + ib) | *=0* 20. 
k=0 


We now apply Lemma 11(a) and (b) to show that b cr (a + ib) al 2 S) ce (a- ib)”. 
k=0 k=0 


Ct (ait) ®@ Y feg (a +id)']* @ Yeu" [(a + iby!) * 


k=0 k-0 


cy (a — ib)”. 


I 
= 
ie) 
= 
| es | 
o— 
S 
+ 
e 
oa 
—— 
= 
| 
* 
wT 
Mea 
S 
oa 
eo" 
— 
S 
+ 
i 
oO 
a 
* 
| 
= 
l 
Ma 


k=0 k=0 k=0 


Together, 2 and 2 show that > Cr (a - ib)" = 0. Hence, a —ib also solves > cer” = 0. 
k=0 k=0 














Fact 190. Suppose z is a non-zero complex number with |\z| =r and argz=0. Then 


z=r(cos@+isin@). 





Proof. Let z =a+ib. Then by Definitions 193 and 194, we have 


cos” il C st if b> 0, 
z a* +b 
re|z)=Ve2+P and 6 2argz= 
2) | a 
—cos ~ — = -cos - ————., if b< 0. 
lz| Va? +b? 
oe 3 |rcos@, if b> 0, 
Rearranging =, a= 
r cos (-0) =rcos8, if b< 0. 


3 eee 1 
So, a=cos@. Plugging = into =, we have 
; 4 ; 
r=Vr2cos?0+b2 <> r*=r?cos?64+h? <> B=r’sin?6 <> b=ersind. 


Observe that b>0 <=> sin@>0andb<0 <=> sin@é<0. That is, sin@ has the same sign 
as b. Hence, we can discard the negative value in = to get b 2 rsin§. 














And now by 3 and 2 the result follows: z =a+ib=r(cos@+isin6@). 
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Theorem 22. (Euler’s Formula) Suppose @¢R. Then e® = cos6 + isin. 


Proof. Define the function f :R—C by 6+ e” (cos@ +isin@). Then” 


f' (0) = (-i)e® (cos@ + isin 6) +e? (—sin@ + icos 6) 
=e (-icos 6 + sin) +e? (-sin6 + icos@) = 0. 
But by Proposition 8, the only functions whose derivatives are zero are constant functions. 
Thus, e’ (cos @ + isin@) = C for some constant C. 
To find what C is, plug in 0 = 0 to get C =e? (cos0 +isin0) = 1-(1+0) =1. 


Thus, e~’ (cos@ + isin@) = 1. Rearranging, e'’ = cos@ + isin 8. 














Lemma 12. Suppose x €[k7,(k+1)7], where ke Z. Then 


x — kr, for k even, 


(k+1)na-2, for k odd. 


cos! (cos x) = 





Proof. First, note that x - ka and (k+1)7-~2 are both in [0,71]. 

Moreover, if y € [0,7], then cos (cosy) = y. 

(a) If k is even, then cos (a — k7t) = cos xcos (k7t) + sin sin (k7t) = cosa. 
eS 


ee’ 
1 0 


And so, cos! (cos.z) = cos”! [cos (x — knt)] = x — kr. J 


(b) If & is odd, then cos[(k+1)7- a] =cos[(k +1) 2] cosx+sin[(k +1) 7] =cosz. 














a oe oOo 
And so, cos"! (cos.z) = cos”! [cos ((k + 1) m-z)] = (k +1) 1-2. J 





We're actually cheating a little with this proof here, because we haven’t explained how the derivatives 
of complex-valued functions work. We simply assume that they work “fairly similarly”. 
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Fact 193. Let z and w be non-zero complex numbers. Then 


(a) |zw| = |2| |w; and (b) arg (zw) =argz+argw + 2kn, 


if argz+argw > TL, 


where in (b), if argzt+argw« (-7,7], 





if argz+argw <—-7. 


Proof. We already proved (a) in Exercise 322. We now prove (b). 
As in (a), let r = |z|, s =|w|, 0=argz, and ¢=argw. 

Note that 0,¢@€ (-7,7t] and so 6+ ¢e (-27, 27]. 

Case 1. Suppose sin(@ +) >0. Then by Definition 194, 


_,rscos (6+ ¢) 


TS 


arg (zw) = cos = cos | [cos(0+¢)]. 


Also, since sin (0 + @) > 0, we have 6+ ¢€ (-27,,-7] U [0,71]. 


Case la. If 6+ ¢€ (-27,-7], then by Lemma 12, 
arg (zw) = cos [cos (0+ ¢)] =0 +6 + 2m. 
Case 1b. If 6+ ¢€ [0,7], then by Lemma 12, 


arg (zw) = cos! [cos (0+ ¢)] =0 + ¢. 


Case 2. Suppose sin(@+@) <0. Then by Definition 194, 
_,7Tscos(O+¢) _ 


TS 


—cos' [cos (6+ ¢)]. 


arg (zw) 2 —cos 


Also, since sin (0 + ¢) < 0, we have 0+ ¢€ (-71,0) U (7, 271). 
Case 2a. If 6+ ¢€ (-7,0), then by Lemma 12, 


arg (zw) 2 -cos”![cos(@ + ¢)] = -[-(0+ ¢)] =0+¢. 
Case 2b. If 6+ ¢€ (7,27), then by Lemma 12, 
arg (zw) 2 —cos”! [cos (0 + ¢)] = - [22 - (0 + 6)] =0+¢-2n. 


We've just shown that arg (zw) = arg z+ arg w+ 2k7, with 














1, for argz+argw > TI, 
k=40, for argz+argw €(-7,7], 
1, for argz+argw < —-T. 
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Fact 194. Suppose w is a non-zero complex number. Then 


@ Eli 


il 
(b) If w is not a negative real number, then arg — = —argw. 
w 


il 
(c) If w is a negative real number, then arg — = argw =T. 
w 





Proof. Let w=a+ib#0. By Fact 186(b), 


o lara) 
w  \a2+b??a2+b]” 


(a) By Definition 193, |w| = Va? +b? and 


|- a), b \ Vat+P 1 1 P 
wl a2 + b? a2+b?} a+b? — \/g24h2 |u|’ 
i a 
cos - —————, for b> 0, 
Va? +b? 
(b) By Definition 194,  argw = 
cos! — for b< 0 
Var+ , 
_, af (a7 + 8) . —b 
cos ~» ——————. = cos. ©’ ————., for —— >0 or b<0, 
l 1/Va? + b? Va? +b? a? + b? 
And, arg — = 
w ae 
_, af (a? + Bb?) 7. @ —b 
=¢cos ~~ —————— > = cos * ———., for —— <0 or b>0. 
1/Va? +b? Va? +b? a? + b? 
Thus, if b< 0, then ar ee a re w J 
8 5 gw. 
And if b > 0, then ar ee ee w J 
: *w Vee 


il 1 
If b=0, a> 0, then argw = arga = 0 and arg — = arg — = 0 so that indeed, 
w a 


il 
arg — = —argw. J 
w 














1 
And in the exceptional case where b = 0, a < 0, we have arg — = 7 =argw. 
w 
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Proof. We already proved (a) in Exercise 325. We now prove (b): 


Suppose w is not a negative real number. Then by Facts 193 and 194, 


i 1 
arg an arg (-—] =argz+arg— + 2k7=arg z-—argw + 2k7, 
w w w 


1 

-1, for argz+arg— =argz-argw>T, 
w 
1 

where k=3 0), for arg z+arg — =argz-argw€ (-7,7], 

w 
1 

1, for argz+arg— =argz-argw < -7. Jv 
w 


Now suppose instead that w is a negative real number. Then arg w = —71. 
And now by Corollary 40, 





€(-7,71] 0 
—_., aa 
x (2) arg z-M=argz-—argwt2kn, for arg z > 0, 
arg — = arg (-z) = 
Ww arg z+M=argz-—argwt2kn, for argz < 0. J 
°F 
<7 
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146. Appendices for Part V. Calculus 





Revision in progress (November 2021). 


And hence messy at the moment. 
Appy polly loggies for any inconvenience caused. 
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146.1. Some Useful Terms 


Figure to be 


inserted here. 





Let c be a point and ¢ > 0. Previously, Definition 275 already defined N-(c), the e- 
neighbourhood of c. We now define several similar and related terms:°”° 


Definition 315. Let ce R and ¢>0. Define the 


e left e-neighbourhood of c, denoted N; (c), by Nz (c) = (c-«,¢); 
e right e-neighbourhood of c, denoted N? (c), by Ni (c) = (c,c +e); and 


¢ deleted (or punctured) e-neighbourhood of c, denoted N-(c), Nz(c) = (c-e,c) vu 
(c,c+é) =(c-—€,c+e) \ {c}. 


Definition 316. Let ce R. We shall say 


¢ Some neighbourhood of c to mean (c-¢,c +e) for some € > 0; 
e Some left neighbourhood of c to mean (c- €,c) for some é > 0; 


¢ Some right neighbourhood of c to mean (c,c+€) for some ¢ > 0; and 


e Some deleted neighbourhood of c to mean (c-€,c)U(c,c +e) for some € > 0. 


Example 1579. The function f :R > R defined by f (x) =z is 


e Positive on some neighbourhood of 1; 
e« Negative on some left neighbourhood of 0; 
e Positive on some right neighbourhood of 0; 


e Non-zero on some deleted neighbourhood of 0. 





Informally, given a point x and a set S, if we can always find another point in S' that’s 
“arbitrarily” close to x, then we call x a limit point of the set S. Formally, 








Though for simplicity, we restrict these definitions to the case of the real number line R. 
°*7Velleman (2016) uses x > a*, which is also non-standard. 
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Definition 317. Let SCR. We call x a limit point of S if for every € > 0, 


HA erie 2o. 





In words, x is a limit point of S if (and only if) every e-neighbourhood of x intersects S at 
some point other than x. 


Important note: For xz to be a limit point of S, there is no requirement that x ¢ S. That 
is, it is possible that x ¢ S is a limit point of S. 


Earlier, Definition 276 defined isolated points. Together with Definition 317, we have the 
result that an isolated point is never a limit point: 


Fact 284. Let x be a point and S' be a set. If x is an isolated point of S, then x is not a 


limit point of S. 





An isolated point of a set must be in that set. In contrast, a limit point of a set need not 
be in that set. 


So, the converse of Fact 284 result is false. That is, the following claim is false: 


“If x is not a limit point of S, then zx is an isolated point of S.” 


Or, “If x is not an isolated point of S, then x is a limit point of S.” 


Instead, we have this partial converse: 


Fact 285. If x is not a limit point of S. then either x is an isolated point of S orxé€S. 


Or, If x is not an isolated point of S and x¢€S, then zx is a limit point of S. 





We already discussed degenerate and non-degenerate intervals in Exercise 42. Formally, 


Definition 318. A degenerate interval is an interval that is empty or contains one num- 


ber. A non-degenerate interval is an interval that contains more than one number. 





Or equivalently (and as already discussed in Exercise 42), a degenerate interval is an interval 
that has (i) equal endpoints; or (iii) finitely many elements. A non-degenerate interval is 
an interval that has (i) distinct endpoints; or (iii) infinitely many elements. 
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146.2. Limits 


In the main text, we described the statement lim f (x) = L € R informally as 
wa 


For all values of x that are “close” but not equal to a, 
f (2) is “close” (or possibly even equal) to L. 


We now formalise the idea contained in the above informal statement: 


Definition 319. Let f be a nice function with domain D and a be a limit point of D. 
We say that the limit of f at a is L€R and write lim f (x) = L if 


For every € > 0, there exists 6 > 0 such that 
x € DaX;(a) implies f (x) € N-(L). 





The above definition is called the ¢-6 definition and is usually credited to Bernard Bolzano 
(1781-1848) and Augustin-Louis Cauchy (1789-1857). 


Subtle but important point: The above definition includes the requirement that a be a 
limit point of D. If a is not a limit point of D (e.g. when a is an isolated point of D), then 
lim f (x) is simply undefined (or does not exist). 

wa 


The following result says that if the limit is defined (or exists), then it must be unique: 


Fact 286. Let DCR and f: D>R. If lim f (x) = LZ, and lim f (x) = Lo, then Ly = Ly. 





Proof. Suppose for contradiction that Ly # Ly. Let ¢ = |L,—- L9| /2. Observe that N-(L1) 9 
N- (Lz) = @. 

By Definition 319, there exist 61, 52 > 0 such that x € DAN ining5,,5)} (a) implies f (x) ¢ N- (L1) 
AND f (x) ¢N-(L2). But this contradicts =. 














We tweak Definition 319 to get left- and right-hand limits: 


Definition 320. Let DCR, f: D>R, and a be a limit point of D. 
We say that the left-hand limit of f at a is L €R and write lim f (x) = L if 


For every € > 0, there exists 6 > 0 such that 
“== DTN, (Ga) implies j (a) < N.( 5). 


We say that the right-hand limit of f ata is L ¢ R and write lim f (x) = L if 


For every € > 0, there exists 6 > 0 such that 
a< DTN, (a) moplies 7 (7 )<N. (2). 
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Remark 212. When we write lim f (x) 2 L, the symbol “x” is, as usual, a dummy variable 


that can be replaced by any other symbol, such as y, z, or ©. So for example, these four 
statements are equivalent: 


limf(r)=L, limf(y)=£, limf(z)=£, ue Oya st 


And so, really, the symbol “x” is superfluous. We could very well rewrite + more simply 
as 


lim feds 
Similarly, we could very well rewrite lim }(2)=2 and lim, f(t) = las 
lim f =e and lim f = 1 
Fact 196. Suppose f is a nice function. Then 


lim f (x) =a = Ligne Ge) el lim, f (x). 








Proof. Immediate from Definitions 319 and 320. 











So far, we’ve only defined what the statement lim f (x) = L means. Strictly and pedantically 
wa 
speaking, we have not yet defined what a statement such as the following means: 


limkf (x) = M. 
We shall assign to + this meaning: 


Let g=kf. Then limg (x) = M. 
wa 
Similarly for other statements. For example, the statement limc 2 ¢ shall mean this: 


Define h: R > R by h(a) =c. Then limh (2x) =c. 


1655, Contents www.EconsPhDTutor.com 


146.3. Infinite Limits and Vertical Asymptotes 


Definition 321. Let DCR, f: D>R, and a be a limit point of D. 


(a) We say that the left-hand limit of f at a is oo and write lim f (x) = oo if 


For every N € R, there exists 6 > 0 such that 
ae DivN, (a)amplies.) (7) oN. 


(b) We say that the right-hand limit of f at a is co and write lim f (x) = oo if 


For every N € R, there exists 6 > 0 such that 
tz € DON; (a) implies f (x) > N. 


(c) We say that the limit of f at a is co and write lim f (7) =o it 
For every N € R, there exists 6 > 0 such that 
x é€ DNs (a) implies f (x) > N. 
(d) We say that the left-hand limit of f at a is —oo and write lim f (x) = —oo if 
For every N € R, there exists 6 > 0 such that 
z € DAN; (a) implies f (x) < N. 


(e) We say that the right-hand limit of f at a is -co and write lim f (x) = —oo if 


wa 


For every N € R, there exists 6 > 0 such that 
z € DAN; (a) implies f (x) < N. 


(f) We say that the limit of f at a is -oo and write lim f (x) = -oo if 


For every N € R, there exists 6 > 0 such that 
xz é€ DaX;(a) implies f (x) < N. 
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146.4. Limits at Infinity and Horizontal and Oblique Asymptotes 


Definition 323. Let DCR, f: D>R, and LeR. We say that 


(a) The limit of f as x approaches oo is L and write lim f (x) = L if 


For every € > 0, there exists N ¢ R such that 
Dn(N,o)#@ and f(x) ¢N-(L) for every x € Dn (N, 0); 


(b) The limit of f as x approaches —co is L and write lim f(x) = L if 


For every € > 0, there exists N ¢ R such that 
Dna(N,o)#@ and f (x) € N-(L) for every x € Dn (-o, N). 





Again, we can, at long last, write down a formal definition of horizontal asymptotes: 


Definition 324. If lim f(z) = L or lim f(x) = L, then the line y = L is called a 
horizontal asymptote of f. 





We can similarly define oblique asymptotes.?”® 


Definition 325. Let DCR and f: D>R. We say that 


(a) The limit of f as x approaches o is ax +b and write lim f (x) = ax +b if 


For every € > 0, there exists N ¢ R such that 
Da(N, oo) #@ and f (x) €N, (ax +b) for every x € DN(N, 00). 


(b) The limit of f as x approaches —oo is ax +b and write jim, f (@) =ar+0 it 


For every € > 0, there exists N ¢ R such that 
Dna(N, oc) #@ and f (x) € N, (ax +b) for every x € Dn (-o0, N). 


Definition 326. If lim f(x) =axr+b or lim f(x) = ax +6, then the line y = ax + b is 
called an oblique asymptote of f. 








58 Actually, with Definitions 325 and 326, we have no need for Definitions 323 and 324. 
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146.5. Asymptotes of a Graph 


This chapter parallels Ch. 142.7. 
So far, we’ve defined only asymptotes of functions. We have not defined asymptotes 


of graphs. 


Example 1580. Define f : R\ {0} > R by f(«) = 1/x. Then f has asymptotes y = 0 
and 2 = 0) 


Now consider instead the graph of y = 1/x—this is simply the set es y) €R*:y= tah. 


We would quite sensibly like to say that this graph also has asymptotes y = 0 and x = 0. 
The problem though is that we haven’t formally defined what any asymptote of a graph 
is. So, right now, we can’t actually say that this graph has any asymptotes. 





It turns out that writing down general definitions of asymptotes of graphs is pretty tricky, 
because a graph is simply any set of points (in R?). 


Example 1581. Below is a graph G. Intuitively and visually, we want to be able to say 
that the red line is a vertical asymptote for G. 


Ay 








But it’s tricky to write down a general definition of (vertical) asymptotes where this is 
SO. 





Here’s one possible set of definitions of vertical, horizontal, and oblique asymptotes: 
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Definition 327. Let G ¢ R? be a graph. We call 


(a) x =a a vertical asymptote of G if there exist ¢ > 0 and a nice function f with graph 
F such that 


(i) x =a isa vertical asymptote of f; and 
(ii) for every Pe F, N-(P)nG= {P}; 


(b) y=aa horizontal asymptote of G if there exist ¢ > 0 and a nice function f with graph 
F’ such that 


(a) y=a isa horizontal asymptote of f; and 
(b) for every Pe F, N-(P)nG = {P}; 


(c) y=ax+b an oblique asymptote of G if there exist ¢ > 0 and a nice function f with 
graph F’ such that 


(a) y=axz+b is an oblique asymptote of f; and 
(b) for every Pe F, N-(P)nG = {P}. 





The condition in each (ii) is to rule out “thick” lines or even “shaded areas” in G: 


Example 1582. Let G = [1,2] R. Absent (a) (ii) in the above definition, every vertical 
line x =a for any a €[1,2] would be a vertical asymptote of G. 


With (a) (ii), we’d say instead that G has no vertical asymptotes (which is probably what 
we want). 
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146.6. Rules for Limits 


Theorem 23. (Rules for Limits) Let f and g be nice functions; and k,L,M «€ R. 
Suppose lim f (x) = L and limg(x) = M. Then 


(a) lim [kf (x) ] (Constant Factor Rule for Limits) 
(b) lim[ f(r) + 9(z2)] (Sum and Difference Rules for Limits) 


(c) lim a aegis) (Product Rule for Limits) 


1 
(d) lim eS) a (for M +0) (Reciprocal Rule for Limi 
wa g OG; 


(e) Him 2 : (for M +0) (Quotient Rule for Limits) 






(Constant Rule for Limits) 


(Power Rule for Limits) 


Proof. First, note that the statements lim f (a) = L and lim g(a) = M say the following: 


© For every ¢; > 0, there exists 6 > 0 such that x ¢ Dn Np, (a) implies |f (x) - L| < ey. 
% For every €, > 0, there exists 6, > 0 such that x ¢ Dn N5, (a) implies |g (x) - M| < eg. 


Fix e > 0. For each Rule, we shall find some 6 > 0 such that if c ¢ Dn (a), then the value 
of the given function at x is less than ¢ away from the claimed limit. 


(a) Pick ef =e/|k|. Let df be as given by ©. Pick 6 = dy. 
Suppose x € Dn; (a). Then by ©, |f (x) - L| < ey and hence 


Af (a) — RL| = |Al[f (x) — L] < |hley = lhl e/ hi =e. 


(Proof continues below ...) 
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(... Proof continued from above.) 


(b) Pick e, =e€/2, e, =¢/2. Let df and 6, be as given by © and ye. Pick 6 = min {6/, dy}. 
Suppose x € Dn X5(a). Then by © and ¥, |f (x) - L| < €;, |g (x) - M| < eg, and hence 


If (2) 9 (2) - Sak |f (2) - L| + |g (z)- M| < ez +eg =e. 
en Inequality 


(c) Pick ef = ¢/|2M| and Eg 2 | (2e¢+2|L)). 
Let 6, and 6, be as given by © and we. Pick 6 = min {d,, 55}. 
Suppose x ¢€ Dn; (a). Then by © and %, |f (x) - L| < e+ and |g (x) - M| <e,. And so, 


If (x) g(a) - LM| 


( 
=|f(x)g(2)-f (a) M+ f(2)M-LM| (Plus Zero Trick) 
< |f (2) 9 (x) - f (x) M|+|f (2) M- LM] (Triangle Inequality) 
= |f (2)I lg (@) - M] + |f (2) - £I[M| (|ab| = |a| [O)) 
= |f (2) -—L + L||g(x) - M|+|f (z) - L||M| (Plus Zero Trick) 
< [|f (x) - L|+|L]]}lg(z)- M+ |f(e)-L||M] (Triangle Inequality) 
< (e+ |L))eq+er|M| 2 5+ 57 


1 =|M/’ 


(d) Pick e, = PTI) Let 6, be as given by ye. 


Mi? M M 
Assume € < 1/|M| (without loss of generality). Then €, 2 mar j vt : im <M =|M|, 


Let é = |M|/2. Since limg(x) = M, there exists 6 > 0 such that if 2 ¢ D ON;(a), then 
g(x) € Ne(M) and, in particular, g (x) #0. 

Pick 6 = {5,5}. Suppose 7 € Dn N5(a). Then |g(2)-M| Z é, and g(x) + 0. Also, 
lg (x)| S |M|-«¢,>0. And so, 


_ |g()- M2 Eg 2 Eg 
Ig (@)IIM] ~ Ig @)IIM > (MT =e) 








| L thie) 
g(x) M| | g(«)M 











e|M|? 9 
1 T+e]M| 7 e|M| - 
1 = =—-= 
e|M = 
(10 - 28th) [(1+ €|M|) |M|-e|M]°] || 


(Proof continues below ...) 
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(... Proof continued from above.) 
(e) follows from (c) and (d). 
(f) For any z, |k-k| =O<e. 


(g) We shall prove the Power Rule for Limits only in the case where a > 0: 





Let a> 0. Pick 6 =min{ = St. 
2expk’ 2 


Suppose x € Dn X;5(a). Note that x >0 (because 6 < a/2). And so, 


a ~ a*| = exp (k In) - exp (klna)| (Definition 335) 
= |(exp k) [exp (Inz) - exp (Ina)]| (Fact 224(c)) 
= |(expk) (4 -a)| (Definition 85) 
E 





= |jexpk| |x - al] < jexpk = 

exp k| |x — a] < |exp eck ; 
It remains to be proven that the Power Rule for Limits also holds in the case where a < 0. 
Unfortunately, such a proof must be omitted altogether from this textbook, for reasons 
briefly discussed in Remark ??. 
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146.7. Why the Geometric Series “'Trick” Doesn’t Always Work 


We now reproduce Examples 737 and 738 from Part II (Sequences and Series). We explain 
why the “trick” works in the first but not the second example: 





Example 737. Suppose S = 1+ 


il ep eee eee ae 
2.) 4s 


2.90, 0 =25 = ( E : J-Qeles+zese..) or 8-2 


3. Hence, S = 2. 


The above argument is mostly correct, but is informal and lacks some details. Here is a 
fuller and more formal presentation of the same argument: 


jae) ame | 
For each k « Zj, define S,; =1+ 5 + Al + A +: eT 7: Suppose we are told that jim 5; is 


convergent, i.e. is equal to some real number S. Then 
ib jim (25e\= (jim 2] (tim si] 250 
Here we use the Product Rule for Limits.°®? This is legitimate if and only if jim Sz is 


actually convergent. 


Sp-28.=(14 545454 oeg)-(2414 54 zt 


jim (Sp—25;) = jim Sh - jim 25; = S-2S = —-S. Here we use the Product and 
Difference Rules for Limits. This is legitimate if and only if jim S; and jim 2S), are 


actually convergent. 


i} iI 
lim (-2 — | = lim -2 - lim — = -2. Here we use the Difference Rule for Limits. 


k-oo k-oo k->0oo Dk 


This is legitimate if and only if lim — is actually convergent. 


k-oo Qk 
3. Hence, —S = -2 or S = 2. 








°9We also use the Constant Rule for Limits. These two Rules are stated as part of Theorem 23 (Part V, 
Calculus). 
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Example 738. Suppose S=1+2+4+8+4+16+.... Then 

1. 25=24+4+8+16+32+... 

2. So, S-2S =(14+2+4+8+16+...)-(2+4+8+16+32+...) or -S=-1. 

o. Hence, S = 1, 

Let’s see why the above argument does not work. 

For each k ¢ Zé, define 5, =1+2+44+8+---+2*1. We do not know whether lim 5; is 


—>0o 


convergent (indeed, it is not). Now, 

iF jim (25;) = (im 2) (jim si] See x 
Here we use the Product Rule for Limits.°°° This is legitimate if and only if jim Sz is 
actually convergent (it isn’t). 

8 —2S_ = (1424448 4---4+ 241) - (24448416 4---4+2*) = 1-2". 

jim (Sp — 2:5),) = lim Sx — jim 25 = 5 =259 = =o. x 
Here we use the Product and Difference Rules for Limits. This is legitimate if and 
only if jim S;, and jim 2.5; are actually convergent (they aren’t). 


lim (1- 2") = lim 1 - lim 2* = 1. x 


00 co 


Here we use the Difference Rule for Limits. This is legitimate if and only if iim 2* is 


actually convergent (it isn’t). 
3. Hence, —S =1 or S=-1. x 
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146.8. Continuity 


Definition 198 is the definition of continuity used in this textbook and also most introduc- 
tory calculus texts. However, past the introductory level, the preferred definition is usually 
this e-d characterisation of continuity: 


Proposition 30. Suppose DCR, f: D>R, andce D. Then statements (a) and (b) 
are equivalent: 


(a) f is continuous at c. 
(b) For every « >0, there exists 6>0 such that x ¢ DnX;(c) implies f (x) € Nz (f (c)). 





Proof. If cis an isolated point of D, then (a) is true (by Definition 198) and (b) is vacuously 
true (because there exists 6 > 0 such that Dn XN; (c) = @). 


So, suppose c is not an isolated point of D. By Fact 285, c is a limit point of D. So, by 
Definitions 319 and 198, 


(b) <= limf(e)=f() = (a). 


Fact 198. Every constant function is continuous. 


Proof. Let DCR, f: D> R, and ae D. 
Let ¢ >0. Pick any 6 >0. If re Dn (a), then 























If (2) - f(@l=le-d=0<e. 


Fact 199. Every identity function is continuous. 


Proof. Let DCR, f: D> R, and ae D. 
Let ¢>0. Pick any 6=c. If x¢ Dn; (a), then 

















lf (2) - f (a)| = |u-al <d=e. 


Theorem 28. Let f and g be nice functions with Rangeg ¢ Domain f. If g is continuous 


ata and f is continuous at g(a), then fg is also continuous at a. 





Proof. Let Dy and D, be the domains of f and g. Let ¢ > 0. 


Since f is continuous at g(a), there exists 6 > 0 such that for every y € D;OX;(g(a)), we 
have |f (y) - f (g (@))| <e. 

Since g is continuous at a, there exists 6 > 0 such that for every 7 ¢ D, 1X5 (a), we have 
lg (2) -g(a)| <6. 

Hence, for every x € D, 1X5 (a), we have |(fg) (x) - (fg) (a)| <e. 














Fact 287. Let DC Rv {0}. Suppose f : D > R is defined by f(x) =1/x. Then f is 


continuous. 
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aze |al 


Proof. Let a¢ D and ¢>0. Pick 6 = t min | *\. Let  € DAN; (a). 















































2 
1 2 
since 6 <4 and x € (a- 6,a+06), we have i> ee hence ih Now, 
1 CHa 1 
Fe) f= [e-=]= [f= [El al 

1{1||1] ae 3 ]1] 2 ave 
—||—| — < |-| —— =e. 
allx}| 2 Jalal 2 




















Theorem 26. Suppose f and g are continuous at aé¢R. Then so too is each of these 
functions: 


(a) fag (b) f-g (c) © (provided g (a) +0) (d) of 





Proof. Let ¢ > 0; Dy and D, be the domains of f and g; and < and ~ denote the uses of 
the ‘Triangle Inequality and Plus Zero ‘Trick. 


(a) Since f and g are continuous at a, there exists 6, > 0 such that for every x € N;5, (a) n 
D;A Dy, we have f (x) € Nj (f (a)) and g(x) € N-j2(g(a@)), so that 


—_ a. la 


1 _ e2_ e/2 a) 
T+lf(@l” €g + |g (a)| 


Since f and g are continuous at a, there exists 6) > 0 such that for every x € N5, (a) AD snDay, 
we have f (x) e N., (f (@)) and g(x) é N.-, (g(a)), so that 


If (x) 9(@) - f(a) g(a)| = If (@) 9 (2) - f (@9(2) +f @g(2)- f(a@9@| 
<|f (c) g(x) - f(a) 9 (@)|+1F @ 9 (a) - fF @ 9 (a) 
=If(@)- Flo @I+ If Ola @) - 9 (@| 
Pf (@)-f @ll9(z) -9(@) +9 @l+If @llg(z) - 9 (a)| 
<|f (c) - f (@)| Lig) - 9 (a)| + 19 (a) + LF (@)l a (@) - 9 (a) 


601 


(b) Let 2 = and ep 


<erley tle (@ll + lf @leo? 5 +f @leg< 545 =e 


(c) From Fact 287 and Theorem 28, the function 1/g is continuous at a. The result then 
follows by also using (b). 


(d) Since f is continuous, there exists 63 > 0 such that for every x € N5, (a) n Dy, we have 
i (x) € Neji (f (a)), so that 
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cf (x) - cf (a) <lellf (@) - f (@I< ld 





Theorem 25. Let f and g be nice functions such that the composite function fg is 
well-defined. Let be R. Suppose limg (x) = 6b and f is continuous at b. Then 
wa 


lim f (g (2) = f (). 





Proof. Let Dr, Dg, and Dy, be the domains of f, g, and fg. Since fg is well-defined, we 
have Dr, = Dy. 


Let ¢>0. 
Since f is continuous at b, by Proposition 30, there exists a > 0 such that 


yeDpnNo(b) => — fly Ne (f(b). 


We will consider three separate cases: 

Case 1. a€éR. 

Since lim g(x) = b, by Definition 319, there exists 6 > 0 such that 
wa 


x€DyaXs(a) => g(x) €Na(b) = DpANa(b) = Dp AX (b) u {0} 


=> f(g(x)) «Ne (f(b) ULF (D)} = Ne (fF (0)). 
We’ve just shown that 
#<¢Dp,0N5(@) —= (fg) (x) € N- (f (0)). 
And so, by Definition 319, 


lim f (g (2) = f (0). 


Case 2. a= 00. 
Since lim g (x) = lim g (x) = 6, by Definition 323, there exists N ¢ R such that DjN(N, co) + 
@ and 


t>N = > g(x)eD;snNa(b) = D;nX, (bd) vu {b} 
=> f(g(x))€N-(f (b))n {f (b)} = Ne(f (0). 
We’ve just shown that 


t>N = (fg) (@) € Ne (f (6). 
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Also, Dg (N, 00) # @. 
And so, by Definition 323, 


lim f (9 (2) = f (0). 


Case 3. a=-—oo. 











Similar to Case 2. 





Take any continuous function f. Restrict its domain to create a new function g. Then g is 
also continuous. A bit more formally and precisely, 


Theorem 52. Let EC DER and f: D>R be a continuous function. Suppose g: E> R 


is defined by g(x) = f (x). Then g is also continuous. 





Proof. Let ae E and e>0. 
By the continuity of f and Proposition 30, there exists 6 > 0 such that 


x é€ DX; (a) = f(x) €N-(f (a)). 
Since Ec D and g(x) = f (x) for all x € E, we have, in particular 
xé€ EO; (a) ¢ DON; (a) —= g(a) =f (x) €N-(f (a@)) = N- (g(a). 











Thus, by Proposition 30, g is continuous at a. 
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Example 1583. Define the function f : R — R by 


Y 


1 
sin — fora = () 
f(z) = v 
0 ror % =U; 








Suppose (for contradiction) that lim, f(x) =LeER. Let ¢ =0.1. Then there exists 6 > 0 
such that 


z € DAN; (0) = (0,0+6) — f(a) €N, (ZL) = (£-0.1,£+0.1). 


But this is false, because given any 6 > 0, the function f takes on all values between -1 
and 1 on (0,0+6).%8 


With this contradiction, we’ve shown that lim f (x) does not exist. By Fact 197 then, 0 
xr—>0t 


is an essential discontinuity. 





Proof. Suppose f : D > R is an increasing function. (The proof of the case where f is 
decreasing is similar and thus omitted.) 


Let ae D®* and <> 0. We will show that f is continuous at a¢ D. 
We first make two observations: 
Observation I. Suppose f is not minimised at a. 


Then since the range of f is an interval and f is increasing, there must exist b < a such that 
f(b) € (f (a) -¢, f (a)). And moreover, since f is increasing, for every x € (b,a), we must 
have f (x) €[f (0), f (a)] and in particular f (7) € Nz (a). 


Similarly, 





63Tet b € [-1,1]. Pick any integer k such that sin-'b + 2k7 > 1/6 (the existence of such an integer k 
is guaranteed by the Archimedean Property). Now, let c = 1/ (sin b+ 2k). Observe that c < 6 and 
f (c) = sin(sin"'b+2k7) = sin(sin"'b) = b. We've just shown that for any b € [-1,1], there exists 
cé€(0,0+6) such that f (c) =b. 

6°4TF no such a exists, then D is empty and f is continuous. 
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Observation II. Suppose f is not maximised at a. 


Then since the range of f is an interval and f is increasing, there must exist c > a such that 
f(o € (f (a), f(a) +2). And moreover, since f is increasing, for every x € (a,c), we must 
have f (x) €[f (a), f (c)] and in particular f (7) € Nz (a). 


Depending on whether f is minimised and/or maximised at a, we have four possible cases 
to examine: 


Case 1. Suppose f is both minimised and maximised at a. 
Then f is a constant function and hence continuous (Fact 198). 
Case 2. Suppose f is not minimised but is maximised at a. 


Then using Observation I, pick 6 = a- 6 so that for every x € Dn X; (a), we have f (x) « 
Nz(a): 


Since f is maximised at a and is an increasing function, for every x > a (with x € D), we 
must have f (2) = f (a) « N-(a). 


Altogether then, for every x ¢ Dn X5 (a), we have f (x) € N- (a). 
Case 3. Suppose f is minimised but not maximised at a. 


Then using Observation II, pick 6 = c-a so that for every x ¢ Dn Xj (a), we have f (x) « 
N, (a). 


Since f is minimised at a and is an increasing function, for every x < a (with x € D), we 
must have f (x) = f (a) « N- (a). 


Altogether then, for every x ¢ Dn X5 (a), we have f (x) € N- (a). 
Case 4. Suppose f is neither minimised nor maximised at a. 


Then using Observations I and IJ, pick 6 = min {a — b,c - a} so that for every x € DAN; (a), 
we have f (x) € N; (a). 














Theorem 27. Let D be an interval and f : D > R be a one-to-one function. If f is 


continuous, then so too is its inverse f !. 





Proof. Since f is one-to-one and continuous on an interval, by Proposition 4, it is either 
strictly increasing or strictly decreasing. And so, by Proposition 6, the inverse f~! is also 
either strictly increasing or strictly decreasing. 


Observe that the range of f-' is D, which is an interval. So, by Proposition 31, f7! is 
continuous. 














The Extreme Value Theorem is the intuitively plausible result that if a function is 
continuous on a closed interval, then it attains both its minimum and maximum on that 
interval: 


Theorem 53. (Extreme Value Theorem) Suppose the function f is continuous on 
the closed interval [a,b]. Then there exist m,M €[a,b] such that for every x € [a,b], we 


have 





f(m)s f(a) s f(M). 
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Figure to be 


inserted here. 








Proof. Omitted.®° 














See e.g. Wikipedia. 
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146.9. The Derivative 
In the main text, we made this informal claim: 
A function f is differentiable at a point a —>? f is approximately linear at a. 
Here’s a formal version of the above claim: 


Proposition 32. Let DCR, f: D>R, andae D. Suppose Le R. Then statements 
(a) and (b) are equivalent: 


(a) f'(a)=L. 
(b) For every « >0, there exists 6 >0 such that 


xe Das (a) — lf (2) -[f (a) + L(x-a)]| <elx-al. 





Proof. Statement (a) is equivalent to this statement: 


For every € > 0, there exists 6 > 0 such that 





xé€ DX; (a) —= eee); 





The above statement is in turn equivalent to (b), because 


FO) 7) =| (@)-F@)-L@-2)] 


LF (x) - F(a) - Lea) |=|——|If @) - EF @) +L (wa) 


Now, it’s not completely obvious why Proposition 32 corresponds to our above informal 
claim. So, let’s parse it a little. 


il 
Yea | 




















Let t be the tangent line at a. The line t is described by this equation: 


y=f(a)+L(x-a). 


near” a. Then the value taken by t at x, is 


4 


Let x; be any point that’s 


f (a1) +L (a1-a). 


Figure to be 


inserted here. 





The vertical distance between f and t at x, is 
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f (x) -[f (ai) + L(x -a)]. 
We can think of line ¢t as our attempt to approximate f. We can thus also think of the 
above quantity as the error made by line t when attempting to approximate f. 


Proposition 32 then says that this error is smaller than ¢ |; — a]. Since € is usually assumed 
to be a small quantity and 2, is “near” a, ¢ |x, — al is itself a tiny quantity. Thus, Proposition 
32 says that the error (made by line t when attempting to approximate f) is tiny and hence, 
t is a good approximation for f (at least for x “near” a). 


Lemma 13. lim (xz - a) = 0. 


Proof. By the Difference, Power, and Constant Rules for Limits, 











lim (x - a) = lim — lima = a-a=0. 
wa wa «w 





Lemma 14. Let DCR and f: D>R. If lim[f (x) -c] =0, then lim f (x) =c. 





Proof. Let ¢ > 0. If lim [f (x) -c] =0, then by Definition 319, there exists 6 > 0 such that 


x é€ DaX;(a) —= f (x) - ce Nz (0) = f (x) € Nz (c). 














So, lim f (x) = c (Definition 319 again). 


We now use the last two lemmata to prove that differentiability implies continuity: 


Theorem 29. If a function is differentiable at a point, then it is also continuous at that 


point. 





Proof. Let DCR, f: D>R, and ae D. Suppose f is differentiable at a. 


For « # a, we have 





f(0)- f(a) = FOL) (0), 


Since f is differentiable at a, lim 


ft)" He) 





exists. Now, 


hao a 


Fe) Lta) 


r-a)|=lim[f(#)-f(@)], 


0= = lim (x -a) =lim 


w>a 


lim (x-a) = lim 
where + uses Lemma 13 and % uses the Product Rule for Limits. 

We’ve just shown that lim[f (x) - f(a)] = 0. So, by Lemma 14, lim f (x) = f (a). Hence, 
by Definition 198, f is continuous at a. 














In the main text (p. 884), we proved the Power Rule of Differentiation only in the special 
case where the exponent c is a non-negative integer. We now also prove it in the case where 
the base x is positive and c is any real exponent. 
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Fact 206. (Power Rule for Differentiation) Let DCR and f: D>R be a differen- 
tiable function. Suppose f is defined by f (x) =x° for some ce R. Then the derivative of 
f is the function f': D +R defined by 


fe) Fer, 





In Ch. 146.27 (below), Definition 335 will give the general definition of exponentiation: 


* 


xz° =exp(clnz), for x >0. 


We will use = to prove Fact 206: 


Proof. Suppose x > 0. Then, 
di. a 


a ee [exp (clnz)] 


= [exp(clnz)] - (cInz) (Chain Rule) 


d 
= cexp(clnz) ae lie (Constant Factor Rule for Differentiation) 
a 
1 
= cexp(clnz) — (Fact 221) 
ue 
ae! 
= cr — 
Z 
= ex), (Laws of Exponents) 


We’ve just proven that the Power Rule of Differentiation holds in the case where x > 0. 


It remains to be proven that the Power Rule also holds in the case where x < 0. Unfor- 
tunately, this proof shall be omitted altogether from this textbook, for reasons that were 
already discussed in Remark ??. 
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146.10. “Most” Elementary Functions are Differentiable 
In the main text, we made these imprecise claims: 


“Most” elementary functions are differentiable. 
Moreover, their derivatives are themselves elementary. 


The following results are my attempt to justify the above claims. 


Fact 288. Let D be an interval. Suppose f : D> R is a polynomial function. Then f is 





differentiable. Moreover, f' is elementary. 


Proof. Since f is a polynomial function, let ao,a1,...,a, € R and k € Zj, and suppose 
f (x) = ag +a,x +-+-+azx* for every x € D. 

k 
For each i =0,1,...,k, define h;: D> R by h; (x) = a;z', so that f (x) = )° h; (x) Observe 


i=0 
that by the Constant, Constant Factor, and Power Rules for Differentiation, each h; is 
differentiable with derivative hi: IR > R defined by hi (a;) = iar". 


Hence, by the Sum Rule for Differentiation, the derivative of f is the function f’: D> R 
defined by 


f(a) =F hi (@) = YA (@) = Pia 


We've just shown that f is differentiable. Moreover, the derivative of f is itself a polynomial 
function and hence also an elementary function. 














Fact 289. Let D be an interval. Suppose f : D > R is a trigonometric function. Then f 





is differentiable. Moreover, f' is elementary. 


Proof. In H2 Maths, the only trigonometric functions we consider are sin, cos, tan, cosec, 
sec, and cot. 


sin’ = cos (Fact 211) and cos is an elementary function. 
cos’ = —sin (Fact 212) and —sin is an elementary function. 
tan’ = sec” and sec? is an elementary function. 

cosec’ = cos and —cosec cot is an elementary function. 


sec’ = cos and sectan is an elementary function. 














cot’ = —cosec? and —cosec? is an elementary function. 


Fact 290. Let D be an interval. Suppose f : D > R is defined by f (x) =tan"'x. Then 





f is differentiable. Moreover, f' is elementary. 


Proof. The derivative of f is the function f’: D > R defined by f’ (x) = 1/ (1 - a) which 
is an elementary function. 
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Fact 291. Let D ¢ (-1,1) be an interval. Suppose f,g: D > R are defined by f (x) = 


‘re and g(x) = cos'x. Then f and g are differentiable. Moreover, f' and g' are 


sin” 
elementary. 





Proof. The derivative of f is the function f’: D > R defined by f’ (x) = 1/V1-<2x?, which 


is an elementary function. 


The derivative of g is the function g’ : D > R defined by g' (x) = -1/V1-.2?, which is an 
elementary function. C 


Fact 292. Let D¢ R* be an interval. Suppose f : D > R is defined by f (x) =Inx. Then 


f is differentiable. Moreover, the derivative of f is itself an elementary function. 





Proof. The derivative of f is the function f’: D > R defined by f’ (x) = 1/z, which is an 
elementary function. 














Fact 293. Let D be an interval. Suppose f : D > R is defined by f (x) =expx. Then f 
is differentiable. Moreover, the derivative of f is itself an elementary function. 





Proof. The derivative of f is the function f’: D > R defined by f’ (2) = exp, which is an 
elementary function. 














Fact 294. Let D be an interval that does not contain zero andcéeR. Suppose f: D>R 
is defined by f (x) =x°. Then f is differentiable. Moreover, the derivative of f is itself 
an elementary function. 





Proof. The derivative of f is the function f’: D > R defined by f’ (x) = cx!, which is an 
elementary function. 














The next Corollary is immediate from the Sum, Difference, Product, and Quotient Rules 
for Differentiation and our definition of elementary functions: 


Corollary 52. Let D be an interval. Suppose f,g: D > R are differentiable functions. 
Then f+ 9, f-9, f-g are differentiable and elementary. 





If moreover g(x) #0 for allae D, then f/g is also differentiable and elementary. 
The next Corollary is immediate from the Chain Rule: 


Corollary 53. Let D be an interval. Suppose f,g: D > R are differentiable functions. 





Then fg is differentiable. 


One way to see why not all elementary functions are differentiable is to examine the following 
gaps in the above results: 


¢ The arcsine and arccosine functions have domain [-1,1]. However, in Fact 291, we place 
the restriction that the domain D must be a subset of (-1,1). 


1676, Contents www.EconsPhDTutor.com 


e In Fact 294, we place the restriction that the domain D does not contain zero. 


¢ In Corollary 52, for the quotient f/g, we place the restriction that g(a) #0 for all x. 


Fact 295. Let D be an interval. Suppose f : D—>R is a polynomial function of order k. 
Then f is smooth. Moreover, for every integern>k+1, the nth derivative of f is the 





function f™ : D +R defined by f™ (x) = 0. 


Proof. The first k derivatives of f domain D, are real-valued, and are defined by 
f' (x) = kage® 1 + (k-1) ag_yr®? +--+ + 2aga + ay 
f" (@) Shh jag? e (R= 1) 62 age’ be 42a, 
fe U : ark (k-1)-+++- 20 + ag_1 (k-1)! 
f (x) = agk! 
For any n >k+1, the nth derivative of f is the function f : D > R defined by 
f™ (x) =0. 


We’ve just shown that for every positive integer n, the nth derivative of f is defined at 
every point in D, the domain of f. Hence, by Definition 209, f is smooth. 
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146.11. Leibniz Got the Product Rule Wrong 
In Ch. 90.2, we mentioned that Leibniz initially got the Product Rule wrong. Here are 
more details on this story: 


In an answer to a MathOverflow question, Francois Ziegler writes, 


In the manuscript “Determinationum progressio in infinitum” (...), Leibniz writes 
. (with “n” in place of “=”): 


_ 2 2. oe ee 
© =a [ . “Hente- do=—" aa 
a? +t? a? +t? 





az 


This amounts to asserting that d[uv] = du du where u = dt and v = ap and 
a 


thus differentiating the product wrong, as the editors comment in footnote 14. 


“Footnote 14” is from a 1923 edition of Leibniz’s collected writings and letters. It reads 
(translation by Google Translate): 


Leibniz differenziert das Produkt zunadchst falsch; in der weiteren Uberlegung 
nahert er sich trotz zweier Fehler der richtigen Formel. 


Leibniz initially differentiates the product wrongly; in the further consideration he 
approaches the right formula despite two mistakes. 


The 1923 editors date the above manuscript to “Anfang [Early] November 1675”. 


Note though that by 1675-11-11, Leibniz had clearly recognised that the naive Product 
Rule was wrong. From Edwards (1979, p. 254): 


In the manuscript of November 11, he poses the questions as to whether d (uv) = 
(du) (dv) and d(v/u) = (dv) (du), and answers in the negative by noting that 


d(x?) =(x+ dx)” — x? = 2x dx + (dx) = 2x dr, 
ignoring the higher-order infinitesimal, while 
(dx) (dx) = (w+ dx - x) (x +dx- 2x) = (dz). 


At this time Leibniz is still searching for the correct product and quotient rules. 


By 1675-11-27, Leibniz had arrived at the correct Product Rule (translation from Child, 
1920, p. 107): 





Ergo dxyndxy-—xdy. Quod Theorema sane memorabile omnibus curvis commune 
est. 


Therefore dxy ndxy - xdy. Now this is really a noteworthy theorem and a general 
one for all curves. 
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146.12. Proving the Chain Rule 


In Ch. 90.4, we gave an incorrect “proof” of the Chain Rule that contained two flaws. 
The first flaw concerns the possibility that g(x) - g(a) = 0 and hence that we might be 
committing the Cardinal Sin of Dividing by Zero (see Ch. 2.2). 


The second flaw concerns the use of this equation: 


fin £(9(2)) ~ f (9(@)) 2 


2 seg. 


While plausible, 2 requires some additional justification, as we now explain: By our defini- 
tion of the derivative, 


f(y)-f(g(@) 
j' (g(a)) = i y—g(a) : 


Hence, ? holds if and only if the following equation holds: 


im £W)=F(9()) jg, FO) F (9) 
yoga) = Y- g (a) ma = g(z)—g(a) 


which is again plausible, but not obviously true. 


The following proof fixes both of these flaws. 


Theorem 32. (Chain Rule) Let f and g be nice functions with Range g ¢ Domain f. 
Suppose f and g are differentiable. Then the function fog is differentiable and its 
derivative ts 


Geg)eg: 





Proof. Let D = Domain f, E = Domaing, and ae E. 
Define the function h: E > R by 


F(g(@)) = f(9@) 
n(ah) 9@)-9(@) 


f'(g(@)), for g(x) = g(a). 


We now state and prove two Lemmata: 


; for g(x) #g(a), 


Lemma 15. /fxe FE with x #a, then 


a a 2 f(g (2))- f(g (a) 


L—-a 


8) ecemerereams 





Proof. If g(x) # g(a), then 
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h(x) 2 g(a) g(a) 1 f(g ())- Fo (@)) 9 (@) -9 (@) _ FG (2) - F(a) 


—a g(x) - g(a) r-a r-a 








While if g(x) = g(a), then 


h(a) LLP =G 2 gg (q)) O = 9 - LY) FV) 


v—-a w—-a v-a 











This completes the proof of Lemma 15. 


Lemma 16. limh (x) = (g(a). 


Proof. Let ¢ > 0. Since f’(g(a)) exists, there exists 6 > 0 such that for every y € 
Dn; (g(a)), we have 





4 
<E. 


eee — f'(g(a)) 


y-g(a) 





Next, by the continuity of g, there exists 6 > 0 such that for every x ¢ En Ns (a), we have 
g(x) € DAN; (g(a)) and hence also 


f (g(a) - f(g (a) 
g(x) — g(a) 


If (9 (a)) - Fg (a) = 9 for g(a) = g(a). 


We’ve just shown that for any ¢ > 0, we can find 6 > 0 such that for every x € En NX5 (a), 
we have |h (a) - f’(g(a))|<e. Hence, 





ce for g(x) # g(a), 


- f'(g(a)) 





ln (a) - f(g (a))| = 


limh (2) = f’(g(a)). 











This completes the proof of Lemma 16. 





We now complete the proof of the Chain Rule:°° 





606 f(g (a)) exists because Range g € Domain f and f is differentiable. 

607 At =, we can use the Product Rule because these two limits exist: 

1. limh (x) 3 f’(g(a)) (by Lemma 16) 

g(x) - 9 (4) 
L-a 


2. lim 


za 


= g' (a) (by the definition of the derivative) 
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(fo9)"(a) «tim £29) @)= (29) (@) _,, £(9(@)) = fala) 


= lim E (x) 2 on 2) - _ 


ee" 


3 f"(g(a)) g'(a) 


Stim (elim 2 2-IO _ (g(a) 9! (a) 


We’ve just shown that for any a € FE, the derivative of fog at a exists and is equal to 
f(g (a)) 9" (a). 
Hence, the derivative of f og is the function (f og)’: E > R defined by 

(fog) (x)= f'(g(a))g9' (2). 


Or equivalently, the derivative of fog is 











(flog) -g’. 


Below is a weak version of the Inverse Function Theorem (IFT). It is weak because of 
the two strong assumptions in the second sentence.®*® These strong assumptions allow for 
an easy proof (using the Chain Rule.) 





Theorem 54. (Inverse Function Theorem) Let D be an open interval and f: D>R 
be a differentiable function. Suppose f is one-to-one and its inverse f ' is differentiable. 
Then for allaé D such that f' (a) #0, 


(f")' Cf (@)) = Pe 





Proof. By Proposition XXX, f-!o f : D > R is defined by Ca of) (x) = x. And so, 
by the Power Rule of Differentiation, ( oy ) ': D > R is differentiable and defined by 


(ftof)'(e) #1. 
Let a € D be such that f’ (a) #0. By the Chain Rule, (f7! 0 f)’(a) 2 (f-')' (f(a) (f)' (a). 
Plug = into 2 to get 1 = (f-')' (f(a) (f)'(a). Rearrange to get 


(fF )'F@)=aa 














Fwy 


Corollary 54. (Parametric Differentiation Rule) Let f and gq be nice functions with 
Range g © Domain f. Suppose f and g are differentiable. If a¢ Domaing and g' (a) #0, 
then 


(fog)'(a) 
g(a) 


Ui ea)ta)= 








68 These strong assumptions are usually absent from versions of the IFT usually presented elsewhere. 
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Proof. By the Chain Rule, (f og)‘ (a) = (f’°g) (a) g' (a). Rearrangement yields the result. 














It’s not at all obvious why the above result corresponds to our Parametric Differentiation 
Rule. To see why, replace f and g with the more familiar y and x. Then 


/ 
Tae (fg) 
f'(g) ==. 
>  ¢! 
dy es —Y 
dir dy . de 
dt ° dt 


(As usual, t is a dummy variable that can be replaced by any other symbol.) 
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146.13. Fermat’s Theorem on Extrema 


Theorem 35. (Fermat’s Theorem on Extrema) Let f be a function that is differ- 
entiable on (a,b). Suppose cé (a,b). 


If c is an extremum of f, then f’ (c) =0. 





Proof. We'll prove the contrapositive—i.e. if f’(c) #0, then c is not an extremum of f. 
Suppose f’(c) > 0. That is, suppose 
ty ETN 


ZC L-C 





Then there exists 6 > 0 such that for every x € (a,b) NX; (c), 
f@)-F(0, 


L—-C 


) 


and hence, «>c — > f(r)>f(c) and oie <> f(a) <7 Ce): 


So, c is not an extremum of f. 











If instead f’(c) < 0, then we'll similarly find that c is not an extremum of f. 





Definition 328. Let S¢R. We call x € S an interior point of S if there exists ¢ > 0 such 
that: Ne(2) eS, 


(Also, a point is called an interior point of a function if it is an interior point of that 
function’s domain.) 
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146.14. Rolle’s Theorem and the Mean Value Theorem 


Theorem 55. (Rolle’s Theorem) Let a<b. Suppose f is a function that is continuous 
on [a,b] and differentiable on (a,b). If f (a) = f(b), then there exists c€ (a,b) such that 


7 oe 


Figure to be 
inserted here. 





Proof. Since f is continuous on [a, b], by the Extreme Value Theorem, f attains a maximum 
and a minimum on [a,b]. 


Suppose f attains its maximum and minimum at a and b. Then since f(a) = f(b), it 
must be that f is a constant function. And so, by the Constant Rule for Differentiation, 
f' (x) =0 for all x € (a,b). 


If either the maximum or minimum occurs at some c € (a,b), then by Fermat’s Theorem 
on Extrema, f’(c) =0. 














Using Rolle’s Theorem, we can prove that a function whose derivative is a zero func- 
tion is itself is a constant function: 


Proposition 8. Let D,E CR and f: D> E be a function. Suppose D is an interval. If 
the derivative of f is the function f': D> R defined by f’ (x) =0, then f is a constant 


function—i.e. f defined by f (x) =c (for some ce E). 





Proof. Since f is differentiable, D must contain more than one element. 
So, pick any a,b¢ D with a< b. Note that since D is an interval, [a,b] ¢ D. 
Define g: [0,1] > R by 

g(a) = f(a+a(b-a))—- f(a)-a2[f (0) -f(a)]. 


Observe that g is a differentiable function, with 


g' (x) = (b-a) f’ (a+2(b-a))-[f (0) - f (a)] =0-[f0)-f@I=f(@)-f@). 
Also, g (0) = f (a) - f (a) -0 =0 and 
g(1) = f(a+1(-a)) - f(a) -1[f (0) - f(@)] =0. 


Observe that g satisfies the assumptions of Rolle’s Theorem. And so, there exists d € (0, 1) 
' 2 
such that g’(d) = 0. 


Together, = and 2 imply that f (a) = f (0). 
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We've just shown that f (a) = f(b) for any a,b¢ D. Hence, f is constant on D.° 











The Mean Value Theorem is a more general version of Rolle’s Theorem: 


Theorem 56. (Mean Value Theorem) Leta <b. Suppose the function f is continuous 
on [a,b] and differentiable on (a,b). Then there exists c€ (a,b) such that 


FO=-F@) 


fi(- 


Figure to be 
inserted here. 





Proof. Define g: [a,b] > R by 


b)-fla 
g(a) = F(0)- [EL (aa) + F(a], 
The function g is continuous on [a,b] and differentiable on (a,b), with 


f(6)- f(a) 
b-a 


a-a)+f(a)|=0 


g' (x) = f' (x) - 


a(0)=4(0) [LOO 


ie (0) =F) -[_ LE 6-0) + F(a] =0. 


Altogether, g satisfies the assumptions of Rolle’s Theorem. 
And so, there exists c € (a,b) such that g’ (c) 2). Together with +, we have 














(=f (Q- LEE «9 or fi(e) = OH) 





6°This proof was stolen from copper.hat. 
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146.15. The Increasing/Decreasing Test 


Fact 208. (Increasing/Decreasing Test, IDT) Let f be a function and D be an 
interval. Suppose f is differentiable on D. 


(a) f’(z) 20 for allaeD f is increasing on D. 


(b) f' (2) 20 torallxe D f is strictly increasing on D. 
(c) f’(z) <0 for allaeD f is decreasing on D. 
(d) f’(x) <0 for all xe D f is strictly decreasing on D. 





Proof. (a) (==> ) Pick any a,b¢ D with a< b. By the Mean Value Theorem, there exists 
c€ (a,b) such that f’ (c) = FO)" Fa) Since f’(c) > 0 and b-a> 0, we have f (b) > f (a). 
(a) (<= ) We'll prove the contrapositive: 
Pick any e € D. Suppose f’(e) < 0. Then there exists some 6 > 0 such that for every 
x € DX; (e), we have 

f(e)-F© 9 

Ze 

Hence, f is not increasing on D. 
The proof of (b) is identical to (a) (= >). 
The proofs of (c) and (d) are similar to (a) and (b) (respectively) and thus omitted. 
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146.16. The First and Second Derivative Tests 


Lemma 17. Letl<a<r,%° D be the interval (1,a) or [l,a), and E be the interval (a,r) 
or (ar 


Suppose f : DU {a} > R is continuous. 


(a) If f is increasing on D, then f (a) > f(x) for allxeD. 
(b) If f is strictly increasing on D, then f (a) > f (x) for all xe D. 
(c) If f is decreasing on D, then f (a) < f (x) for allxe D. 


(d) If f is strictly decreasing on D, then f (a) < f(a) for all xe D. 
Suppose g: EU{a} > R is continuous. 


(e) If g is decreasing on E, then g(a)> g(x) forallxe E. 
(f) If g is strictly decreasing on E, then g(a)> g(a) for allxe E. 
(g) If g is increasing on E, then g(a) < g(a) for allxe E. 
(h) If g ts strictly increasing on E, then g(a) < g(x) for allxe E. 





Proof. (a) We'll prove the contrapositive: Suppose there exists b ¢ D such that f (a) < f (0). 


Let € =[f (0) - f (a)] /2. By the continuity of f, there exists 6 € (0,a—6) such that for all 
x €(a—0,a), we have |f (x) - f (a)| < € and hence also f (b) > f (x). 


So, b¢(a-06,a). In particular, b< a-6. 
Hence, for any x € (a—0,a), we have b< x and f(b) > f (x)—-so, f is not increasing on D. 
(b) If f is strictly increasing on D, then it is also increasing on D, so that by (a), f (a) > 
f (a) for all xe D. 

Suppose (for contradiction) there exists c € D such that f(c) > f(a). Since f is strictly 
increasing on D, for any x € (c,a), we have f (x) > f (c) > f (a), contradicting : 
The proofs of (c) and (d) are similar to those of (a) and (b) and thus omitted. 
The proofs of (e)—(h) are similar to those of (a)—(d) and thus omitted. 














Lemma 18. Let D an interval, f: D> R be a continuous function, andae D. We shall 
call Dn (-o0,a) and Dn (a,c) “a’s left” and as “a’s right”, respectively. 


(a) If f ts increasing on a’s left and decreasing on a’s right, then a is a global maximum 
OF. 


(b) If f is strictly increasing on a’s left and strictly decreasing on a’s right, then a is a 
strict global maximum of f. 


(c) If f is decreasing on a’s left and increasing on a’s right, then a is a global minimum 


Oe 


(d) If f is strictly decreasing on a’s left and strictly increasing on a’s right, then a is a 
strict global minimum of f. 








610\We allow for | = —co and r= oo. In such cases, D and E must be open. 
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Proof. (a) If f is increasing on a’s left, then by Lemma 17(a), f (a) > f (x) for all xe D. 
And if f is decreasing on a’s right, then by Lemma 17(e), f (a) > f (x) for all xe D. 
Hence, a is a local maximum of f. 


The proofs of (b)—(d) are similar and thus omitted. 

















Proof. (a) By the Increasing/Decreasing Test (IDT), f is increasing on a’s left and decreas- 


ing on a’s right. Hence, by Lemma 18, a is a global maximum of f. 
The proofs of (b)—(d) are similar and thus omitted. 














1688, Contents www.EconsPhDTutor.com 


Example 1584. Define g: R > R by®! 


o(24sin=), for 2 20) 
g(z) = e 
0, tor 7= 0), 


1 
Observe that for any x +0, g(x) = 2+ (2 + sin - >x* (2-1) =a*>0. Hence, 0 is a strict 
1 


local (and also global) minimum of g. 


Figure to be 
inserted here. 


Also, g is differentiable (and thus continuous) everywhere, as we now show. For x ¢ 0, 
we have 


d 1 1 1\-l 1 1 
— G (2 + sin -)| = 4° (2 + sin - | +4 (cos =} ag E (2 + sin - | — cos -| é 
dx x x x) x? x x 


(This last expression is defined for all x + 0.) 


For x = 0, we have 





g (0) = lim g(a) — 9 (0) — 9 (0) = lim g(%) = lim 
x0 a) r-0 @ x0 


Altogether then, 


1 il 
x? E (2 +sin =} - cos —| . “tor 2s 0), 
g’ (a) = x “ 
OF fora ="(). 


Given any € > 0, we can always find some (a,b) € (0,¢) such that g'(x) < 0 for all 
x € (a,b),°!” so that by the Increasing/Decreasing Test (IDT), g is (strictly) decreasing 
on (a,b) ¢ (0,¢). Thus, the converses of (c) and (d) of Lemma 18 and Proposition 33 
are false. 


To similarly show that the converses of (a) and (b) of of Lemma 18 and Proposition 33 
are false, consider —g. 








61!'This example was stolen from Gelbaum and Olmsted, Countereramples in Analysis (1964, p. 36). 
6!2This should be easy to see by examining g’ (a). But lest you are not convinced, here is one possible 
construction: 
Let ¢ >0. Let b=1/(2k7), where k is any positive integer such that b < min {¢,1/8}. (The existence of 
such an integer k is guaranteed by the Archimedean Property.) 
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Proposition 10. Let f:[a,b] > R be a differentiable function and cé (a,b). Suppose c 
is the only stationary point of f in (a,b). 


(a) If f (c) ts greater than f (a) and f(b), then c is the strict global maximum of f. 
(b) If f (c) is smaller than f (a) and f (b), then c is the strict global minimum of f. 





Proof. (a) Suppose for contradiction that there exists x € (a,c) such that f (x1) = f (c). 
Then by the Mean Value Theorem, there exists 9 € (1, c) such that f’ (%2) = [f (c) - f (21) ] /(e- 
0, contradicting our assumption that c is the unique stationary point. 


Suppose for contradiction that there exists x3 € (a,c) such that f (23) > f (c) > f (a). Then 
by the Intermediate Value Theorem, there exists x4 € (a, x3) € (a,c) such that f (x4) = f (c). 
We will then arrive at the contradiction given in the first paragraph of this proof. 


The above two paragraphs show that for all x € (a,c), f (x) < f (c). We can similarly show 
that for all x € (c,b), f(a) < f(c). Since f(c) is also greater than f(a) and f(b), we 
conclude that f (c) > f (x) for all x € [a,b]. Thus, c is the strict global maximum of f. 


(b) Similar, omitted. 














Proposition 11. (Second Derivative Test for Extrema, SDTE) Let f be a function 
that is twice differentiable at c. Suppose f'(c) =0 (i.e. c is a stationary point of f). 

(a) If f" (c) <0, then c is a strict local maximum of f. 

(b) If f" (c) >0, then c is a strict local minimum of f. 

(c) If f" (c) =0, then the SDTE is inconclusive—more specifically, c could be any of the 


following: 


e a strict local maximum; 
e a strict local minimum; 
e an inflexion point; or 

e none of the above. 





Proof. (a) Suppose f” (c) <0. Then 





08 $6) tig DLO yy, LC) 


zc £—C 


Since f is twice-differentiable at c, f’ is continuous at c. And so, by 5, there exists ¢ > 0 
such that f’ (2) > 0 for all x € (c-¢,c) and f(x) <0 for all x € (c,c + €). By the FDTE(b) 
then, c is a strict local maximum of f. 





Observe that 


g' (b) = [4b (2 +0) — 1] = b°(8b-1) <0. 


+ 


By the continuity of g’, there exists > 0 such that if we let a = 1/(2km+ 6), then g’ (x) < 0 for all 
x € (a,b). 
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The proof of (b) is similar and thus omitted. 


(c) If f”(c) = 0, then c could be a strict local maximum (Example 1186), strict local 
minimum (Example 1187), inflexion point (Example 1188), or none of the above (Example 
1189). 


To prove that “c is exactly one of the following”, simply note that (i) a strict local maximum 
cannot also be a strict local minimum; and (ii) by Fact 297, an inflexion point cannot be 
an extremum. 
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146.17. Concavity 


Definition 329. Let f be a function and D be an interval that contains more than one 
number. 


If for every a,b¢ D and every X€ (0,1), 


(a) f(Aa+(1-A)d) 2 Af (a) + (1-A) f (6), then f is concave on D. 


(b) f(\at+(1-A)b) >Af (a) + (1-A) f (6), then f is strictly concave on D. 


(c) f(Aa+(1-A)b) < Af (a) + (1-A) f (0), then f is conver on D. 

(d) f(\at+(1-A)b) <Af (a) +(1-A) f (0), then f is be strictly convex on D. 
(d) f(Aa+(1-A)b) =Af (a) + (1 -A) f (0), then f is linear on D. 

A concave function is one that’s concave on its domain. 

A strictly concave function is one that’s strictly concave on its domain. 


A convex function is one that’s convex on its domain. 


A strictly convex function is one that’s strictly convex on its domain. 





A linear function is one that’s linear on its domain. 
It is immediate from the above definition that “linearity <= concavity and convexity” 


Fact 296. Suppose f is a function and D is an interval that contains more than one 
number. Then 


f is linear on D f is both concave and convex on D. 





The following lemma characterises concavity and can serve as an alternative definition 
thereof: 


Lemma 19. Suppose f is a function and D is an interval that contains more than one 
number. Then 


(a) f is concave on D <=> For any 21,2%2,23 € D with x < x2 < 43, we have 
itis (21) = TAt3) =f (o>) 
LQ — Ly 7 Jb — S65, 


(b) Same as (a), but replace concave and > with strictly concave and >. 


(c) Same as (a), but replace concave and > with convex and <. 


(d) Same as (a), but replace concave and > with strictly convex and <. 





Proof. (a) (==> ) Let 21, 22,23 € D with x, < x2 < 73 and A= ee 
XL3—- LY] 
Observe that x3 - 21 > @3 — £2 > 0, so that A € (0,1). 
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U3-— X92 TQ — 2X1 
Also, Ar, + (1 - A) 23 = L1+ 03 = £9. 
03 - L4 L3— 21 





If f is concave on D, then 
f (a2) 2 Af (ei) + (1- A) f (@s) 


t3— X92 LQ - LX 
= —— f (41) + ——f (23) 
U3 -Xy L3— L4 


| 


(23 = 21) f (to) = (#3 = £2) f (21) + (22 = 21) f (3) 
(x3 — £2 + £2 - 21) f (%2) > (%3 - Ze) f (41) + (22-21) f (z3) 


— (x3-22)[f (22) - f(21)] 2 (e2-21)[f (xs) - f (z2)] 
f (Za) = f (1) 7 f (x3) ~ f (t2) 


v2- Ly U3 - 2X2 


| 


J 


(a) (<=) Let 21,23 € D with x; < x3 and X€ (0,1). Let rg = Az, + (1—- A) 23. 
rp f Ata =F lei) fea) ~ f (a2) 








, then as shown above, 


a 3 — XQ 
f (Att (1-A) as) = f (a2) 2 Af (21) + (1 -A) f(s). 
So, f is concave. J 


To prove (b), simply replace the appropriate weak inequalities in our proof of (a) with 
strict ones. 











The proofs of (c) and (d) are similar to (a) and (d) (respectively) and thus omitted. 





Proposition 12. (First Derivative Test for Concavity, FDTC) Let D be an interval 
with endpoints | andr. Suppose f: D> R is differentiable. Then 
(a) f' is decreasing on (l,r) <> ff is concave on D. 


(b) ff’ is strictly decreasing on (l,r) f is strictly concave on D. 


— 
(c) f' is increasing on (l,r) <=> f is convex on D. 
—> 





(d) ff is strictly increasing on (l,r) f is strictly convex on D. 


Proof. Let a,b« D with a< b. 
(a) (== ) Suppose f is concave on D. 


By Lemma 19(a), for every cé (a,b), 
Flo)- Fla) , Fb) - Fle) 


c-a 7 b-c 
Hence, lim ie) 54a) > lim fo) - Fe) 
coa* c-a c>b- b-c¢ 
wane fi(a)>f'(b) Vv 


(a) (<=) Suppose f’ is decreasing on (1,1). 
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Let A € (0,1). 
By the MVT, there exist d and e with a< d< (Aa+(1-A)b) <e <b such that 
_fQa+U-A)b)- fla) _ fQa+ (A) b)- F(@) 


“— Aa+(1-A)b-a (1- A) (b-a) 
and Fz i 
~ b-[Aat+(1-A)b] d(b—a) 


Since f’ is decreasing, f’(d) > f’(e) or 
fQa+(U-A)b)-fl@ . f)-fQa+ A-)A)>) 


(1 - A) (b-a) 7 A (b-a) 
= ALfAa+ (1-A)b)- f(a)] 2 - A) LF (6) - FAa + (h- A) 8) 
— f(Aat+(1-A)b) > Af (a) + 1 -A) f (8). J 


(b) (== ) Suppose f is strictly concave on D. 


By Lemma 19(a), for every c,d,e such that a<c<d<e<b, 


f(c)—~ f(a) 1 Fd) - Ff (c) 2 Fle)-F@) 3 F(b)-F) 
d—-c e-d b 


Coa —] 
By S, 
f(a) 

van LOL) 4 F(D=- FO. 

coat c-a d-c 
Similarly, by 5 

f(b) 
FO-1M 5, FO)-FO 
é=d eer b-e 


4 4 
Together, . >, and > imply that f’(a) > f’ (0). "A 


(b) (<=) Similar to (a) (just replace the appropriate weak inequalities with strict ones). 
J 











The proofs of (c) and (d) are similar to (a) and (b) (respectively) and thus omitted. 
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Fact 209. (Graphical Test for Concavity, GTC) Let D be an interval and f be a 
function. Then 


(a) Pe eeconcre Poe f is on or above the line segment connecting 


any two points in D. 
f is above the line segment connecting any 


(b) f is strictly concave on D <> 


two points in D (excluding the two endpoints). 
(c) ones Pea) f is on or below the line segment connecting 
any two points in D. 


f is below the line segment connecting any 





(d) f is strictly conver 
two points in D (excluding the two endpoints). 


Proof. <= For every a,b¢ D and X« (0,1), f (Aat+ (1-A)b) > Af (a) +(1-A) f (0) 


=— f lies on or above the line segment connecting any two points in D. 


(b) Same proof as (a), but replace “concave”, “>”, and “on or above” with “strictly con- 
cave”, “>”, and “above”, respectively. 


The proofs of (c) and (d) are similar to those of (a) and (b). 














Concavity (or convexity) implies continuity: 


Proposition 34. Suppose a<b. If f : (a,b) > R is concave, then f is continuous. 


The previous sentence is also true if concave is replaced by strictly concave or convex or 
strictly convex. 





Proof. Suppose f is concave and cé€ (a,b). We will show that f is continuous at c. 


Pick any d€ (a,c) and any e € (c,)). 






Graph of f 


C= (cf (c)) | B= (ef (e)) 


D=4F@)) 


d Cc e€ 





Consider the points C' = (c, f (c)), D = (d, f (d)), and FE = (e, f (e)); and the lines DC and 
CE. Observe that the lines DC and CE are described, respectively, by 


S13 At \a+(1-A)b, the height of the graph of f is f (Aa+(1-A)b) and that of the line segment from a 
to bis Af (a) + (1-A) f (0). 
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HO- 1) ¢, FO-$O (0, 


y= floret -c) and y=f(e)+ 


By the GTC (Fact = for any x € (d,c), the point (x, f (x)) must lie (i) on or above the 
line DC; and (ii) on or below the line CE.°™ 


Similarly, for any x € (c,e), the point (x, f (2)) must lie (i) on or above the line CE; and 
(ii) on or below the line DC. 


Altogether, for any x € (d,e), the point (x, f(x)) must lie in the green region. So, it is 
intuitively “obvious” that as x gets “nearer” to c, f (x) must also get “nearer” to f (c). In 
other words, f is continuous at c.°! 


Let’s prove this last bit of “obvious” intuition a little more rigorously. For any «x € (d,e), 
we have 


Fo) 0) f(e)- mae 


min { f Ho+t bs ~), f(e)+ LIL }< fo) smax{s(Q +22 


Hence, 


IF (2) ~ F (6)| smax PFO, _ Wher Ft, _ ah 





—d e-C 


Fo) - FAI LF le) - Fey) 


= fe = max |? 


If f(c) - f (d) = f (e) - f (c) = 0, then by Lemma 20 (below), f is constant on (d,e) and 
hence also continuous on (d,e). 

So, suppose that at least one of f (c) - f (d) or f (e) - f (c) is non-zero. 

Let ¢ > 0. Pick 


E 
2max {OFO! WO=101\ 


e€-c 


O-= 








Then for any x € XN; (c) € (d,c) U(c,e), we have |x —c| < 6 and 


IF @) ~ F (0) se ~ elmax {IFA hed Foy 


Ifl)-f@I lfle)-F(O!)_« 
< 5max | 2A een =. 


cad eL€ 


We’ve just shown that for any ¢ > 0, there exists 6 > 0 such that x € (a,b) A N5(c) implies 
f (®)€N-(f (c)). That is, we’ve just shown that f is continuous at c. 




















®14(i) is direct from the GTC. To see (ii), suppose for contradiction that the point is above the line 
C’'E—then C' would be below the line segment connecting this point and FE, contradicting the GTC. 
615This “proof by picture” was stolen from &. 
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Proof. Suppose for contradiction there exists a€ (d,c) U(c,e) such that f(a) # f (d). 
¢ If f(a) > f (d) and aé (d,c), then the line segment connecting (a, f (a)) and (e, f (e)) 
lies above (c, f (c)), contradicting the Graphical Test for Concavity (GTC, Fact 209). 


If f (a) > f (d) and aé (c,e), then the line segment connecting (d, f (d)) and (a, f (a)) 
lies above (c, f (c)), contradicting the GTC. 


¢ If f(a) < f (d) and ae (d,c), then the line segment connecting (d, f (d)) and (c, f (c)) 
lies above (a, f (a)), contradicting the GTC. 


¢ If f(a) < f(d) and aé (c,e), then the line segment connecting (c, f (c)) and (e, f (e)) 
lies above (a, f (a)), contradicting the GTC. 





Hence, no such a exists and f is constant. 
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146.18. Inflexion Points 


Definition 330. We call c an inflezion point of a function if the function is continuous 
at c and there exists ¢ > 0 such that the function satisfies either statement (a) or (b): 


(a) Strictly concave on (c-¢,c) & strictly convex on (c,c+¢€). 
(b) Strictly convex on (c-¢€,c) & strictly concave on (c,c+¢€). 


Proposition 35. (First Derivative Test for Inflexion Points, FDTI) Let f be a 
function and c be a point. If c is an inflexion point of f, then c is a strict local extremum 


Of a. 





Proof. Since c is an inflexion point, by Definition 330, there exists € > 0 with € < € such 
that either (a) f is strictly concave on (c-é,c) and strictly convex on (c,c+é); OR (b) f 
is strictly convex on (c- €,c) and strictly concave on (c,c+ €). 


Suppose (a). Then by the FDTC (Proposition 12), f’ is strictly decreasing on (c-é,c) 
and strictly increasing on (c,c+é). And so, by the FDTE (Proposition 33), c is a strict 
local minimum of f’. 














Suppose (b). Then we can similarly show that c is a strict local maximum of f". 


As noted in Remark 141, the converse of the FDTI is false: 


Example 1585. Define f :R—> R by®!® 


ya 
g+a‘sin— for x2 +0, 
x 


f(z) = 
0 for ¢ =). 
With some work, we can verify that 
e f is differentiable 
¢ f’ is continuous 


¢ 0 is a strict local minimum of f’ 


However, 0 is not an inflexion point of f. 


Proposition 36. (Second Derivative Test for Inflexion Points, SDTI) Let f be a 
function and c be a point. Suppose f is twice-differentiable on some neighbourhood of c. 
If c is an inflexion point of f, then f" (c) =0. 





Proof. By the First Derivative Test for Inflexion Points (FDTT), c is a strict local extremum 
of f’. So, by Fermat’s Theorem on Extrema, f” (c) = 0. 














We now discuss the Tangent Line Test (TLT). 
Let f be a function and c be a point. Consider the tangent line of f at c. It is described by 





®16This example was stolen from Kouba (1995), “Can We Use the First Derivative to Determine Inflection 
Points?” 
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y=f(c)+ f'(c) (w-ce). 
If at t #c, the tangent line is strictly 


« Above f, then f (c)+ f’(c) (t-c)>f(®. 
¢ Below f, then f (c) + f’(c) (t-c) < f(t). 


Hence to say that the tangent line is strictly above f on c’s “immediate left” and strictly 
below f on c’s “immediate right” is equivalent to this statement: 


There exists ¢ > 0 such that 
f(c)+f'(c) (w-c) > f (2) for all x € (c-«,c). 
and f(c) + f' (c) (4 -c) z P(e) for all x € (c,c +e). 


Or equivalently, there exists ¢ > 0 such that®” 


3 f(a) - fl 





fi(e)< 


for all x € (c—€,c) U(c,ct+é). 


Conversely and similarly, to say that the tangent line is strictly above f on c’s “immediate 
right” and strictly below f on c’s “immediate left” is equivalent to this statement: 


There exists ¢ > 0 such that 





fi(o)> LWL for all x € (c—é,c) U(c,c+eé). 
L-C 

Hence, the TLT which was given informally in the main text may be rewritten formally as 

follows: 


Proposition 37. (Tangent Line Test, TLT) Let f be a function and c be a point. Ifc is 
an inflexion point of f, then there exists some € > 0 such that for alla € (c—e,c)U(c,c t+), 
either (a) or (b) is true: 





@ fo M9 @& roo ee 





Proof. By the First Derivative Test for Inflexion Points (FDTT), c is a strict local extremum 
of f’. That is, there exists ¢ > 0 such that for all 2 € (c—e,c)U(c,c+é), either statement 
(i) or (ii) is true: 


Gi) fO<f(@) (ii) f'(c)> f(x). 


Let x € (c-¢,c)U(c,c+e). By the Mean Value Theorem (MVT), there exists a between x 
and c such that 





3 1 , 1 
17To get <, rearrange > and <, in each case dividing by x -c. Note that in the case of >, x-c <0 and so 
when dividing by x -c, we must reverse the inequality. 
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So, if (i) is true, then (a) f’(c) < He)" 1) is true. 











Similarly, if (ii) is true, then (b) is true. 





There is the possibility that an inflexion point is an extremum: 


Example 1586. Define f :R — R by 


for x < 0, 


Oe te 
Ve tore > 0) 


Figure to be 
inserted here. 


Clearly, 0 is a strict local minimum (and indeed a strict global minimum) of f .°"® 


Observe that f is continuous. Moreover, f is strictly convex on R™ and strictly concave 
on R*. And so by Definition 330, 0 is an inflexion point of f. 


And so, awkwardly enough, we have a point that is both an inflexion point and a strict 
local minimum. 


(By the way, f is smooth everywhere except at 0, where it is not differentiable. ) 





The above awkward possibility is eliminated if we assume the function is twice-differentiable 
on some neighbourhood of the point: 





Proof. Since c is an inflexion point of f, by Proposition 35, c is a strict local extremum of 


ie 


Suppose for contradiction that c is an extremum of f. Then by Fermat’s Theorem on 
Extrema, f’ (c) 210. 


Since c is a strict local extremum of f’, we have on some deleted neighborhood of c either 
(i) f’ <0; or (ii) f’>0. 

If (i), then f > f (c) on some left neighborhood of c and f < f (c) on some right neighbor- 
hood of c, so that c is not an extremum of f (contradiction). 





618For all 2 +0, we have f (x) >0= f (0). 
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Similarly, if (ii), then f < f (c) on some left neighborhood of c and f > f (c) on some right 
neighborhood of c, so that again c is not an extremum of f (contradiction). 
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146.19. Sine and Cosine 


In this appendix, we’ll show that our formal definitions of the sine and cosine functions 
(Definitions 219 and 220 on p. 1012) correspond to our earlier unit-circle definitions. 


XXX To be written 
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146.20. Power Series and Maclaurin series 


Definition 331. Given the two series A = > a, end 8 = y b,, define 
n=0 n=0 


n 
One » G0 es for each n€ Zp. 
i=0 


Then the Cauchy product of A and B is the series > Che 
n=0 


Definition 332. We say that the series ee is absolutely convergent and converges 


absolutely to L if: 


Theorem 57. Suppose the series » An and y b, converge to A and B. For each né Zo, 
n=0 n=0 


n co Co 
let c,, = > Gb: If > ad, 1s also absolutely convergent, then » Cn converges to AB. 
1=0 n=0 n=0 








Proof. Omitted.° 











Theorem 58. Let f and g be functions and Rp, Rg € R*U {oo}. Suppose f (x) = D> anz” 
n=0 


for all x € (-Rr, Rr) and g(x) = > b,x” for all x € (-Re, Rg), If c€ (-Re, Re) and 
n=0 


Ds b,c” converges absolutely to some number in (—Rpr, Rr), then 


n=0 


(Fa) (0 = an( 3: bnc} | 








Proof. Omitted.°° 














619See e.g. Apostol (1974, p. 204, Theorem 8.46) or Rudin (1976, p. 74, Theorem 3.50). 
620See e.g. Apostol (1974, p. 238, Theorem 9.25: “Substitution Theorem”) or Lang (1999, p. 66, Theorem 
3.4: “Composition of Series”). 
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146.21. Antidifferentiation 


Example of a function that is antidifferentiable but not continuous:°”! 


Example 1587. Define f :R— R by 


1 

. on for x #0, 
f(z) = x 
0, for x = (); 


For x + 0, we have 


1 
f' (a) = 2x sin — — cos -. 
ie a 


—~f (0 az sin 4-0 i 
f' (0) = lim HO FO) - lim — 5 = lim (xsin—) = 
Hence, f is differentiable and its derivative is the function g : R > R defined by 


| 1 
2x sin — — cos —, for x #0, 

g (x) = ; © 
0, tor & =(), 


Thus, g is an antidifferentiable function. However, g is not continuous. 


Fact 215. Let f(x) and g(x) be expressions containing the variable x. If ~9 (2) = 
a 
HCE ethen i f@ydr =o) +C. 





d 
Proof. Suppose h is a differentiable function defined by h (x) = g(x). Since a! (ea 7 (a), 
the derivative of h is defined by h’ (x) = f (x). 
Now, suppose the function 7 is defined on an interval and by i (x) = f (x). Then by Corollary 


47, its antiderivatives are exactly those functions defined by 7’ (x) = g(x) +C (for C eR). 
That is, 











Teo) dr=9 (2) +C. 





The converse of Fact 215 is not true for the simple and pedantic reason that in Definition 
223 but not in Definition 206, there is the requirement that the function’s domain be an 
interval. 





621See Wikipedia for more examples. 
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146.22. The Definite Integral 


The following definition of the definite integral heavily favours simplicity over generality. 
Its chief advantage is that it simply formalises the intuition discussed in Ch. 104.4. Another 
is that it avoids any mention of suprema and infima (concepts we haven’t discussed in this 
textbook). 


Definition 333. Let a<b, f be a real-valued function defined on [a,b], n € Zj, and 


5) ) 


Suppose lim S,, exists. Then we say that f is integrable on [a,b]. Moreover, we call 
nN— co 


lim S;, the definite integral of f from a to bis S and will often denote it by 


ie fi fac or [i f(@) av 


If lim S, does not exist, then f is not integrable on [a,b] and the definite integral of f 
Nn cCco 
from a to b does not exist. 





Explanation. Partition [a,b] into 2” closed intervals of equal width. Then each such 
«qi OO baa . 1\b-a 
interval has width “on and midpoint a+ (i - 5) a 














b- 

For each such interval, consider the corresponding rectangle that has width = and height 
1\b- b- 1\b- 

f (« 4 (i — 5) ="). This rectangle’s area is aa (« + (i - 5) ="). 


We have in total 2” such rectangles. And their total area is S,. 


Figure to be 
inserted here. 





To prove some of the following results, we’ll need uniform continuity: 


Definition 334. Let f be a function and S be a set. We say that f is uniformly 
continuous on S if for every € > 0, there exists 6 > 0 such that for any a,b € S with 
la - b| < 6, we have |f (a) - f (b)| <e. 








Proof. Omitted. (See e.g. Abbott, 2015, pp. 132-133.) 
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b 
Theorem 42. Let a<b. If the function f is continuous on [a,b], then i iy (LISS. 
a 


Proof. Let ¢ > 0. Since f is continuous on [a,b], by Theorem 59, f is also uniformly 
continuous on [a,b]. That is, there exists 6 > 0 such that if c,d [a,b] and |c—d| <0, then 


if ()- FO < —— 





For each n € Zp and i¢€ {1,2,...,2”}, define the interval J, ; = ja +(¢-1) “4 . “|. 


att 
2” 2M 
By the Extreme Value Theorem, for each i € {1,2,...,2"}, there exist In j,Unj € Ing such 
that f (x) € [f (Ini) f (Unz)] for every x € I;,;. Let 


a2. 














by 722 =a Ss (lag) and v2” 
2” i=1 

: + 2 b-a 622 : : a oe 7 b-a : 
Pick N € Zp such that 6 > oN Consider Jy ;. Since this interval is of length re 

must be that |uy;—- ll < z <6. Hence, by cS f (una) - f Cvs) <5 se . Thus, 

b- b-a2\ ¢ b=a2%e 
U. L = = l i < = =€, 
N~ N= “ON Sy (uni) f (wy)] ON ba ON b=@ E 


6 
We’ve just shown that Uy - Ly < ¢—we'll use this later. 


For each n > N and each i € {1,2,...,2%}, let Tg = {2"-% (i- 1) +1,2"-% (¢- 1) +2,..., 2” Ni} 


gn-N 


(T4 is a set of integers). 


b-a 
Consider the intervals J, ;. They are disjoint and each has width cs Moreover, Ue, ,!nj = 





Inj. 
. 1\b-a 
For any 7 € Tn, we have a+ (1 — = a € UseT, An,j = Iw, and hence, by é, 
b-a 
f(a+ (9-5) *) er va) F (un). 
1 b-a 
Thus, an-N >, Flas (7 -5) "yi e[f (ni), f (un). 
j€L ni 


Now, 





622The existence of N is given by the Archimedean Property. 
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goed a a ie) 
ee ate & s(a+(i-5) 54) 


t=1 t=1 


ere il 2 Ree 
Ht Spd tae Fo) 
eal 


Ln, Un]. 


6 
We've just shown for every n > N, S, € [Ly,Uy] and hence, by <, |S, —L| < ¢.° So, 


b 
lim 6_.= [ f exists. 
n—oco a 














Theorem 43. (Rules of Integration) Let a <b, cé€ (a,b), andd,e¢R. Suppose the 
b b Cc 
functions f and g are continuous on [a,b], so that by Theorem 42, i i i G, [ He 


b 
and f f exist. Then 


(a) i (f+ g)= [ot + foo (Sum and Difference Rules) 


b c b 
(b) i f=} f+ f He (Adjacent Intervals Rule) 


(c) i (= aff. (Constant Factor Rule) 
b 

(d) [ d=(b-a)d. (Constant Rule) 

(e) If f>g on [a,b], then io > foo (Comparison Rule I) 

(f) Ifd< f(x) <e for every x €[a,b], then 


b (Comparison Rule IT) 
(b-a)as [ f<(b-a)e. 





Proof. For each n ¢€ Z), let 
b-a2z 1\b-a bag? 1\b-a 
Fi, = =< n= (ae . 
a YF (a+ (i 5) mH ) and G a Ya (a+(i =) i 


Since f and g are continuous on [a,b], by Theorem 42, 

















6231¢ is also true that |S,, — U| <e. 
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b b 
lim F* f feR and lim Gn =f geR. 





zl b 
(a) Define h = f +g. For each ne Zp, let oo = n(: +(i-5) =} = f+G,. 


By the Sum and Difference Rules for Limits, 
b b 
f fag- fhe lim H,, = lim (F, + Gp) # lim F,+ lim Ga= ff fe f gO 


(b) The proof of the Adjacent Intervals Rule is long so we’ll put it at the bottom. 





a 2 b 
(c) Define i = df. For each ne Zj, let I, = oe (« +(i- ;) x] = dF,,. 
= 


By the Constant Factor Rule for Limits, 


b 
f = f i= lim n= Jim (4F,) Sd lim Fy=d f f. / 


(d) Define 7: [a,b] > R by j (x) =d. For each ne Zj, let 


b-a2. b-a b-a2. b-a 
n= ) —_— = = gn = _ : 
d, a > j (« + (i 5] a )- a 24 = d=(b-a)d 














b b 
Hence, Lt iiae Gos J 


(e) Suppose f > g on [a,b]. Then for every ne Z)j and ve {1,2,...,2”}, 


So, for every NE Zo, 


Fy="89 5 (as +(i- 2) Pa), 8S 9(a+(i-4) 44) =o, J 


i=1 t=1 




















b b 
Hence, by Fact 267, [ r= lim i, 2 tim Gas [ g. 
(f) follows from (d) and (e). ¥ 


(b) We now prove the Adjacent Intervals Rule. 


For each n € Zj and p,q € [a,b] with p <q, partition [p,q] into 2” intervals of equal width. 
—p 
Qn 





Specifically, define the leftmost interval to be [{pajna = [pa 7 | and the remaining 











2” — 1 intervals to be [tp qjni = (p+ (i - Pp - iS | OEE Qonguye hs 
1\ Ge 
The midpoint of each of these 2” intervals is p+ neil — 5) ae 
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- 1 
Corresponding to each of these 2” intervals is a rectangle with width a height f (> + (i — 5) 
q-P . 1\q-p 
dh —= 
and hence area i ar (> ~ (i 5] an 
These 2” rectangles have total area 











Since f is continuous on [p,q], by Theorem 42, 
; 1 [4 
dim Panel = i feR. 


Next, for any n € Zj, suppose c is in the j (n)th interval. Then 7 (n) = 2". 
—a 


_ din) Z ae = 
Define = Tm f(a+(i-5] ~—*) and ‘= a > f(a+(i-5] ~—*). 


i=j(n) 











Observe that J, and r, are approximately the portions of 5,4) that are to the left and 
right of c. Moreover, 





2 b-a 1\b-a 
n¢1m? Spas) + af (a+ (4()- 5) a ). 


Below we state three Lemmata. If they are true, then we can also easily prove the Adjacent 
Intervals Rule: 


b c b 
fo f= Bi nt) = lim int tim = ff fe ff iz 


b 
Lemma 21. i f= la (i, 7) 


b-a 
Qn 


V2 frvo- [or 





; ; i? . CG 1 
Proof. Since lim Sp fap) = [ f and lim mee (« + (1 (n) - 5) 
for Limits, 


= (0, by the Sum Rule 


b 

















b- | 
lim. (len =P Ta) 2 lim. Een Tr Sf (« + (1 (n) — 5) 


Proof. The proofs of Lemmata 22 and 23 are similar. So, we’ll only prove Lemma 22: 
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q 


c 
Z 


Let ¢>0. 
Cc 
Since lim S), tae] = [ f, there exists N; such that for every m > Nj, 
nN Co a 


Sintec = [ - 


By uniform continuity, there exists 6 > 0 such that if |a, — | <6 (and 1,22 € [a,b]), then 





2 & 
|f (v1) — f (@2)| < Th (ea). 
Let M = man f (2). 
2 ( 


- 1 
Let m be any positive integer greater than max {in 2-4) /\w2, mi}, so that < holds. 


4(b-a)M 1 — — 
Let t be any positive integer greater than max {in eae ELS In2, In oCn9) In 2, In eved, 
E c-a 





Then these inequalities hold (as verified in the footnote): 
e=a@ b-a 3 et! ee ae q 274 E€ 5 € 
Qm s Qm-+t-3 se Qm-+t-3 . Qm+3? an c—-q2Zmtttl * gm+4 


Pix he ly Qivesge ys 

Let k; be the largest integer such that the interval [[a4}m+t,n 18 strictly to the left of the 
interval [fa.c},m,k- 

Similarly, let k, be the smallest integer such that the interval [jg pj) m+t4, 18 strictly to the 
right of the interval Dpq¢jm,k- 


b-a 


Qm+t ; 





The left endpoint of the interval Ifg6)m+t,m is a + (ki - 1) The right endpoint of the 


b-a 
interval Ira p),m+t,k, 1S at Re See The distance between these two endpoints is°”° 





624 


The existence of M is given by the Extreme Value Theorem. 
625 Below, we’ll use this identity a few times: 
l 
qn f/Ine = exp (na ?e/r) = exp (— ina] = exp (In B) =. 


3 c-a c-a c-a oO b-a b-a_ 8(b-a) 8(b-a) 8(b-a) oO 
For <, Qm - gin 2°) /In2 ~ a(ea) ~ 2 and Qm+t-3 7 9t-3 ~ ot gin 16-0) 7112 ~ 16(0-a) ~ 2° (So, 





é 
each of the two terms on the left of the inequality < is less than 5): 
b-a M (b-a) M (b-a) E 











For <, Qmst-3 gm-3+In 64(b-0)M 114 94 = Qm-3 x 64(b-a) M ~ Qm+3" 
= E 
5 b-a € b-a E b-a € E 
For <, = 








cam © ¢-qom+i+n 2 /in2 ~ C-agmtl x 8(b-a) ~ gd 
6c-a — 
S26For ky=hy+1< Foge +8, observe that the ratio of the width of the bigger interval [[a¢jm, to that of 
-a 
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b-a b-a 
a+ bys [a+ (h-1) soa |= (r= hr +1) 


So by z, for every 5 € {kj,k, +1,k, +2,...,k,}, 









































Estate (t-2)S2)-F58 Selo (58) rl 


=K] 





























7lf[e-a b-a 1\c-a b-a E 
—___ — ra 1 — — | —— k, - ky + 1) ———~ 

< | Si one (ky — ky + |r (a+ (a 5) on errr i+ )T6(ena) 
c-a b-a b-a E 

— - —_(k, - 1)| M| + —(k, - 1) ———= 

‘ | gin gant (Rr ~ hi + | + Smt (hr — hi + )T6 (ena) 

6\l[c-a b-af[c-a b-af(c-a 

—— = ——_ 8}| so (eo 8] 

|S ye (5 * ome \b— a 7°) 16(e—a) 

b-a E b-a € 


Qm+t—-3 - gm+4 v c-a Qm+t+1 


45 € E E & 
- Jm+3 7 Qmt+4 * Qm+t+4 ~ Jm+2 





Thus, 





6-0 . U=a _c=@ 


the smaller interval Ipq.¢jm, is 2°. This ratio should in turn be close to the number 





m "© Omtt” fR_ a 
k, — kj +1, which is the number of the smaller intervals that “just” more than cover one of the bigger 


6 —= 
interval. Hence, k,-—k,+1 < = 2! +8—where 8 is just a somewhat arbitrary but sufficiently big number 
-a 
6 
that guarantees < is true. 
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Set (0+ (b- 5) Se) Sar DF (* (5-5) ge) 


k=1 s=ky 










































































c-a c-a b-a & b-a\| 8 E E 
ay jean paling tes m = 
k=1 2 a +( 5] =| JQm+t 2 F(ar(s 5) se | Jm+2 4 
Also, 
b-a 2. kp—2 1 b-a b — g Jhmtt) 1 _— es es 
mei Dy Flat (3-5) st) Slee = Sc > f(a+(i-5) art) Smt DS (a: 
Hence, 
b= = 1\b-a 
i as 
+t Qm-+t urs (« + (s 5] 7) 
b-a 2 & 1\b-a bag 2. eS 1\b-a 
~ | om+t ys (at (s- 5) | 7 Qm+t yb Fe (s- 5) | 
b-a2|& 1\b-a kr—2 1\ b-a 
“pt 2 2 F(e+(s-5] pat) 2 F(a aa) 
b-a 2 








eB (eo(a 3) ts (os (oe 8) S58) (eo (4 3)S8) (04-4) 














b-a& C=O, b-a 9€ 
Somat 2, AM saul OMl= sea I< 5 


8 9 
Thus, by < and <, 


42€ € € 
hive oT a,c yn 49° 
| +t A all . 4 = 4 2 
And 
: c 42,1 € E 











We’ve just shown that given any ¢ > 0, we can find n = m+t such that 





n- [of 
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< E. 


Hence, 





This completes the proof of Lemma 22. 














Hence, this also completes the proof of the Adjacent Intervals Rule. 
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146.23. Proving the First Fundamental Theorem of Calculus 


Theorem 44. (First Fundamental Theorem of Calculus, FTC1) Let a< b, f: 
[a,b] > R be a continuous function, and cé [a,b]. Suppose the function g: [a,b] > R is 
defined by 


g(z)= fs 


Then g' =f (i.e. the derivative of g is f or equivalently, g is an antiderivative of f ). 





Proof. Define h: [a,b] \ {c} > R by 








h(a) = 29-99) _ 5 (6), 
Sea cl Reece 
h(a) = fi-fes -f(o) (Definition of g) 
: lef -f(e) (Adjacent Intervals Rule) 
lef 2-5 (, (Times One Trick) 


T-C L-C 





» : , oe, 7 [or (c) at| (Factorise, Constant Rule) 
~ — [ f(t) - f (e) dt. (Difference Rule) 


Let ¢ > 0. Since f is continuous, there exists 6 > 0 such that for every t € [a,b]n(c-6,c+9), 
we have 


-5<f()-f()<5. 


So, by Constant Rule and Comparison Rule II, for every x € [a,b] 1X; (c), 


E 1 s 
——. fo _ = cnt. 
<< f" p(i)- fo) dt= h(a) <é 
We’ve just shown that given ¢ > 0, there exists 6 > 0 such that for every x € [a,b] NX; (c), 


we have h(x) € N- (0). So, we’ve just shown that lim h (x) = 0. 


Or equivalently, lim [uate —f (0)| <0). 


By the Constant Rule for Limits, lim f (c) 2 f (c). Hence, by the Difference Rule for Limits, 


im 260) = 9 (2) =f(c). 


LC L-C 


1714, Contents www.EconsPhDTutor.com 


But of course, LHS is simply g’(c). Altogether then, we’ve shown that g’ (c) = f (c). And 
since c was chosen arbitrarily from [a,b], we’ve also shown that g'(«) = f(x) for every 
x€ [a,b]. Thus, g’ = f. 
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1 
146.24. [ — 
ax’ +bx+c 








by BP b\? b= 4ac 
2 7 = 
Proof. Observe that ax +bx+c=a(a+>-) ven Eval (e+ 3) gd } 
2 2 
2; a («+5-) -(=) : for b? — 4ac > 0, 
2a 2a 
1 b\? 
ax’ +ba+c= a(z+ >] for b? — 4ac = 0, 
a 


2 2 
a («+5-) +(=) ; for b? — 4ac < 0. 
2a 2a 


There are three possible cases. We examine each on the next page. 


(Proof continues below ...) 
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(... Proof continued from above.) 
(a) If b? — 4ac > 0, then 


i 1 +( A B )- (A+B)x+A5*+ BY! 


aa? +bn+c~ a(a+ 4) (a+ ad) a\a+ ¥4 vy ed a(n + 4) (x + A) 


2a 2a 


Comparing coefficients, we have A+ B@20 and 


Ap eee Aa =a) at 
2a 2a 2a 2a 


So, B- A=2a/d. Adding @2, we get 

















2a a —a 
26 =— or B= — d A=, 
7 or 7 an 7 
Thus, 1 1 A B 
v= {= -- [ Saee+ [ Sau 
ax? + bx +c a a+ be a+ 4 
1 1 
_ d d 
-f bid c+ f Sade 
=] b+d| 1 b-d 
= Sine — es rs +C 
ni qe ved 
=-—ln 28) AC 
b+d 
d or 








(b) If b? - 4ac = 0, then 
1 i 1 1. il = 
fa we1 [ -—n.u--- 7 +C= +C 
ax’ +bxt+ce a(x + 2) ant x 2ax +b 


(c) If b? — 4ac <0, then 














1 1 1/2a, _,x+b/2a 2, _,2axn+b 
{f= rO1— a! Gps — tan ta + 
az* + bate (x+ 2) +(£) a\d d/2a 


2a 














where @4 uses Proposition 16(a). 
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ii 
146.25. 7 dr 
Vax? +br+c 


Fact 217. Suppose a,b,c¢ R with a<0 and ax? +bx+c>0. Then 


1 ae 1 — (2az + 6) b) 
Vax? + bx +c a =e Vb? — 4ac 











d| 1 — ax +b) b) 
Proof. ee | var sin” aera “4 


_i 1 ~2a Ja 
Via cae ee, V0? = dae 138 Age = (2Qax +b) 
V b?2-4ac 
_ 2\/-a 7 1 
/-4a2x? —4abr—4ac Vaxr2+brt+e 


The above claim—the second sentence especially—may be considered a little imprecise. For 
a more precise statement of the above claim, see footnote.®?" 




















oe a 
/la Vb? — 4ac 
(a) If b? -4ac < 0, then this expression is undefined. 
=205 =U 
V b2 - 4ac 


Fortunately, we can dismiss these concerns: 


Remark 215. In Fact 217, the expression —— raises two possible concerns: 


(b) If ¢[-1,1] = Domain sin“, then again this expression is undefined. 


(a) Since a < 0 (n-shaped quadratic curve) and there exists x € R such that axz?+br+c>0 
(curve intersects y-axis twice), it must be that b? -4ac > 0 (otherwise the curve would 
intersect the y-axis at most once). (Qax+b)” 


(aa Te 
(hy) an? be cs a= 4077 4 4b ne <0 > 4a Pda Ae 


(a) (2ax +b) 
SS 

b? — 4ac 
oe (2ax + b) 
b? — dac 


es | 2ax +b jes oe 


———}<]1 = 
V b? — 4ac b? — dac 
e (-1,1) ¢ [-1,1] = Domain sin. 

















®7Tet a,b,c€ R with a<0. Let D be an interval with ax? + bx +c > 0 for all 2 ¢ D. Suppose the functions 
F:D->Rand f:D-R are defined by 





























F(z) ; t $e ae 
AQ) Sa an i bs <i 
Var-+br+¢ Val pc dae 


Then an antiderivative of F is f. 
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The next result says that if a #0 and x? +a>0, then 


da =In|V2? +a +a] +C. 


I yam 





Proof. If Vz? +a+a >0, then 





ca | 

£in|\Var+a +a] =“ n(Va?¥a+2) = 2 

= dix Var-+atz 
at+V2? +a 1 


“Vitea(V@ra+z) Vira 





If Vx? +a+2 <0, then 
-20@ 


<n |Vie +a +4] = <n (VF ¥a-2) : ee 


7 r+Var+a tf 
Vir +a(Va?+a+2) Veta 
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The next result says that if a >0, b? - 4ac + 0, and ax? + br +c + 0, then 


é b fe b 
me es ae ae a pe 
a a 2a 


Fact 299. Let a,b,c¢R with a> 0 and b?-4ac ¢ 0. Let D be an interval with ax?+br+c + 0 
for allxée D. Suppose the functions F: D> R and f: D>R are defined by 


| 


1 
—________—. dy = ——]n a ee 
aaa Ja 








1 1 b (é b 
F (sc) = —_—__ and v)=—lIn bp OE ee eee 
(2) Vax? + br +c F(z) Ja |V a a 2a 





Then an antiderivative of F is f. 


/ b b 
Proof. If ee ga ope 5: then 
a a 2a 

















2r+2 4] 
d | 9 b C b 1 2\/22+2a+¢2 
ce — ln ey ere = 
EVV a a @ VO, [x24 ony Sans & 
a a 2a 
VV v2t+eat stare 
1 a2+ea+S 1 1 1 
a Dg ieee ahs b a f2,b,,¢ Var2+brtc 
Ni ad a 1 le at a ae 


/ b b 
If g++ io +e+— <0, then 
a a 2a 
df1 [, dD b 1 nfaatare 
-2y/u2+2a4+£ 
af nde gee bees “_* 
de 70 a Ge, 2a a 


ee ee ee: ae eee 
Ger au + aa ae 
VV 22+ eatsta+e 


1 a2+2a+8 1 1 1 


Qr+2 
a 























5 
g+oe pe Vow +00 +6 
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146.26. The Substitution Rule 


Proposition 38. (Substitution Rule) Let G and wu be differentiable functions and 
g=G'. Suppose the composite function Gow exists. Then 


f geu)-w'] dz=Gou+C. 

















d 
Proof. By the Chain Rule, 7 


x 


(Gou)=(G'ou)-u’ =(gou)-u’. 
Here’s why = corresponds to Steps 3, 4, and 5 of our Five-Step Substitution Rule Recipe: 
; du 
[leu-whae = fa(u) Pae= f g(u) du (Step 3) 
=G+C (Step 4) 


=Gou+C (Step 5) 


Formally and strictly speaking, 
e Step 3 is wrong: We can’t just magically cancel out the dz’s. 
e Step 5 is also wrong: We can’t just suddenly plug wu back in. 


Only = is actually correct. Nonetheless, the Five-Step Substitution Rule Recipe serves as a 
(very) convenient mnemonic for =.°°° The Recipe does give us the correct answer because 
here it so happens that two wrongs do make a right. 





28Here once again Leibniz’s notation shines, just as it did with the Chain Rule and the Inverse Function 
Theorem. 
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Remark 216. Many calculus textbooks®’’ (and probably also many high-school/JC maths 


teachers) incorrectly present the Substitution Rule as 


f[ fo@as (x) d+ f f(u) du. 


The above equation says that the functions (fo g)-g' and f have the same antiderivatives, 
which is clearly false! 


This observation was already made by David Gale in his 1994 article “Teaching Integra- 
tion by Substitution”. I can do no better than reproduce his remarks: 


Of course the equation is false. The expression [f@ dx stands for 


antiderivative, as in a table of integrals, and the variable, be it x, t, u 
or anything else is a dummy. Clearly the antiderivatives on the left and 
right above are not equal. What the books mean, no doubt, is that if you 
substitute g(x) for u after taking the antiderivative on the right you get 
the antiderivative on the left. I expect some readers will say I am being 
pedantic or that there is no need to be so rigorous at the freshman level, 
but I think this kind of lapse is symptomatic of a rather strange set of 
standards and perhaps it sheds light on why none of the books proves the 
inverse substitution theorem. It is because none of them formulates it. 


Here are two corrected versions of = (without changing it too much): 


[ fo@))a@)ar= f raul [ea-a=(ftegre. 


i 
Letting F’ = f, we can also produce two more corrected versions of =: 


f FG @)9 (a) de =F (g(2)) +€ f[ (fe9)-9' = Fog+0. 








29Some textbooks that make this error: Stewart (Single Variable Calculus, 2011, p. 331); Thomas and 
Finney (Calculus and Analytic Geometry, 1998, p. 294)—see also Hass, Heil, and Weir ( Thomas’ 
Calculus, 2018, p. 291). Also, ProofWiki (retrieved 2018-10-06-1058). 
Some textbooks that do not make this error: Apostol (Calculus: Volume I, 1967, p. 212); Larson and 
Edwards (Calculus of a Single Variable, 2016, p. 296). 
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146.27. Revisiting Logarithms and Exponentiation 


In Ch. 5.4, we only defined what b” means in these two cases: 

1. « ¢ Z—so e.g., we know that 5? = 25, (-5)? = 125, and 4.5°? = 4/81; or 
2. b>O and x € Q—so e.g., we know that 57° = ( V5) 

We did not define what b” in these two cases: 

3. x €Q—so e.g., we don’t know what 5v3 is; or 

4. b<OQand z¢Z—so e.g., we don’t know what (-5)”° is 


And so, we’re actually cheating when we take for granted that the Laws of Exponents 
(Proposition 1) hold for all positive bases and real exponents. For example, we haven’t 


defined what 5V? and 5-V? are, yet we cavalierly take for granted that 





5V8 .5-V8 = 5VS+(-V8) = 50 = 1. 


Let us think about what b” might mean if x ¢ Q. 
Consider 5¥3. Say we know that V3 * 1.7320508... and also that 








51 = 5, 
5LT = 510 - (V5) w (1.174619...)""” w 15.425... 
pI = 51m = ("V5 v3) w (1.016 2246...)!” ~ 16.188... 
pt78 — 51000 = ( 5) w (1.00161073...)' 16.241... 
51-7320 _ 50000 = ( woe) "2 (1.000160957...)° 16.241... 
51-732.05 — 5 100-000 =| “5) x (1.0000160945...)7° ~ 16.242... 








17320 
pi-Taz0n0 = phat = (1° °%/5) = 


x (1.000016 0945...) 79°" w 16.242... 


And so informally, we might say that 5V3 ~ 16.2424... 
A little more formally, we might say that 5¥3 is the limit of the following sequence: 


51 517 51-73 51.732 51.7320 51.73205 51.732050 51.7320508 
And so, following the above discussion, one possible approach for defining b” in the case 
where x ¢ Q might go like this: 


« Assume (prove) there is a sequence of rational numbers (9, 21,...) that converges to x. 
e Use that sequence to form the sequence (b”,b”!,...). 


e Assume (prove) this latter sequence converges to some number y. That is, 
(b°°,b7,...) > y. 
e Now define b” to be equal to y. 
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This seems like a perfectly sensible approach. But strangely, it is not the approach we 


will take. Instead, somewhat surprisingly, we’ll define exponents based on the natural 


logarithm and exponential functions:*” 





Fact 223. Supposex>O andneR. Then Inz” =nlnz. 


Proof. Use Definitions 335 (above) and 85 (exp is inverse of In): 














Inz” = In[exp(nInz)] =nInz. 





639Which were in turn formally defined as Definitions 229 and 85 in the main text. 
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The following Proposition shows that for b> 0 and any x € R, Definitions 335 and 230 imply 
the four definitions of exponents and logarithms given in Ch. 5.4. So, we’ll let Definitions 
335 and 230 supersede those earlier definitions. 


Proposition 39. In those cases where the base is positive, Definition 335 implies Defi- 





nitions (a) 27; (b) 29; (c) 30; and (d) 32. 


Proof. Below, = and = denote the use of Definitions 335 and 85 (exp is the inverse of In), 
respectively. 


(a) Let be Rw {0}. 
Case 1. If x € Z*, then 


b” = exp(aInb) -exp(inb+ nb +--+ in 
eS 


ax times 
2 a eee 
= exp (Inb) exp (Inb)...exp (Inb) =b-b b. 
x times x times 
Case 2. If xe Z, then 
¥ 3 1 il 1 
vf exp(ind) = exp (<ind=Inb=--~ nb) 2 exp (Inf rin +--+ Inj] 
Rp 
|| times _—S 
|x| times 
4 (i *) (i 7] (1 a 1 I of 
=exp|{In—-]exp|{In—]...exp{In-]=-—---------=—. 
os ae ae ae on) ae oe 
——_—————————————— i” 
|2| times |2| times 


Case 3. If x = 0, then b” = b° 2 exp (0Inb) = exp0 21. 
Above, 2 and = used Fact 224(c), while 3 and 2 used Facts 222(c) and 224(a). 
(b) Let b> 0 and x €Z\ {0}. We'll show that (b*) = b: 


(vs) = exp (=n s)] = exp {em exp (=n »)]} = exp E (=m s)| = exp (Inb) = b. 


(c) If b>0 and x =m/n for some m,ne Z, then 


* ° 1 i ih * 1 \ 7% 
b* =exp(¢lnb)= exp(™ ind] = exp {rn Jexp(— Ino) = jexp(—in0}] = (o*) ; 
n n n 


Inn 


(d) b° =n <> exp(xInb) =n <> In[exp(zInb)] =Inn —> cinb=Inn <> cars 
n 











ee log,n. (The last step <*> uses Definition 230.) 





1725, Contents www.EconsPhDTutor.com 


In the main text, we gave a partial proof of the following Laws of Exponents, covering 
only the case where the exponents x and y were positive integers. Here now is a full proof 
covering also the case where x and y are any real numbers: 


Proposition 1. (Laws of Exponents) Suppose a,b>0 and x,y¢€R. Then 
1 OF 


(a) orem. (by yt= 2d. (Ee 


be, (d) (b")"=b™. ~— (e) (ab)” = ab". 





Proof. Below, = and = denote the use of Definitions 335 and 85 (exp is the inverse of In), 

respectively. 

(a) b°bY = exp (xInb) exp (yInb) = exp (xInb + ylnb) = exp[(x + y) Ind] 2 bY. 
r ; oan 

(b) b-* = exp (-zInb) 2 exp (-Inb") 2 exp (in =) = - 

4 exp(zlnb) « b* 695 

~exp(ylnb) bY 


(d) (b")” = exp [y nb”) ] 2 exp [zy (Inb)] * pry 634 
(e) (ab)” = exp [In (ab)] © exp [x (Ina +Inb)] = exp (xlna+aInb) 
a [exp (x In a)| [exp (oot b)] * gt pr 685 


Fact 63. For every x € R, e* =expz. 


Proof. By Definitions 335 and 86, 





(c) b” ¥ = exp[(x- y) Ind] = exp (aInb- ylnb) 


























e” = exp (az lne) = exp(a-1) =expz. 


Fact 23. Let x > 0. 
(a) Tf o> 0) then b7 > 0: 


(b) If b>1, then b” > 1. 





Proof. (a) b” = exp(axInb) > 0 because Range exp = R*. 
(b)Inb>0 and zInb> 0, so that b” = exp(aInb) > 1. 

















631mm 4+, use Fact 224(c). 

632% 2 and 2, use Facts 223 and 222(c), respectively. 
633m 2 use Fact 224(e). 

ad Er 2 use Fact 223. 

635In © and 4, use Facts 222(b) and 224(c), respectively. 
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147. Appendices for Part VI. Probability and Statistics 


147.1. How to Count 





Theorem 60. (AP.) Jf A and B are disjoint, finite sets, then |AU B| =|A|+|BI. 


Proof. Let A= {aj,a9,...,a,} and B = {b),b,...,b 7}. Then 


ANB AO), 00 5559 Oigy Diy 09344509} 3 





We have |A| = p, |B] = g, and |Au B| = p+q. The result follows. 











n 
Corollary 55. If Ai, Ao, ..., An are disjoint, finite sets, then |UiL, Aj| = > |Aj|- 





j=l 














Proof. By induction (details omitted). 


Theorem 61. (MP.) [f A and B are finite sets, then |A x B| =|A| x |BI. 


Proof. Let A = {aj,a@9,...,@,} and B = {b),b,...,b7}. Then 
A B-={(s. (orb) (0B) ons) (ab) (8) 


le) tb) Cb) 





We have |A| = p, |B| = q, and |A x B| = pq. The result follows. 











Corollary 56. [f A;, Ao, ..., An are finite sets, then |x?_, A;| = 1, |Ail. 








Proof. By induction (details omitted). 
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Theorem 62. (IEP.) Jf A and B are finite sets, then |AU B| =|A|+|B|-|An Bl. 


Proof. AUB =(A\ (AnB))uB. So by the AP, |AUB|2|A\ (AnB)|+|B]. 
Now, (A\ (An B)) uU(AnB) = A. So also by the AP, |A\ (An B)|+|AnB| = [Al or 
|A\ (An B)|2|A|-|An BI. Plug 2 into = to get the desired result. 














Corollary 57. If A,, Ao, As, are finite sets, then 


| Ai U Ao U As| = |A;| =e | Ao| =F | A3| = |Ay n Ag| = | Ai n As| = | A2 n As| ar |Ai n A» n As| : 














Proof. Similar to the previous proof, just more tedious. 





And here’s the generalisation of the IEP: 


Corollary 58. If A,, Ao, ..., An, are finite sets, then 


Ur Ad=IAd- YY Ain Alt = [Ap Ag Ag) — ++ + (-1)" | Al- 


j=l 2,9 distinct i,j,k distinct 








Proof. By induction (details omitted). 











Theorem 63. (CP.) Jf A and B are finite sets and AC B, then |A\ B| =|A|- |B]. 


Proof. B and A\ B are disjoint, finite sets. Moreover, BU(A\ B) = A. So by the AP, 
|B|+|A\ B| =|A|. Rearranging yields the desired result. 














Corollary 59. [f A ts a finite set and By, Bo,...Bn¢ A are disjoint, then 


n 


JA \ Un Bil = |Al- Dil 


t=1 








Proof. By the corollary to the AP, |U',B;| = )>|B;|. The result then follows by the CP. 
i=l 
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147.2. Circular Permutations 


Consider n objects, only k of which are distinct. Let 71, ro, ..., and rz, be the numbers of 
times the Ist, 2nd, ..., and Ath distinct objects appear. We already know from Fact 227 
that the number of (linear) permutations of these n objects is 
n) 
rylrol... rp 


We also know that m distinct objects have m! (linear) permutations and (m-1)! circular 
permutations. 


A reasonable conjecture might thus be that the number of circular permutations of the 
above n objects is 


(n-1)! 
rylrol... rp! 


The above conjecture sometimes “works”—e.g. SEE has 3!/2! = 3 (linear) permutations 
and SEE indeed also has (3 - 1)!/2! = 1 circular permutation. However and unfortunately, 
this conjecture is, in general, incorrect. Here are two counter-examples. 


Example 1588. There are 3!/3! = 1 (linear) permutations of the three letters AAA. 


If the above conjecture were true, then there ought to be (3 - 1)!/3! = 2!/3! = 1/3 circular 
permutations of AAA. But this is not even an integer, so obviously it cannot be the num- 
ber of circular permutations of AAA. In fact, there is also exactly 1 circular permutation 


of AAA. 


Example 1589. There are 6!/(3!3!) = 20 (linear) permutations of the six letters 
AAABBB. 


If the above conjecture were true, then there ought to be (6 - 1)!/(3!3!) = 10/3 circular 
permutations of AAABBB. But this is not even an integer, so obviously it cannot be 
the number of circular permutations of AAABBB. In fact, there are exactly 4 circular 
permutations of AAABBB. 





A general solution (i.e. formula) is possible but is a bit too advanced for A-Levels.°*° 





636See e.g. this Handbook on Combinatorics. 
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147.3. Probability 


Proposition 21 (p. 1197 above). Let S be the sample space, 4 be the corresponding 
event space, and A, B be events. If the probability function P: i > R satisfies the 
Kolmogorov Axioms, then P also satisfies the following properties: 


1. Complements. P(A) =1-P(A‘°). 


. Probability of Empty Event is Zero. P(@) = 0. 

. Monotonicity. If Bc A, then P(B) < P(A). 

. Probabilities Are At Most One. P(A) <1. 

. Inclusion-Exclusion. P(Avu B) = P(A) + P(B)-P(AnB). 





Proof. (Continued from p. 1197.) 


2. Probability of Empty Event is Zero. @N A=@ and @UA=A. And so again by 
the Additivity Axiom, P(@u A) = P(A) = P(@) + P(A). Thus, P(@) = 0. 


But also by definition, AU A°=S. Hence, P(Au A‘) = P(s). 
By the Normalisation Axiom, P(s) = 1. 


3. Monotonicity. An{B.\ A} =@ and AU{B\ A} = B. Thus, by the Additivity 
Axiom, P(B) = P(A) +P(B. A). By the Non-Negativity Axiom, P(B \ A) > 0. Hence, 
P(B) > P(A). 


4. Probabilities Are At Most One. Any event A is a subset of S. And so by 
Monotonicity, P(A) < P(s). But by the Normalisation Axiom, P (s) = 1. Thus, P(A) <1. 


5. Inclusion-Exclusion Principle. By the Additivity Axiom, P(AU B) = P(A)+P(B\ 
A). 

Also by the Additivity Axiom, P(An B)+P(B\. A) = P(B). 
Altogether then, P(Au B) = P(A) + P(B)- P(An B). 
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147.4. Random Variables 


Proposition 22 (p. 1233). The expectation operator E is linear. That is, if X and Y 
are random variables and c is a constant, then 


(a) Additivity: E[X +Y]= E[|X]+E[Y], 
(b) Homogeneity of degree 1: E[cX]=cE|X]. 





Proof. This proposition applies even for non-discrete random variables. But we’ll prove 
this proposition only for the case where the random variable is discrete. 


We'll use the linearity of the expectation operator. We prove (b) first. 


(() E[eX]= Yo P(X=k)-(ck)=c SY P(X=hk)-k=cE[X]. 


keRange(x) keRange(x) 


(a) E[X+Y] 


~ > 2 PX =k Y =De(k+1) 


keRange() leRange(y) 


- SY k DY P(X=khY=)+ Yl VY P(xX=k,Y=1 


keRange(x) JleRange(y) leRange(y) keRange(x) 


- SY kP(X=k)+ YO P(¥=) 


keRange(x) leRange(y) 





-E[X]+E[Y]. 
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Proposition 23 (p. 1241). Let X and Y be independent random variables. Let c be 
a constant. Then 


(a) Additivity: Var_X +Y]= Var[X]+ Var[Y], 
(b) Homogeneity of degree 2: Var[cX] = c’ Var[X]. 





Proof. We use Fact 236 and the linearity of the expectation operator. 
(b) V[cX]=E[(cX)’] -(cux)? = CE[X?] - 22, = 2 (E[X?] - 22) = 2V[X]. 
To prove (a), we’ll also use Lemma 24: 
V[X+Y]=E[(X+Y)]-(E[X+Y])’ 


= E[X?+¥?4+2xY]-(E[X]+E[Y]y 


= E[X?]+E[Y?] +2E [XY] - (ui + uy + Quxpy) 














= E[X?]-p& + E(Y?] - py + 2(E[XY] - uxpy). 


V[X] V[Y] 0 
Lemma 24. If X and Y are independent random variables, then E[XY]= E|X] EY]. 


Proof. We prove this Lemma only for the case where X and Y are both discrete. 


E[XY]=).ViP(X =k,Y =1)-kil 
k 1 


=) > POXHhPWa=)ek (independence) 
ko 


-¥ (PO = HELP =0-1)= DP =H KEL) 
k l k 





-E[Y] 8 P(X =k) k=E[Y]E[X]. 
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Fact 300. (a) Let X be the number of fair coin-flips until we get two consecutive heads. 
Let Y be the number of fair coin-flips until we get HT consecutively. Then E[LX | = x =6 
ond fay = 


(b) Flip a fair coinn+1 times. This gives us n pairs of consecutive coin-flips. Let A 
be the proportion of these n pairs of consecutive coin-flips that are HH. Let B be the 
proportion that are HT. Then E| A] =u, =1/4 and E[B] = wp = 1/4. 





Proof. (a) To find jzx actually requires a clever, new trick. Let 


p = E[Additional number of flips to get H H|Last flip was TJ, 
q = E[Additional number of flips to get H H|Last two flips were TH]. 


Observe that p is the number of flips, if we’re “restarting” . Thus, p = wx. Now, 


q=P (Next flip is H) x 1+ P (Next flip is JT) x (1+ p) 
=0.5x1+0.5 x (1+p)=1+0.5p. 
(Explanation: If the next flip is H, then we’ve completed HH and this took us only 1 more 


flip. If instead the next flip is 7’, then we start all over again; we’ve already taken 1 flip 
and are expected to take another p flips.) Similarly, observe that 


p =P (Next flip is H) x (1+q) +P (Next flip is T’) x (1+>:) 
= 0.5 x (2+0.5p) + 0.5 x (1+ p) =1.5+0.75p. 


(Explanation: If the next flip is H, then we expect to take, in addition, another gq flips. If 
instead the next flip is 7’, then we start all over again; we’ve already taken 1 flip and are 
expected to take another p flips.) 


Hence, p= 6 = x. The reasoning used above is illustrated by the probability tree below. 
Let’s now find py. Again, let 


r = E[Additional number of flips to get HT |Last two flips were TT], 
s = E[Additional number of flips to get HT|Last flip was H]. 


Observe that r is the number of flips, if we’re “restarting”. Thus, r = wx. 


(... Proof continued on the next page ...) 
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(... Proof continued from the previous page ...) 


Now, also observe that 


s =P (Next flip is 7) x 1+ P (Next flip is H) x (1+) 
=0.5x1+0.5x(1+s)=1+0.5s. 


(Explanation: If the next flip is 7, then we’ve completed HT and this took us only 1 more 
flip. If instead the next flip is H, then we’ve already taken 1 flip and are expected to take 
another s flips.) 


So s=2. Similarly, observe that 


r =P (Next flip is H) x (1+s)+P (Next flip is T) x (1+1r) 
=0.5x (14+2)+0.5x (1+r)=2+0.5r. 
(Explanation: If the next flip is H, then we’ve already taken 1 flip and are expected to 


take another s flips. If the next flip is 7, then we’ve already taken 1 flip and are expected 
to take another r flips.) 


Sor aa aie. 


(b) Let S; be the random variable that indicates whether the ith pair of consecutive coin- 
flips is HH. That is, S;=1 if so and S; =0 if not. Then 


Ge ae 
= . 





A 


And so, E[A]=E ——— 


But E[S;] =1/4. Thus, E[A] = 1/4. 
The proof that E[B] = 1/4 is similar. 
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147.5. The Normal Distribution 


Fact 301. [ e dar = JT. 





Proof. Omitted. See @ or WIKIPEDIA. 














Fact 241 (p. 1260). Let Z ~ N(0,1) and ¢ and ® be its PDF and CDF. 


il 
Z 


P(oco) =1. (As with any random variable, the area under the entire PDF is 1.) 


o(a) > 0, for alae R. (The PDF is positive everywhere. This has a surprising 
implication: however large a is, there is always some non-zero probability that Z >a.) 


3. E[Z]=0. (The mean of Z is 0.) 


4. The PDF @ reaches a global maximum at the mean 0. (In fact, we can go ahead and 


compute $ (0) = 1/V27 * 0.399.) 


) Var Zl =1. (ihe variance of Z 75 1.) 
.P(Z<a)= P(Z<a). (We've already discussed this earlier. It makes no difference 


whether the inequality is strict. This is because P(Z = a) = 0.) 


. The PDF @ ts symmetric about the mean. This has several implications: 


(a) P(Z >a) = P(Z < -a) = ®(-a). 

(b) Since P(Z >a) =1- P(Z<a)=1-(a), tt follows that ®(-a) = 1 - ®(a) or, 
equivalently, ® (a) = 1-®(-a). 

(c) ®(0) =1-6(0) =0.5. 


. P(-1<Z<1)=@(1)- ®(-1) x 0.6827. (There is probability 0.6827 that Z takes on 


values within 1 standard deviation of the mean.) 


. P(-2< Z <2) = @(2) - ®(-2) ~ 0.9545. (There is probability 0.9545 that Z takes on 


values within 2 standard deviations of the mean.) 


. P(-3< Z <3) = ®(3) - ®(-3) x 0.9973. (There is probability 0.9973 that Z takes on 


values within 3 standard deviations of the mean.) 


. The PDF ¢ has two points of inflexion, namely at +1. (The points of inflexion are one 


standard deviation away from the mean.) 





Proof. 1. Let u=2x/V2. We have wu? = 0.52? and du/dx = 1/2. And using Fact 301: 


—0.52 — x du 1 ae 
(00) = [ae ae = a e058 2 da i ee. 


a 


ee vi 


Proof continued on the next page ...) 
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(... Proof continued from the next page ...) 2. Obvious. 
3. K| 4) = [+ (a )jode= a7 a (-ne-**") dx = ae [eos] ss [0-0] =0 
—oo J2n —0o J2n = 09 J2n 
F oa >0; ia<d, 
A ney Gy ead gee a SiS aoc 
PA ) da /2n Jon 0, if a 0, 
<0, i asd. 


! 
U 


2 
re Oot dx: 





5. VIZ ip x -0)d(x dx = f x g tee de -—— [ 7 
(Z)= [(e-oy'o(e) de= f "2? sf 
1 le 7 0.522 | 1 a 0.52? 
=o es — | eo * da ae aoe eel, 
\/ 270 ~oo \/27T J-00 


@ is continuous, increasing for a < 0 and decreasing for a > 0. Thus, ¢@ reaches a global 
maximum at 0. By plugging in a = 0, we can compute this global maximum value to be 
(0) = 1/V 27 x 0.399. 

6. By the Additivity Axiom, P(Z <a)=P(Z<a,Z= 
0=P(Z <a), as desired. 


7. Clearly, ¢(a) = e050 IY 27 = en 0-5(-a)" Hy 27 = ((-a) for allaeR. Thus, ¢ is symmetric 
about the vertical axis x = 0, which is also the mean. 


a)=P(Z<a)+P(Z=a)=P(Z<a)+ 


7(a). Using the substitution u =—a, we have du/dx =—-1 and 
u=—a @—0. 5u2 


ie T=00 e 9. 5a? e0- 5u2 
P(Z>a f. eae [- du = 
u=-a u=-co 4/27 


7(b) and 7(c). Obvious. 


8, 9, and 10. These can be computed numerically, using a computer. 





du =P(Z < -a) = ®(-a). 


>0, ifa<-tl, 
p 7 i =0, i.@s—|, 
—-a 2 2 
11. pcan a = —__p~9.5a = —__p~9.5a az —-1 se 
Fee (a) da /on Jan ( ) <0, if -l<a<l, 
=, ife@=t, 
>0, ifa>tl. 


Hence, +1 are the only two points of inflexion since @ changes concavity only here. 
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Theorem 64. Leta,b¢R be constants with a #0 and X be a continuous random variable 
with PDF fx and CDF Fy. Let Y =aX +b. Then 


fr (c)= als (—). 





Prooy. Fy (e)=P(Y <c)]=FP (aX +0<c) =P (ax <c=)), 
Case #1. If a> 0, then Fy (0) =-+=P (aX <e-0)=P(X <=) = Fx (>), 
a a 


Now differentiate: 
Fy (= 2Fx() = fr (= =fx(—) = The (—). 


lal 


Case #2. If a< 0, then Fy (c) =---=P(aX <c-b) = p(x2—)-1- Fx(—). 


Now differentiate: 
iv Z[1- Fx ()]- 0 --Fte(S)- Gae(S) 


Fact 242 (p. 1270). Let X ~ N(pu,0") and a,b € R be constants. Then aX +b ~ 














N(ap + 0, a’o"). 





Proof. By Theorem 64, the PDF of aX +0 is given by 





2 

1 —-) L 1 -0s(==#) 1 -ospecoe 
a C)=— SS |S e€ = ———e ao : 
fax+r (©) if ( om aes 


a 
But this lattermost expression is indeed the PDF of the random variable with distribution 
N (ay + b, aro}: 
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147.6. Sampling 


Fact 244 (p. 1306). Let S = (X1, Xo,...,Xn) be a random sample of size n. Let X be 
the sample mean and S? be the sample variance. Let a€R be a constant. Then 


Dees, nay = bates) 


Bea 


Ph 
i 
n-1l 








Ore. a onden)ese = 


n-1 





Proof. This proof may look intimidating but it’s really just a bunch of tedious algebra. (I’ve 
also tried to go slow with the algebra, so more steps are explicitly listed than is typical in 
a proof.) 


(a) Start from the definition of the sample variance and do the algebra: 


PCa) Paes 





S$? = —— Se 
p=—1 n—-1 ae ; 
= weer XP — WX? 2K VE Xi 7 wy X? -nX? - 2X (nX) 7 yt X2 -nX? 
7 n=l 7 n—-1 ~ n-1 
2 
7 ee ra os Ts 
n= oe | 


(b) Start from the formula found in (a) and do the algebra: 


n 2 [Dita Xi] n —_ 2 [oh (Xi-ata)]? 
ee eS ee) 





°: n= n-1 
Yo [Xi = a) + a2 +2(X;-a) a a Peed a)’ 
7 a= 
mm, (X;-a)? + D8 a2 + 20D", (X; - a) - Berd ook Mina) Tha 
n-1 . 
52, 0ce a) insta SOG ~ay = [Eh (Kira) + (na) +2na Fh Kira) 
i= 





_ ha (Xia)? - Beal 














n-1 
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Proposition 24 (p. 1312). Let (X1,X2,...,Xn) be a random sample drawn from a 
distribution with population mean fs and population variance o?. Let X be the sample 
mean and S* be the sample variance. Then 


(a) E[X] =p. And (o) B\S* |= 0": 





Proof. (a) was proven in Exercise 508. We prove only (b). 


Equation + is the key piece of intuition (and is formally proven below): 


The degree to which The degree to which The degree to which 


X; varies from X X varies from pu X; varies from p41 
—_ TF —_—_— —4—-——~. 


E[(xi-xX)] +  E[(X-p)] 0 B[(X-p)"]. 














Rearranging, 
PAO; Population variance Variance of sample mean 
= fe ae ee 2 a _n-l1 , 
E[(x:-X)]= E[(%-4)"] - E[(X- 1) | = 0? ~— = ——“9?. 


a9 
We’ve just shown that (X; —X ) is a biased estimator for 07. And in turn, S$? is not: 


= n nm als 
= 7 = n on 


Sa eak)y 


E[S?]=E ; 











As promised, here is the proof of equation é: 


E [(X - x) +E [(x Z 1) | -E (Xi oe ee 1) | 
(X;-X)+(X-u))?-2(Xi-X)(X-p)] 














L( 
-B[(%- 1)? 2(X:-X)(X-p)] 
= E[(X;- 1)? -2( XX - pX,- X? + uX)| 
= E[(X;- 4)? -2(X;X - X?)] 
= E[(X;- )?]+2{E[X?]-E[Xx;]} 
= E[(X; - Ht) = 
The last equality follows because E [x Xi] = ees =F [xe] =F [X?]. 


1739, Contents www.EconsPhDTutor.com 


147.7. Null Hypothesis Significance Testing 
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147.8. Calculating the Margin of Error 


Let yz be the true population proportion (of votes for Dr. Chee). Say we take a random 
sample of size 900.’ Let X be the sample number of votes for Dr. Chee. We know that 
X ~ B(900, 2). 


Our confidence level is 95%. So we want to find the smallest k such that 
P(900H-k < X < 900u+ k) > 0.95. 


And +k/900 will be our margin of error. 

Case #41: Perfect hindsight: py = 9142/23570. 

With perfect hindsight, we now know that p = 9142/23570. So X ~ B (900, 9142/23570). 
We want to find the smallest & such that P (349-k < X < 349+k) > 0.95. 

where 900 x 9142/23570 » 349. Using the “Binomial” sheet at the usual link, we have 


P (349 — 28 < X < 349 + 28) » 0.9488, 
P (349 — 29 < X < 349 + 29) » 0.9565. 


Thus, k = 29. Now, 29/900 » 3.2%. Thus, at a 95% confidence level, the margin of error 
is +3.2%. This is the “true” margin of error, assuming we know p. But this assumption 
defeats the point of sampling—we don’t know jw, which is why we’re doing sampling in the 
first place! 


What we want instead is the margin of error in the case where pz is unknown. 
Case ##2: Without perfect hindsight: » unknown. 


With j unknown, a conservative interpretation would be to find the smallest & such that 
for all pp, P(900u-—k < X < 900u +k) > 0.95. 


(... Analysis continued on the next page ...) 





637This is slightly different from what actually happened: (1) The actual random sampling was most likely 
without replacement (which would change the maths slightly). (2) 100 votes were taken from each of 9 
different polling stations (which would also change the maths slightly). 
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(... Analysis continued from the previous page ...) 


Observe that Var [|X] = 900y(1 - 2) is maximised at = 0.5. Thus, it is plausible®® that: if 
k; satisfies 


X ~B(900,0.5) => P(900x0.5-k< X < 900 x 0.5+k) > 0.95, 
then & also satisfies 
X ~B(900, uw) = > P(900x0.5-k< X < 900 x 0.5+k) > 0.95. 
Our problem thus boils down to finding the smallest k such that for X ~ B (900, 0.5) implies 


P(450-k< X <450+k) > 0.95. 


We have  P(450-29< X < 450+ 29) « 0.9508, 
P (450 - 28 < X < 450 + 28) « 0.9426. 


We conclude that the smallest such k is 29. Now, 29/900 » 3.2%. So the margin of error 
may be given as +3.2%. This is the same as what was calculated above, which is not 
surprising, since 9142/23570 » 0.388 is close to 0.5. 


The reader will, of course, wonder why the Elections Department stated that the margin of 
error was +4%, rather than +3.2% as I calculated here. I am not sure myself. My guess is 
that they probably don’t bother going through all the above calculations afresh each time. 
Instead, each time they report a sample count, they simply read off the margin of error 
from a table that looks something like this: 





Sample Size | Approximate Margin of Error 
400 — 599 +5% 
600 — 999 +4% 
1000 — 2000 +3% 





(By the way, note that it is common to use the CLT approximation when calculating the 
margin of error. I have not done so here. Instead, I’ve stuck with using the original, exact 
binomial distribution. ) 





38Proving this would need a little work though. 


1742, Contents www.EconsPhDTutor.com 


147.9. Correlation and Linear Regression 


Fact 302. Let 21, 20,...,2p and y1, yo,-.-,Yn be numbers. Let x = yaa and y = ae 
Then 


Ve) 


Den (—2) De ay 


Proof. Let u = (41 - %,%2-Z,...,%—-—Z) andv = (y1-9,y2-Y,---, Yn —Y) be n-dimensional 


€ [-1,1]. 





vectors. Then 


ie i-1 (44 — 2) (y= 


~z)\/o" all 
ae inl (x; i) (yi - y) 





But from what we learnt about vectors,°” if @ is the angle between two vectors, then 
u-v 
cos 0 = ——. 
jul |v| 





Since cos@ € [-1,1], the result follows. 














639Of course, in this textbook, we’ve only shown that this is true for two- and three-dimensional vectors. 
But let’s just wave our hands and say that this is also true for higher-dimensional vectors. 
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Proof. (Continued from the proof begun on p. 1350.) Remember that the data (21, 72,...,2n) 
and (41, Y2,---;Yn) are given. Thus, we can treat all the x;s and ys as constants. We have 


<a = er: = © (20052 = \-2 [yi (a+bxi)]. 


Thns, Ya? =0 —> yi - (a+ bai) = 0 —> a+y bz. 


We also have 


a2 = > =i - > (22°) = > = 27; Ei = (a f bx) 


Thus, — Dat =0 <> > EI - (a + bx) x; = 0. Plugging + into this last equation, we 
have > E - (9 ~ bz + bx) x; =0. Tedious algebra yields Formula (ii): 





) » tie — Ney 
Veen 





More algebra yields Formula (i). 











1744, Contents www.EconsPhDTutor.com 


147.10. Deriving a Linear Model from the Barometric Formula 


According to NASA (1976), “U.S. Standard Atmosphere”, p. 12, eq. (33a) (PDF), the 
barometric formula (relating pressure P to height H above sea level), in the case where 
Lup + 0 is given by 
doM 
T REL Mb 
Pa | Mb | | 
Tp + Lu (h - he) 

where Py, Tip, Lup, ho, 90: R* are simply constants. Now, do the algebra: 


! 
Io M 


Tub REM» 
Pes 
Tap + Lup (h- he) 


Ps 


gM 
_ Pp, [Fat Soa al hp 








Tb 








GM Eup 
In P = ln Pyyy - In} 1+ h-h 
7 n Py» R*Lan | Tap ( IF 


Now, for heights up to 11000 m above sea level, is simply the height at sea level. That 





is, hy =O m. If we also let a = In Py» and b = - send get rid of the subscripts in Ljy» 


i 
and Thy» (just to make it neater), then we have 


L 
InP=a+blnj1+—h}). 
aBaaxbin(ie2s) 


For heights up to 11000 m above sea level, L = —0.00065 kelvin per metre is the temperature 
lapse rate (the rate at which the temperature falls, as we go up in altitude; see p.3, Table 
4) and T = 288.15 kelvin is the standard sea-level temperature (also precisely equal to 15 
°C; see p. 4). 
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Part IX. 
Answers to Exercises 
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148. Part 0 Answers (A Few Basics) 


148.1. Ch. 1 Answers (Just To Be Clear) 


(This chapter had no exercises.) 


148.2. Ch. 2 Answers (PSLE Review: Division) 


Tip: Click on the exercise number to return to that exercise. 


Al. Long division for 8057 + 39. The dividend is 8057 and the divisor is 39. 


206 

39 | 8057 
7800. 
257 

0 


257 


Thus, —— = 206—. 


The quotient is 206 and the remainder is 23. 


A2. The error is in Step 5. Since 7 = y, we have x-y =0. And so we cannot simply divide 
both sides by x-y. (Step 7 is not wrong because we already declared that x > 0, so we can 
go ahead and divide by z.) 


A3. The error is in the first step, when he “divide[s] both numerators by x”. The given 
equation’s solutions are x = 8 and x = 0. 
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148.3. Ch. 3 Answers (Logic) 


A4. Every conjunction (AND statement) is false because (at least) one of the two state- 
ments forming the conjunction is false. 


In contrast, every disjunction (OR statement) is true because (at least) one of the two 
statements forming the disjunction is true. 


(a) BANDC: “Germany isin Asia AND 1+1=2.” X 
(b) AANDD: “Germany is in Europe AND 1+1=3.” xX 
(c) CAND D: “141=2AND1+1=3" X 
(d) BORG: “Germany isin Asia OR 14+1=2.” V 
(e) AORD: “Germany isin Europe OR 1+1=3.” Vv 
(f) CORD: “141=2 OR 14+1=3" V 


A5. NOT-E: “It’s not raining.” NOT-F: “The grass is not wet.” NOT-G: “I’m not 
sleeping.” NOT-—H: “My eyes are not shut.” 


A6(a) If x = 0.5, then O is true while N is false. Since O and N do not always have the 
same truth value, we say that are not equivalent and writeO <> N. 


(b) If 2 = -3, then ¥ is true while a is false. Since y and a do not always have the same 
truth value, we say that 7 and a are not equivalent and write y < a. 


(c) Both statements are always true. So, we say they’re equivalent and write © <=> ©. 
(d) Statement is always true, while @ isn’t always true (e.g. x =1). So, & < @. 


(e) Statement & is always false, while @ isn’t always false (in particular, @ is true if x = 5). 


So, = <> @. 


A7(a) NOT-(B AND C) is true. There are two ways to see this: 


¢ Since B AND C is false, its negation NOT- (BAND C) must be true. 


¢ By Fact 1, NOT-(B AND C) is equivalent to NOT-B OR NOT-C: “Germany is not 
in Asia OR. 1+1 #2”. Which is true because NOT-B: “Germany is not in Asia” is true. 


(b) NOT-(A AND D) is true. Two ways to see this: 


¢ Since A AND D is false, its negation NOT-(A AND D) must be true. 


¢ By Fact 1, NOT-(A AND D) is equivalent to NOT-A OR NOT-D: “Germany is not 
in Europe OR 1+1 #3”. Which is true because NOT-D: “1+ 1 # 3” is true. 


(c) NOT-(B AND D) is true. Two ways to see this: 


¢ Since B AND D is false, its negation NOT- (BAND D) must be true. 


¢ By Fact 1, NOT-(B AND D) is equivalent to NOT-B OR NOT-D: “Germany is not 
in Asia OR. 1+1 #3”. Which is true because NOT-B: “Germany is not in Asia” is true. 
(Indeed, NOT-D: “1+ 1 #3” is also true.) 
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A8(a) NOT-(B ORC) is false. There are two ways to see this: 


¢ Since BOR C is true, its negation NOT-(B ORC) must be false. 

¢ By Fact 2, NOT-(B ORC) is equivalent to NOT-B AND NOT-C: “Germany is not 
in Asia AND 1+1+#2”. Which is false because NOT-C: “1+ 1 # 2” is false. 

(b) NOT-(A OR D) is false. Two ways to see this: 


¢ Since AOR D is true, its negation NOT- (A OR D) must be false. 


¢ By Fact 2, NOT-(A ORD) is equivalent to NOT-A AND NOT-D: “Germany is not 
in Europe AND 1+1 #3”. Which is false because NOT-A: “Germany is not in Europe” 
is false. 


(c) NOT-(B OR D) is true. Two ways to see this: 


¢ Since BOR D is false, its negation NOT- (BOR D) must be true. 


¢ By Fact 2, NOT-(BORD) is equivalent to NOT-B AND NOT-D: “Germany is not 
in Asia AND 1+1+#3” Which is true because both NOT-B: “Germany is not in Asia” 
and NOT-D: “1+1#83” are true. 


A9. Remember: An implication P = > Q is true if either its hypothesis P is false or its 
conclusion @ is true. 


Here, the hypothesis of each statement (a)—(d) is false (TPL is not a genius, 7t is not 
rational). Hence, each statement is true. 





























A10. Maths/Logic Everyday English 
G=—>H That I’m sleeping implies that my eyes are shut. 
G=>H I’m sleeping only if my eyes are shut. 
If G, then H. If I’m sleeping, then my eyes are shut. 
If G, H. If I’m sleeping, my eyes are shut. 
HifG. My eyes are shut if I’m sleeping. 
H when G. My eyes are shut when (or whenever) I’m sleeping. 
H follows from G. | That my eyes are shut follows from the fact that I’m sleeping. 
G is sufficient for H. That I’m sleeping is sufficient for my eyes to be shut. 
H is necessary for G. | It is necessary that my eyes are shut, for me to be sleeping. 





All. By Definition 5, P = = Q is equivalent to NOT-PORQ. So, by Fact 2, the 
negation of NOT-P OR Q is 


P AND NOT-Q. 


A12. By Fact 3, NOT-(K == L) is equivalent to 
k AND NOT-L. 


That’s “x is donzer and not kiki.” So the answer is (d). 
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A13(a) G = > H is true (at least for most humans). The converse H = > G is false: 


“If my eyes are shut, then I’m sleeping.” 
Or, “That my eyes are shut implies that I’m sleeping.” 


Two counterexamples to H ==> G: (1) I may be resting my eyes. (2) I may be blinking. 
In either counterexample, my eyes are shut, but I’m not sleeping. 


(b) M = > N is false (counterexample: x = 0.5). The converse N = > M is true: 
“If x >1, then x > 0.” Or, “That x2 > 1 implies that x > 0.” 
(c) y = ais false (counterexample: x = —-3). The converse a => 7 is true: 


“If ¢ = 3, then 2? = 9. Or, “That x = 3 implies that x? = 9.” 


A14(a) The converse is “If the Nazis won World War II (WW2), then Tin Pei Ling (TPL) 
is a genius.” True because the hypothesis is false. 


(b) The converse is “If the Allies won WW2, then TPL is a genius.” False because the 
hypothesis is true AND the conclusion is false. 


(c) The converse is “If I am the king of the world, then 7 is rational.” True because the 
hypothesis is false. 


(d) The converse is “If Lee Hsien Loong is Lee Kuan Yew’s son, then 71 is rational.” False 
because the hypothesis is true AND the conclusion is false. 


A15(a) Since A is true while B is false, the implication , A => B is false. 


Since B is false, the converse B ==> A is true. (Alternate answer: Since A is true, the 
converse B ==> A is true.) 


(b) Since C is true, the implication , A => C is true. 
Since A is true, the converse C' = > A is true. 
(c) Since A is true while D is false, the implication , A => D is false. 


Since D is false, the converse D = > A is true. (Alternate answer: Since A is true, the 
converse , D ==> A is true.) 


(d) Since C is true while D is false, the implication C ==> D is false. 


Since D is false, the converse D = > C is true. (Alternate answer: Since C’ is true, the 
converse , D = > C is true.) 
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A16. Remember: An implication P = > Q is true if either its hypothesis P is false or its 
conclusion @ is true. 


(a) If a is true, then P => Q (iii) could be true or false and 











Q = > P (i) must be true. 
(b) If ie is false, then P = > Q (i) must be true and 
Q => P (iii) could be true or false. 





) 
(c) If P => Qis true, then Q == P (iii) could be true or false. 


(d) If P = > Q is false, then Q => P 





as an a ae a 


i) must be true. 





Parts (c) and (d) here correspond to Fact 4 (a) and (b). 


A17. No. This is another example of affirming the consequent or the fallacy of the 
converse. 


A18. Given the statement “If x is German, then x is European”, its contrapositive is 


(d) “If x is not European, then x is not German” (must be true). 
(a) “If x is European, then x is German” could be true (e.g. x is Xi Jinping) or false (e.g. 
x is Boris Johnson). 


(b) “If x is not German, then x is not European” could be true (e.g. x is Angela Merkel) 
or false (e.g. x is Boris Johnson). 


(c) “If # is not German, then x is European” could be true (e.g. x is Boris Johnson) or 
false (e.g. x is Xi Jinping) 

(d) “If x is not European, then x is German” could be true (e.g. x is Boris Johnson) or 
false (e.g. x is Xi Jinping) 


By the way, (a) is the original statement’s converse, while (b) is the inverse = “Negate 
both”. 


A19. O = > N is false (counterexample: x = 0.5) and so by Fact 6, N < O. 


A20. No two of the three statements are equivalent: 


e Y = X because John could have a NRIC but be a permanent resident (and hence 
non-citizen). 


« X = Z because John may be a newborn Singapore citizen who hasn’t yet obtained 
his pink NRIC. 


e Y = Z because John may have a NRIC but one that’s blue rather than pink. 
Likewise for the non-Singaporean version—no two of the three statements are equivalent: 


e X = Y because John could be a snake (and hence an animal) but not human. 
« Z = X because John could be a tree (and hence a living thing) but not an animal. 
« Z = Y because John could be a tree (and hence a living thing) but not human. 


1751, Contents www.EconsPhDTutor.com 








A21 All Yes All No Some Yes SomeNo 
(a) “All donzers “No donzer “Some donzer “Some donzer 
are kiki.” is kiki.” is kiki.” is not kiki.” 
(b) “All donzers “No donzer “Some donzer “Some donzer does 


cause cancer.” 


causes cancer, 


v] 


causes cancer.” 


not cause cancer.” 





(c) 


“All bachelors 


“No bachelor 


“Some bachelor 


“Some bachelor 





are married.” is married.” is married.” is not married.” 
(d) “All bachelors “No bachelor “Some bachelor “Some bachelor 
smoke.” smokes.” smokes.” does not smoke.” 


A22. By Definition 3, a true statement’s negation is false and a false statement’s negation 





is true. 


(a) Consider the statements, “All animals are dogs,” (All Yes) and, “No animal is a dog,” 
(All No). Both are false. So, All Yes and the All No statements are not always negations 


of each other. 


(b) Consider the statements, “Some animals are dogs,” (Some Yes) and, “Some animal is 
a dog,” (Some No). Both are true. So, Some Yes and the Some No statements are not 





always negations of each other. 






























































A23. Statement Negation 

(a) All Yes: “All donzers are kiki.” Some No: “Some donzer is not kiki.” 
(b) All No: “No donzer is kiki.” Some Yes: “Some donzer is kiki.” 

(c) Some Yes: “Some donzer is kiki.” All No: “All donzers are not kiki.” 

(d) Some No: “Some donzer is not kiki.” All Yes: “All donzers are kiki.” 

(e) All Yes: “All bachelors are married.” Some No: “Some bachelor is not married.” 
(f) All No: “No bachelor is married.” Some Yes: “Some bachelor is married.” 
(g) Some Yes: “Some bachelor is married.” All No: “All bachelors are not married.” 
(h) Some No: “Some bachelor is not married.” All Yes: “All bachelors are married.” 

(i) All Yes: “All donzers cause cancer.” Some No: “Some donzer does not cause cancer.” 
(j) All No: “No donzer causes cancer.” Some Yes: “Some donzer causes cancer.” 
(k) Some Yes: “Some donzer causes cancer.” All No: “All donzers do not cause cancer.” 

(1) | Some No: “Some donzer does not cause cancer.” All Yes: “All donzers cause cancer.” 
(m) All Yes: “All bachelors smoke.” Some No: “Some bachelor does not smoke.” 
(n) All No: “No bachelor smokes.” Some Yes: “Some bachelor smokes.” 

(o) Some Yes: “Some bachelor smokes.” All No: “All bachelors do not smoke.” 
(p) Some No: “Some bachelor does not smoke.” All Yes: “All bachelors smoke.” 
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A24(a) “No person is LeBron James” is an All No statement with subject “person” and 
predicate “LeBron James”. 


(b) The negation of All No is Some Yes: “Some person is LeBron James”. Which is true, 
since there 7s a person who is LeBron James—namely LeBron James himself. 


(c) Since the negation is true, the commentator’s statement is false. 


Where we place the word not is crucial. ‘The commentator wanted to negate the state- 
ment “Everybody is LeBron James”, but placed the word not in the wrong position. He 
incorrectly said, “Everybody is not LeBron James,” but should instead have said, “Not 
everybody is LeBron James”. This latter statement is true and is equivalent to the Some 
No statement, “Some person is not LeBron James”. 


Moral of the story: “All fruit are not apples” is different from “Not all fruit are apples”. 
Don’t get them confused. 


A25. Each of (a)—(n) could be true or false. (None must be true or must be false.) 


A26(a) “Some (i.e. at least one) prime number is even.” 
(b) “Some prime number greater than 2 is even.” 


(c) “No rational number is greater than 7t” or “All rational numbers are not greater than 
1.” 


(d) “Some number is either negative or imaginary.” 


(e) “Some number is either non-positive or rational”. 
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148.4. Ch. 4 Answers (Sets) 


A27. The set of the first seven integers is C = {1,2,3,4,5,6, 7}. 
A28. There is only one even prime number, namely 2. Hence, D = {2}. 
A29. X = {Lee Kuan Yew, Goh Chok Tong, Lee Hsien Loong}. 


A30(a) H contains two elements, namely F and G. (You can think of H as a box that 
itself contains two boxes, namely F' and G.) 

(b) A = {{{1, 3,5}, {100, 200}} , {1, 3,5, 100, 200}$. (Note that the braces go three-deep.) 
A31(a) J contains three elements, namely A, B, and G. (You can think of I as a box that 
itself contains three boxes, namely A, B, and G.) 

(b) J = {{1,3, 5}, {100, 200} , {1,3, 5, 100, 200}}. 

(c) Nope. H is a box that contains the two boxes F' and G, while J is a box that contains 
the three boxes A, B, and G. So the sets H and J are not the same. 

A32(a) Los Angeles ¢ The set of the four largest cities in the US. 

(b) Tharman ¢ The set of Singapore Prime Ministers (past and present). 

A33(a) Yes, because {1,2,3} and {3,2,1} both contain the exact same elements, namely 


the numbers 1, 2, and 3. The order in which we write out the elements of a set doesn’t 
matter. 


(b) No. The set {{1},2,3} contains a set containing the number 1 and the two numbers 2 
and 3. 


In contrast, the set {{3}, 2,1} contains a set containing the number 3 and the two numbers 
2 and 1. 


Since the two sets contain different elements, they are not the same set. 
A34. n(z) =n({LKY, GCT, LHL}) =3. 


A35. L is the set containing the first 50 odd positive integers; hence, n(L) = 50. And M 
is the set containing the first 99 negative integers; hence, n(/) = 99. 


A36. N = {102, 104, 106, 108,...,996, 998}. 


A37. The set W = {Apple, Apple, Apple, Banana, Banana, Apple} has only two distinct 
elements. Hence, n(W) = 2. We can rewrite the set more simply as W = {Apple, Banana}. 


A38. There is only one even prime number, namely 2. Hence, C = {2} and n(C) = 1. 
A39. The set of all primes is H = {2, 3, 5, 7, 11, 13, 17, 23, 29, ...}. 
A40. None. All of them contain infinitely many elements and so all are infinite. 


A41. The set {{{}},2, {@}, {}} contains only two elements—n (S) = 2. 


Observe that {@} = {{}} and {} = @. Hence, {@} and {} are repeated elements that we may 
ignore. Hence, the set {{{}},9, {@}, {3} = {{{}} ,@} contains only two elements—namely, 
(i) the set that contains the empty set; and (ii) the empty set. 
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A42. The set A =[1,1] contains the real numbers that are > 1 and < 1. There is only one 
such number, namely the number 1. And so, n(D) = 1. We can also write X = {1}. 


The set B = (1,1) contains the real numbers that are > 1 and < 1. There are no such 
numbers. And so, n(B) =0. We can also write B = {} =@. 


The set C = (1,1] contains the real numbers that are > 1 and < 1. There are no such 
numbers. And so, n(C) = 0. We can also write C= {} =@. 


The set D = [1,1) contains the real numbers that are > 1 and < 1. There are no such 


numbers. And so, n(D) =0. We can also write D = {} =@. 


The set £ = (1,1.01) contains the real numbers that are > 1 and < 1.01. There are infinitely 
many such numbers (e.g. 1.001, 1.0001, 1.0002). And so, n(Z) = o. 


For each term, we give three possible definitions: 


A degenerate interval is an interval that has (i) equal endpoints; (ii) 0 or 1 elements; 
(iii) finitely many elements. 


A non-degenerate interval is an interval that has (i) distinct endpoints; (ii) more than 
1 element; (iii) infinitely many elements. 


For this textbook’s official definitions of degenerate and non-degenerate intervals, see Def- 
inition 318. 


A43. R = (—o0, 00), R* = (0,00), Rj =[0, 00), R = (-0,0), and Ro = (-0, 0]. 


A44(a) Every integer is also a rational number and a real number; hence, Z ¢ Q,R. 
(b) A rational number is also a real number; hence, QE R. 
Some rational numbers are not integers (e.g. 1.5); hence, Q ¢ Z. 


(c) Some real numbers are neither rational nor integers (e.g. 7); hence, R ¢ Z,Q. 


A45. True. The set of current Singapore Prime Minister(s) is S = {Lee Hsien Loong}.The 
set of current Singapore Ministers is 


T = {Lee Hsien Loong, Tharman, Teo Chee Hean, Khaw Boon Wan,...}. 


Every element in S is in 7’; hence, SCT. 


A46(a) False. Counterexample: Let A= {1} and B= {1,2}. Then ACB, but A# B. 

(b) False. Counterexample: Let A= {1,2} and B= {1}. Then BCA, but Az B. 

(c) True. If A = B, then by Def. 8, every element in A is also in B; so, by Def. 17, AC B. 
(d) True. If A = B, then by Def. 8, every element in B is also in A; so, by Def. 17, BC A. 
(e) False (in particular, <= is false). Counterexample: same as in (a). 

(f) False (in particular, <— is false). Counterexample: same as in (b). 
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A47. Every square is a rectangle; so, S ¢ R. 
Moreover, some rectangles are not squares; so, S # R. 
Hence, by Definition 18, Sc R. 


A48. No. Counterexample: if A = {1,2} and B = {1,2}, then ACB, but A¢ B. 

A49. Yes. By Definition 18, Ac B requires that A ¢ B. 

A50. True. Suppose A ¢ B. Then either A = B or if not, then by Definition 18, Ac B. 
A51(a) [1,2] U[2,3] =[1,3].(b) (-00, -3) U[-16, 7) = (-00,7). (c) {0} UZ* = Z). 


A52(a) Every square is a rectangle. Hence, “the set of all squares and all rectangles” is 
itself simply “the set of all rectangles”, ic. SUR=R. 


(b) Every rational and every irrational is a real number. Moreover, every real number is 
either rational or irrational. Hence, the set of all rationals and irrationals is R (the set of 
all real numbers). 


A53(a) (4,7] 7 (6,9) = (6,7]. (b) [1,2] [5,6] = 2. (c) (-00,-3)n [-16, 7) = [-16, -3). 


A54(a) The only objects that are both squares AND rectangles are squares. Hence, the 
intersection of the set of squares and the set of rectangles is simply the set of all squares, 
Le: Oni= so, 


(b) It is the empty set @. This is because no object is both rational AND irrational. 
A55. V\T = {3} and V \U = {1,2}. 

A56(a) & = £0001, 0002, 0003, ...,9999}, |&| = 10000. 

(b) & = {Male, Female}, |@| = 2. 

(c) & = {Malay, Mandarin, Tamil, English}, |é| = 4 (source). 

(d) & = {Malay}, |é| = 1. 


A57(a) 8" = {0000, 0001, 0002, ...,2999, 4000, 4001, 4002, ... 9999}. 
(b) S’ = (—273.15, 30]. The lowest possible temperature (absolute zero) is —-273.15°C. 


A58. S = {Lee Hsien Loong, Lee Wei Ling, Lee Hsien Yang}. 
T = {Lee Wei Ling, Lee Hsien Yang}. 


A59. {Lee Kuan Yew, Lee Hsien Loong}. 


A60. In words, 

e Q is the set of all rationals x such that x is greater than 0. 

¢ Q) is the set of all rationals x such that x is greater than or equal to 0. 
¢ Z* is the set of all integers x such that x is greater than 0. 


¢ Zj is the set of all integers x such that x is greater than or equal to 0. 
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A6l(a) R ={xeR:2<0} (b) Q@ ={xeQ:2 <0} 
(c) Z ={reZ:x2<0} (d) Rj ={eeR:2<0} 
(e) Q={xeQ:2<0} (f) Zp ={xeZ:x <0} 
(g) (a,b)={xeR:a<a<b} (h) [a,b] ={reR:a<xr<bd} 
(i) (a,b]={teR:a<a<b} (j) [a,b)={weR:a<a<b} 
(k) (-o0,-3) U(5, 00) = {2 eR: 2<-3 OR > 5} 
()) (=e, 1] (2,3) U (308) =e eae LORD <a <3 ORGS 3} = {ee Rie ¢ (1,2), ¢23} 
(m) (-00,3)n (0,7) ={zeR:0<2< 3} 
(n) {Negative even numbers} = {x:v=2k,keZ}={2k:keZ} 
(0) {Positive odd numbers} = {m:2=2k-1,keZ*}={xe:x2=2k+1,k€ Zp} 
={2k-1:keZ*} ={2k+1:ke Zi} 
(p) {Negative odd numbers} = {w:x=2k-1,keZ}={e:r=2k+1,keZ} 
={2k-1:keZo} ={2k+1:keZ} 
(q) {7, 47, 77,107,...} ={(1+3k)a:k€ Zp} 
(r) {-27, 7,47, 77, 107,...} ={(1+3k)a:k€Z,k>-1} 


A62(a) R\ R* =R). 

(b) R\ (QUZ)=R\Q=Q. 

(c) [1,6] \ ((3,5) 9 (1,4)) = [1,6] s (3,4) = [1,3) u (4,6). 

0d) 4155, 0 1oye2.) 42, 6,8, <2 =. 

(e) {2,5,8,11,...}{2,4,6,8,...} = {2,8,14,20...} = {2+ 6k: ke Za}. 
(4) 0,5] m{ (1,8) a [5,9))' =(0,5) mf 5,8)" =(G, 5). 
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148.5. Ch. 5 Answers (O-Level Review) 


A65. False. Counterexample: If x = -1, then Vx? = \/ (-1 *=J/1=1#2. More generally, 
any x <0 would work as a counterexample. 


A66. False. Counterexample: If x = -1, then 


VP _V(-l)_vI_ 1 
x = 

















si 21 
-] —] 
(Again, any x <0 would serve as a counterexample.) 
The error is at =: it is not generally true that x = V7?. 
A67. False. Counterexample: If x = -1, then 
Hi —1 -1 -l 
= = 7 ae eal Ie 
Vx? yay JI 1 
(Again, any x <0 would serve as a counterexample.) 
The error is at =: it is not generally true that x = V2?. 
4x, 951-2 4z , 52(1-2) 
A63(a) 5 5 _ ae) 
52th + 3-257 417-527 — H2tL 4. 3-52 4+ 17-52% 
52+2x 
© BT 4 3 5 4 17-52 
H2t+2a Hata B2+2a 


= — = = —_ =], 
52e (51+ 3417) 52-25 520+? 


jo 7 ede ~ 8r/8 


_ 58" (8- 34) 582-34 _ 564-34 
oar a = VI VIF 


30 30 15 
= ry sau = 2) ——_ = 2— = 15. 5s 
J8 2/2 J2 


Qu+2 = OAs 23x Qxt2 ee ne 23x Qx+2 — 34.2 
(b) J .——___—— —______— 





A64(a) False. Counterexample: Let b = 2, r= 1, y = 2. Then pie”) = o(1°) = 91 = 2, but 
bY =2!2 = 9? =4. Hence, dO”) ¢ b™. 


(b) True—see Proposition 1(d). 
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x x? 
1 TOT 1 vt Vet! 
2 2 2 
Tes tt c= nae re are 
2 2 
2F,/5+1 2F,/5+1 
_ y y _Yy y 
~ 2 2° 2 
2 zo _ of 2 
=a a7 FL y (32 +1) 
y y 
x x 
aa oe 2 
y y? X 
= i ame iene 
= Y 


At the last step, the -1 in the denominator flips the # into a +. 
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A69. 


-b+ Vb? —4ac tor —b + Vb? — 4ac—-b F Vb? — 4ac 


2a 2a _b = Vb? — dac 
b? — (b? — 4ac) 
- 2a(-b+ Vi = dac) 
_ Aac 
- 2a(-b+ Vi = ac) 
2c 


1 
A70(a) log, 32 + logs a 5 -logs27=5-3=2. 
(b) First, logs 45 = logs 5 + logs 9 = log, 5 +2. 
log325 — 2logs35 _ 
log39 2 
So, logs 45 — logy 25 = 2. 





Next, logy 25 = logs 5. 


logs 3 
logy 16 





(c) First, log,,¢ 768 = log, (256 x 3) = logy, 256 + log), 3 = 2+ 


1 
Next, logy 7/3 = log, 3/4 = ri logy 3. 


So, logys 768 — logy W3 = 2. 





% 
% 
9 10823 
4 
% 


A771. If x>0, then 7z >0 and 7x <0. If x =0, then 7z = 0 and 7x =0. If x <0, then 7x < 0 


and 7x > 0. 
A73(a) (x-1)/-4>0 — 2-1<0 — exil. 


(b) -1/-4>0 is always true. (c) 1/-4>0 is always false. 


(d) The numerator and denominator equal zero at -1/2 and -2/3. Draw the sign diagram: 


+ | - | + 


0/3 “1/2 


Hence, x < -2/3 or x > -1/2. 
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(e) The numerator and denominator equal zero at —6 and 14/9. Draw the sign diagram: 


14/9 


Hence, -6 <a < 14/9. 





9x - 14 
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149. 


149.1. 


Part I Answers (Functions and Graphs) 


Ch. 7 Answers (Graphs) 


A75(a) The graph of the equation y =e” is the set {(z,y): y=e"}. 
(b) The graph of the equation y = 37 + 2 is the set {(z,y):y=3xr+2}. 


(c) The graph of the equation y = 2x? + 1 is the set {(2, y) iy = 227 + 1}. 


aN 


(c) The graph of the 
equation y = 277 + 1 is 
the set {(a,y) sy = 2x7 +1}. 


(a) The graph of the 
equation y =e” is 
the set {(z,y):y=e'}. 


i 


Y 


(b) The graph of the 
equation y = 32+ 2 is 
the set {(x,y): y= 3x +2}. 
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A76(a) The graph of y =e”, -1 <x <2 is the set {(z,y): y=e", -l<a< 2}. 
(b) The graph of y = 32 +2, -1 <a <2 is the set {(z,y): y=30+2, -l<a< 2}. 
(c) The graph of y = 227 +1, -1< x <2 is the set {(z,y):y = 207 +1, =12¢< 2}. 


IX 
Y 









(b) The graph of y = 3x + 2, 
-1<-2 <2 is the set 
{(z,y) :y =32+2,-1<2< 2}. 


(c) The graph of y = 2x7 + 1, 
-l1<2 <2 is the set 
{(x,y):y = 20? +1,-l<a<2h. 
(a) The graph of y =e’, 
-1<2x<2 is the set 
(ey) cyse".-lee< 2}. 
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A77(a) 


x+1, forx<0 


x-1, forx>0. 









(b) 





x+1, forxr<0 


x-1, forx>0. 

















A78(a) y =2 has no z-intercepts, one y-intercept (0,2), and no solutions (or roots). 


(b) y = 2” -4 has two z-intercepts (-2,0) and (2,0), one y-intercept (0,-4), and two 


solutions (or roots) —2 and 2. 


(c) y=27+2x2+1 has one x-intercept (-1,0), one y-intercept (0,1), and one solution (or 


root) —1. 


d) y=27+2x7+2 has no z-intercepts, one y-intercept (0,2), and no solutions (or roots). 
Y 
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149.2. Ch. 8 Answers (Lines) 


A79(a) (7-4) (y-5) = (9-5) (4-4) or 3y- 15 = 4a - 16 or y= ga-5. 
i). Ge 23 Gata: Dy +4= Sr +5 or y= 0-5. 
A81(a) (y-5) =3(a2-4) or y = 32-7. 

(b) (y-2) =-2(a@-1) or y=-24+4. 


ee *& F 
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149.3. 


A83. |AB| = 12, |AC|=4, and |BC| =8. 


Ch. 9 Answers (Distance) 


A84. |AB| = V12? +12 = 145, |AC| = V4? + 32 = 25 = 5, and |BC| = V82 + 22 = V68. 


A85. In Exercise 84, we already found 


|AB|= V145, —*|AC=5, 


|BC| = V68. 


Hence, |AB| = |BA| = V145 > |BC| = V68 and |AC| = 5 < |AB| = V145. Thus, 


(a) A is further from B than C. 
(b) C is closer to A than B. 
A86. (3,6). 


149.4. 


A87(a)(i) The circle of radius 4 centred on (5,1). 


(a)(ii) The circle of radius V2 centred on (-3, -2). 


(a)(iii) The circle of radius 9 centred on (-1,0). 
(b)(i) (x +1)? + (y-2)° = 25. 
(b) (ii) (x - 3)? + (y+2)° =1. 
(b) (iii) 2? + y? = 3. 

(c) Contains (8,2)? 


(a)(i) (8-5)°+(2-1)?#16,sono. — (5,1) 
(a)(ii) (8 +3)°+(2+2)°#2, sono.  (-3,-2) 
(a) (iii) (8 +1)? +2? #81, so no. (-1,0) 
(b)(i) (8+1)?+(2-2)' #25, sono. (1,2) 
(b)(ii) (8-3)? +(2+2)'#1,sono. (3, -2) 


(b) (iti) 8? +2? #3, so no. (0,0) 
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Ch. 10 Answers (Circles) 


(d) Centre Radius Diameter 


4 8 
V2 2/2 
9 18 
5 10 
1 2 
V3 2/3 
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149.5. Ch. 12 Answers (Maximum and Minimum Points) 


A90. Refer to graphs and table below. 














N 
(b) y=27 +1, y 
=o) 
B= (1,2) DAG 
A= (0,1) G=(0,1) z 
> > 
r 





For (c), for each & € Z, let Fy, = (2k7,1) and Fy, = ((2k+1)7,1)—note that there are 
infinitely many points EF; and Fy. 


(c) y =cosz. y 


B., = (-2n,1) Ey = (0,1) Ey = (27,1) 








i= (371, 1) 


Fa = (-7,1) Fy = ( 7,1) 
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A C Ex | Fr H 

GMax J J 

SGMax J 

LMax J J 

SLMax "A J 
GMin | ¥ J J 

SGMin | ¥ J 

LMin | 4 J J 
SLMin | Y A J 

Turning | Jv Viv "A 
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IN 
(d) y=cosa, |y 
-l<ax<l 


G = (-1,cos (-1)) 


—_—_- J) 
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A89(a) False. In Example 235, G = (1,3) is a global maximum but not a strict local 
maximum. 


(b) True. A point that’s at least as high as every other point must also be at least as high 
as any “nearby” point. 


(c) True. A point that’s higher than every other point must also be (i) at least as high 
as any other point; (ii) higher than any “nearby” point; and (iii) at least as high as any 
“nearby” point. Hence, a strict global maximum must also be a (i) global maximum; (ii) 
strict local maximum; and (iii) local maximum. 


(d) False. In Example 235, G = (1,3) is a global maximum and also a local minimum. 
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149.6. Ch. 16 Answers (Reflection and Symmetry) 
A92. (-12,3). 


A93(a) |PQ| =\/ (a-c)’ + (b-d)’ = \V/[e- (2c-a)]? + [d-(2d-b)]? =|QR). Vv 

(b) By Fact 26, the line PQ is (c- a) (y- 6) = (d-b) (a- a). 

To verify that R = (2c-a,2d-b) is on line PQ is to verify that it satisfies the above 
equation, as we do now: 


(c-a) (2d-b-b) =2(c-a)(d—-b) = (d—-b) (2c-a-a) =2(d-b)(c-a). J 


A94. (2,3) and (-2,-3). 


A95. By informal observation,° 


40 


y = 2° +2x+4+2 is symmetric in the line x = -1. 








Ry 





149.7. Ch. 13 Answers (Solutions and Solution Sets) 


(This chapter had no exercises.) 





©9Formal proof: Pick any point (a, a? + 2a + 2) in the graph of y = x? + 2x +2. It is possible to show that 
the closest point on the line x = -1 is (-1,a? +2a+ 2), The reflection of the point (a, a 4 2a4 2) in 
(-1,@? +2a+ 2) is the point (-2 ~a,a" + 2a + 2), which we can verify is also a point in the graph of 
y= a" + Qo + 2: 


(-2-a)? +2(-2-a)+2=44+0?+4a-4-204+2=07+2a+2. 


We’ve just shown that the reflection of any point in the graph of y = x? + 2x +2 in the line x = -1 is 
itself also in the same graph. Hence, the graph of y = x? + 2x + 2 is its own reflection in the line x = -1. 
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149.8. Ch. 14 Answers (O-Level Review: The Quadratic Equation) 


A91. In each case, the y-intercept is given by (0,c), the discriminant by b? - 4ac, the z- 
intercepts by ((-0 + Vb? - 4ac) /2a, 0), the line of symmetry by x = —b/2a, and the turning 

















point by (-b/2a,c-b?/4a). me ME 
(a) y= 22? +24+1 | (b) y=-227+2+41| (c) y=2?+4r+4 
y-intercept (0,1) (0,1) (0,4) 
Discriminant =f <0 9>0 0 
. 1 
x-intercepts None (-5.0), (1,0) (-2,0) 
: 1 i 
Line of symmetry L=-- L=- Z=-2 
4 4 
i ars is 
Turning point (-3, =) (5. =) (-2,0) 











The turning point in each of (a) and (c) is also the strict global minimum; and in (b), it 
is also the strict global maximum. 


(a) y=22* +241 


(c) y=a*?+4r+4 








(b) 
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149.9. Ch. 17 Answers (Functions) 


A96. A function consists of three objects: namely, the domain, the codomain, and the 
mapping rule. 











A97. In general, the domain can be any set; and the codomain can be any set. 








A98. A function maps every element in its domain to exactly one element in its codomain. 








A99(a) Yes, a is well-defined because it maps every element in the domain to (ex- 
actly) one element in the codomain—we have a(Cow) = Produces milk, a (Chicken) = 
Produces eggs, and a (Dog) = Guards the home. 


(b) No, 6 isn’t well-defined, because it isn’t clear what b (Dog) is. 
(c) No, c isn’t well-defined, because it isn’t clear what each state’s “most splendid” city is. 


(d) No, d isn’t well-defined. China has more than one city with over 10M people, while 
Iceland has none. So, China would be mapped to more than one element in the codomain, 
while Iceland would be mapped to none—in either case, we’d violate the requirement that 
a function map every element in the domain to (exactly) one element in the codomain. 
A100. f(1) = 1+1 = 2; g(1) = 17(1) = 17; h(1) = 3! = 3; i(1) is undefined because 
1¢Z = {-1,-2,-3,...}; 7 (1) =17. 

A101. Below, each function is written out explicitly. 


(a) Each function maps each element in its domain to exactly one element in the codomain 
and is thus well-defined. 


(b) From below, it is clear that only b =c. 























Function Domain Codomain Mapping rule 

a {1,2} ql, 2,36 bi a(1) =2,a(2)24 

b ile ney 11, 2,0) 450,0) b(1) =2,5(2) =4, 6(8) =6 

C 1 haesoh 11 2:3.4,0,0+ é(1) =2,.é(2) =4,.4(3) =6 

d 40,12; 37 4 1 28,4, 5,6 a(0)=0,¢0C1)=2,.d(Q)=4, d(3).=6 
A102(a) f (3) = 3, f(a) = 3, f (3.5) = 4, f (8.88) = 4, and f (0) is undefined (because 
O¢R*). 
(b) Yes. (c) Yes. 


A103(a) Define f : Z* > Z by f (x) = 32. 
(b) Define g: RX > R \ {0} by g(x) = 2”. 
(c) Define h: {2k:k¢Z} > {2k:ke€Z*} by h(x) =2°. 


A104(a)(i) Domain contains objects that aren’t real numbers. So, 7 is not a function of 
a real variable. 


(a) (ii) Codomain? contains only real numbers. So, 7 is a real-valued function. 


(a) (iii) Either Domaini or Codomaini contains objects that aren’t real numbers. So, 7 is 
not a nice function. 


(b)(i) Domain contains only real numbers. So, 7 is a function of a real variable. 


(b) (ii) Codomaini contains objects that aren’t real numbers. So, j is not a real-valued 
function. 
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(b) (iii) Either Domain j or Codomainj contains objects that aren’t real numbers. So, 7 is 
not a nice function. 


(c)(i) Domaink contains only real numbers. So, k is a function of a real variable. 

(c) (ii) Codomaink contains only real numbers. So, k is a real-valued function. 

(c) (iii) Both Domain? and Codomaini contain only real numbers. So, 7 is a nice function. 
(d)(i) Domain! contains only real numbers. So, / is a function of a real variable. 


(d) (ii) Codomain!/ contains objects that aren’t real numbers. So, / is not a real-valued 
function. 


(d) (iii) Either Domaini or Codomaini contains objects that aren’t real numbers. So, / is 
not a nice function. 


tj 


l 
Function of a real variable J J 


SN 


Real-valued function VY 
Nice function J 


A105. Typically, we denote a function by a single letter, such as f or g. 


And so, typically, “f (~)” denotes not a function, but instead the value of f at x. If f isa 
real-valued function, then f (2) is simply a real number. 


A106. This is a trick question. The answer is that yes, of course it is possible—as we 
stressed earlier, the mapping rule need not “make any sense”. 


Indeed, there exist exactly 2 x 2 = 4 possible functions with domain A = {Lion, Eagle} and 
codomain B = {Fat, Tall}: 


f (Lion) = Fat, f (Eagle) = Fat g (Lion) = Fat, g (Eagle) = Tall. 
h (Lion) = Tall, h (Eagle) = Fat. i (Lion) = Tall, 7 (Eagle) = Tall. 


In general, given any two finite sets S and T, we can construct n(T’) "“°) possible functions 
using S as the domain and T as the codomain. (Can you see why?)**! 


A107(a) A function of a real variable. (b) A real-valued function. (c) A nice function. 


A108(a)(i) Yes, a is well-defined. Every element in the domain is mapped to (exactly) 
one element in the codomain—5 to 10 € Z, 6 to 12 € Z, and 7 to 14 € Z. (ii) Define 
a:{5,6,7} > Z by a(a) = 22. 

(b)(i) Yes, 6 is well-defined. Every element in the domain is mapped to (exactly) one 
element in the codomain—5 to 10 € Z*, 6 to 12 € Z*, and 7 to 14 € Z*. (ii) Define 
b: {5,6,7} + Z by b(2) = 2z. 

(c)(i) No, cis not well-defined, because for example, the element 5 in the domain is allegedly 
mapped to 10, but 10 isn’t an element in the codomain Z. 





®41For each element x in the domain S$, we have n (TJ) possible elements in the codomain to which x can be 
mapped. And so, with n(S) elements, the total number of possible functions is n (7) x n(T) x---xn(T) 
eS 


n(S) times 


or n (7) ™*), 
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(d)(i) No, d is not well-defined, because the element 5.4 in the domain is allegedly mapped 
to 10.8, but 10.8 isn’t an element in the codomain Z. 


(e)(i) Yes, e is well-defined. Every clement in the domain is mapped to (exactly) one 
element in the codomain—5.5 to 11 € Z, 6 to 12 € Z, and 7 to 14 € Z. (ii) Define e: 
{5.5,6,7} + Z by e(a) = 2a. 

(f)(i) Yes, f is well-defined. From the mapping rule and the codomain, it is unambiguous 
that 3 is to be mapped to 4€ {3,4}. (ii) Define f : {3} > {3,4} by f (3) =4. 


(g)(i) Yes, g is well-defined. From the mapping rule and the codomain, it is unambiguous 
that 3 and 3.1 are to be mapped to 4 € {3,4}. (ii) Define g: {3,3.1} > {3,4} by g (8) =4. 
and g (3.1) =4. 


(h)(i) No, A is not well-defined. It is unclear whether 0 should be mapped to 3 or 4. 


(i) (i) No, 7 is not well-defined. It is unclear what we should map 4 to, since there is no 
number larger than 4 in the codomain. 

(j)(i) No, 7 is not well-defined. It is unclear what we should map 2 to, since there is no 
number smaller than 2 in the codomain. 

(k)(i) Yes, & is well-defined. From the mapping rule and the codomain, it is unambiguous 
that 1 is to be mapped to 1 {1}. (ii) Define k: {1} > {1} by k(2) =<. 

(1) (i) Yes, J is well-defined. From the mapping rule and the codomain, it is unambiguous 
that 1 is to be mapped to 1¢ {1,2}. (ii) Define /: {1} > {1,2} by I (a) =<. 

(m)(i) No, m is not well-defined. It is unclear what 2 should be mapped to, since there is 
no number that is “the same” as 2 in the codomain. 

(n)(i) No, n is not well-defined, because for example, the element —1 in the domain is 
allegedly mapped to V-1, but V—1 isn’t an element in the codomain R.®” 

(0)(i) No, o is not well-defined, because the element 0 in the domain is allegedly mapped 
to 1+0, but 1+0 isn’t an element in the codomain R.%? 

(p)(i) No, p is not well-defined, because for example, the element 3 in the domain is 
allegedly mapped to 4, but 4 isn’t an element of the codomain [0, 1]. 

(q)(i) Yes, q is well-defined. Every element x in the domain [0,1] is mapped to (exactly) 
one element in the codomain R, namely x+1¢€R. (ii) Define q: [0,1] > R by q(x) =a+1. 
A109. Change the domain of n to Rj, the set of non-negative reals. We then have the 
function n: Rj > R that is (well-)defined by n(x) = /z.°“4 


Change the domain of o to R \ {0}, the set of all reals except zero. We then have the 
function o: R \ {0} + R that: is (well-)defined by o(x) = 1/2." 


A110(a) Rangea = R}. (b) Ranged = {0,1,4,9, 16, 25,49,...}. 
(c) Rangec = {0,1,4,9, 16, 25,49,...}. (d) Ranged = Z. 
(e) Rangee = Z. (f) Range f = {100, 200}. (g) Rangeg = {100}. 





642 As noted earlier and as we'll learn later, V—1 is not a real but an imaginary number. 

643 As discussed in Ch. 8, 1 +0 is not a real number. Indeed, it is not even a number; it is undefined. 

®4Note that the new domain R¢ is indeed the “largest” subset of R such that the function n is well-defined. 
The addition of any negative number to this new domain would render the function ill-defined. 

®Note that the new domain R \ {0} is indeed the “largest” subset of R such that the function o is 
well-defined. The addition of zero to this new domain would render the function ill-defined. 
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Only d and f are onto. 
A111. Only (b) “Range f ¢ Codomain f” must be true. 
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149.10. Ch. 18 Answers (An Introduction to Continuity) 


(This chapter had no exercises.) 


149.11. Ch. 19Answers (When A Function Is Increasing or 
Decreasing) 


A112(a) Pick any a,b € (-00,0] with a <b. Observe that 
ash or j(a)> jb). 


And so, by Definition 73, 7 is strictly decreasing on (-0o, 0]. 
(b) Pick any a,b €[0,00) with a <b. Observe that 


meas or j(a) <j (b). 


And so, by Definition 73, 7 is strictly increasing on [0, co). 


(c) By Fact 46, 7 is also decreasing on (-00,0] and increasing on [0, oo). 
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149.12. 
lea eae ana (g) 
(g- f) (1) = 1° -(7-1+5) =-11 (hh) 


(c) (g9-f)(2) = (2?) (7-245) = 152 (i) 

(d) = (kg) (1) = 2 (j) 
1 

(e) & )) = 1+5 12 » 

(f) (f+k)() =7-14+54+2=14 (1) 


(k) For each of f+h, f-h, and f-h, the domain is Domain fn Domainh = Rn [-1, c 
[-1,00). For f/h, the domain is Domain f mn Domainh \ {x: h(x) =0} = 


(-1, 00). So, define 


Ch. 20 Answers (Arithmetic Combinations of Functions) 


(h+i) (2) =24+14+V24+1=3+V3 
(i-h)(1) =V14+1-(14+1)=V2-2 
(i-h) (2) = V2+1(2+1) =3V3 
(li) (1) =5V1+1=5V2 
i J1+1 V2 
Oa 





Led 2 
(h+1)(1) =14+1+5=7 


[-1,00) \ {-1} 


(f+h):[-1,0) +R by (f +h) (2) =f (x) +h(a) = 82 +6; 


(f —h):[-1,00) > R by (f-h) (x) = f(x) - h(a) = 6 + 4; 


(fh): [-1,00) +R by (f-h) (2) = f(@)-h(e) = 
f(@) _ 





fcr (eos 
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(7x + 5) (x +1); 


7x+5 
c+10 
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149.13. Ch. 22 Answers (Composite Functions) 


A115(a)(i) The composite function g* = gg? exists because Range g? = R ¢ Domaing = R. 
(ii) Define g: R > R by 


5-2) -1- ote 
2 8 16 


g! (0) = 99° (a) = 9 (9° (2) =9 (5-2 


(iii) g’ (1) = 11/16 and g* (3) = 13/16. 


(b)(i) The composite function g’ = gg* exists because Range g* = R ¢ Domaing = R. 
(ii) Define g? : R > R by 





He) = a6 0) = (Gt (@)) x 0(f + jp) -1- ES a Te 


(iii) g? (1) = 21/32 and g? (3) = 19/32. 


(c)(i) The composite function g° = gg° exists because Range g’ = R ¢ Domaing = R. 
(ii) Define g°: R > R by 
ll «¢ u-m 21 ¢ 
= 7 = 5 = —_ - — = 1 = 16 32 
9 (x) = 99° (x) = 9 (9? (2)) (= =| 
(iii) g° (1) = 43/64 and g® (3) = 45/64. 
2”-(-1)” 


1 i 2 ip ae 1 
(d) bn = 2°", on =(-5)", and hence g” (x) = = +(-5)"2=5+5(-5) +(-5)". 








i? 1 
(e) As n approaches oo, each of (-5) and (-5)" approaches zero. Hence, g” (x) 


approaches 3° 
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A116(a) Rangeg = [1,o0) ¢ Domain f = R. Hence, the composite function fg: R—- R 
exists and is defined by 


(f9) (t) = f(g (a)) = f (2? +1) =e" 7. 


We have fg (1) = eb 1 ¢ and fg (2) = eF 1 = 68, 


Also, Range f = R* ¢ Domaing = R. Hence, the composite function gf : R > R exists and 
is defined by 


(of) (x) = 9 (f (z)) = 9 (e”) = (e?) +1 = 087 +1. 


We have gf (1) =e? +1=e?+1 and gf (2) =e??+1=e44+1. 
(b) Range g = Rx {0} ¢ Domain f = R\ {0}. Hence, the composite function fg: R\ {0} +R 
exists and is defined by 


i 


2 ee 
lx 


1 
(Fa) (2) = f (g(@) =F (5-] 
i 
We have fg (1) =2-1=2 and fg (2) =2-2=4. 
Also, Range f = Rv {0} ¢ Domain g = R\ {0}. Hence, the composite function gf : R\{0} + R 
exists and is defined by 


(af) (@)=9(F @)=9(>)= say =F 
We have gf (1) = 1/2 and gf (2) = 2/2 =1. 


(c) Rangeg = [1,00) ¢ Domain f = R\ {0}. Hence, the composite function fg: R +R 
exists and is defined by 


(f9) (2) = f (g(a) = f (22 +1) = =—. 


v2 4+1 


We have fg (1) = 1/(1? +1) =1/2 and fg (2) =1/(2?+1) =1/5. 
Also, Range f = R \ {0} ¢ Domaing = R. Hence, the composite function gf : R\ {0} +R 
exists and is defined by 
(of) () = 9(f (@)=9(=)= S41 
Lye r))=g(—)= 41. 
g g Oe Nees 
We have gf (1) = 1/17+1=2 and gf (2) = 1/2? +1=5/4. 


(d) Rangeg = [-1,oo) ¢ Domain f = R\ {0}. Hence, the composite function fg: R->R 
does not exist. 


Range f = R \ {0} © Domaing = R. Hence, the composite function gf : R \ {0} > R exists 
and is defined by 


(of) () = 9(f (@) = 9(=) = 5-1. 


rt) x 
We have gf (1) = 1/1?-1=0 and gf (2) = 1/2? -1=-3/4. 
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A117(a) Range f = R* € Domain f = R. Hence, the composite function f? : R > R exists 
and is defined by 


f(x) = ff (2) = f(e") =e. 
We have f2(1) =e° =e° and f?(2)=e°. 


(b) Range f = R € Domain f = R. Hence, the composite function f? : R > R exists and is 
defined by 


f? (x) = f (f (2) = f (82 + 2) = 3 (32 +2)4+2=92r+8. 
We have f? (1) =9-1+8=17 and f?7(2)=9-24+8=26. 


(c) Range f =[1,00) €¢ Domain f = R. Hence, the composite function f? :R > R exists and 
is defined by 


f? (x) =F (f (a) = f (20? + 1) = 2 (20? +1)" +1 = 2 (404 + do? +1) +1 = 804 + 8a? +3. 
We have f? (1) = 8-14+8-17+3=19 and f? (2) =8-24+8-27+3 = 128+ 32+3 = 163. 


(d) Range f = R ¢ Domain f = R*. Hence, the composite function f? : R > R does not 
exist. 


A??. No: 4+ (342) =8/3 but (4+3)+2=4/6 = 2/3. 


1780, Contents www.EconsPhDTutor.com 


149.14. Ch. ?? Answers (one-to-one Functions) 
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149.15. Ch. 24 Answers (Inverse Functions) 


A123(a) Write y =a(z) =5z. Do the algebra: x = y/5. 

So, a is one-to-one and its inverse is the function a“! : R > R defined by a“! (y) = y/5. 

(b) Write y = b(x) = 2°. Do the algebra: x = </y. 

So, b is one-to-one and its inverse is the function 0"! : R > R defined by b7' (y) = 8/y. 

(c) Write y = c(x) = 2? +1. Do the algebra: x = ty/y-1. 

Since Domainc = Rj, we have x > 0. So, we discard the negative value and are left with 

easly ai. 

Hence, c is one-to-one and its inverse is the function c™': [1, 00) > Rj defined by c (y) = 
y—-1. 

(d) Write y = d(x) =2?+1. Do the algebra: x = ty/y—1. 

Since Domaind = Rp, we have x < 0. So, we discard the positive value and are left with 

fS—4/4=1, 

Hence, d is one-to-one and its inverse is the function d~!:[1, 00) > Rj defined by d™! (y) = 

ay A, 

(e) Write y = e (x) =1/(x?+1). Do the algebra: x = +1/\/y-1. 

Since Domaine = Rj, we have x > 0. So, we discard the negative value and are left with 

£=1/ Jy -1. 

Hence, e is one-to-one and its inverse is the function e~! : (0,1] > Rj defined by e7' (y) = 

Ir /y-1. 

(f) Write y = f (x) =1/(x?+1). Do the algebra: x = +1/\/y-1. 


Since Domain f = Rj, we have x < 0. So, we discard the positive value and are left with 


t= -1//y-1. 

Hence, f is one-to-one and its inverse is the function f~!: (0,1] + RO defined by f7! (y) = 
-1//y-1. 

A124(a) No, a is not one-to-one because 0 = a(-1) =a(1). 

(b) No, 6 is not one-to-one because 0 = b(-1) = b(1). 

(c) Write y= c(a) = 27-1. Do the algebra: a = +\/y +1. 


Since Domainc = Rj, we have x > 0. So, we discard the negative value and are left with 


Hence, c is one-to-one and its inverse is the function c! : [-1, 0) > Rj defined by c™! (y) = 
Jy +l. 

(d) Write y = d(x) =a2?-1. Do the algebra: x = +\/y +1. 

Since Domaind = Rp, we have x < 0. So, we discard the positive value and are left with 
e=-/y+1. 


Hence, d is one-to-one and its inverse is the function d7' : [-1, 00) > Rp defined by d7! (y) = 
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A126(a) Write y = f(z) =x+1. Do the algebra: x = y-1. 
So, f is one-to-one and its inverse is f~! : (1,2] > (0,1] defined by f(y) =y—-1. 


NN 2 
y a 


x 
(1, 2) o 











(b) Write y = g(x) = 2x. Do the algebra: x = y/2. 
So, g is one-to-one and its inverse is g™! : (0,2] > (0,1] defined by g7! (y) = y/2. 


IN 
Y 


(1, 2) 











(c) Write y = h(x) =1/zx. Do the algebra: x = 1/y. 
So, h is one-to-one and its inverse is h~':[1,00) — (0,1] defined by h7' (y) = 1/y. 
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(d) Write y =i (x) =a”. Do the algebra: x = +\/y. 
Since x € Domain = (0, 1], we may discard the negative value and be left with x = \/y. 


So, i is one-to-one and its inverse is i' : (0,1] > (0,1] defined by 7! (x) = Vz. 


IN 
Y 








A129(a)(i) Write y = h(x) =1/x. Do the algebra: x = 1/y. 

So, h is one-to-one and its inverse is h~':[1,00) - (0,1] defined by h7! (y) = 1/y. 

(a) (ii) Write x = h~'(y) =1/y. Do the algebra: y = 1/z. 

So, h7! is one-to-one and its inverse is (hn!) : (0, 1] > [1, co) defined by GA) (eye iia: 
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(a)(iii) Yes, Range h = [1, 00) = Codomain h. 

(a)(iv) Yes, h = (ht). 

(b)(i) Write y =i (x) = 2. Do the algebra: x = +,\/¥. 

Since x € Domain = (0, 1], we may discard the negative value and be left with x = \/y. 
So, i is one-to-one and its inverse is i! : (0,1] > (0,1] defined by i! (y) = V/y. 

(b) (ii) Write x = 77! (y) = \/y. Do the algebra: y = 2. 

So, i! is one-to-one and its inverse is (i) : (0, 1] > (0,1] defined by Ca (x) = 27. 
(b) (iii) Yes, Rangei = (0, 1] = Codomain i. 

(b) (iv) Yes, i= (i!) 

A130(a) The restriction of the function f to the set of non-positive reals. 

(b) Pick any distinct 21,22 € Domain Fis = Rp. Suppose x) < #2. Then flee (21) = xf} >22 = 
is les (x2). 

We’ve just shown that f he is strictly decreasing. Hence, f te is also one-to-one. 

(c) Write y = she (x) =x”. Do the algebra: x = +\/y. 


Since x € Domain f|,,, = Rj), we may discard the positive value and be left with x = —,/y. 
Rj 0 


-| -1 
So, f les is one-to-one and its inverse is the function f ps : Rj > Rp defined by f les (uy) = 


fy. 

(d) Yes. In fact, there are infinitely many such sets S. For example, S could be any subset 
of Rj or any subset of Rp. It could also be [-2,-1]u [3,4] (why?).°® And as mentioned 
in n. 194, S could also be @. 


A131(a) The restriction of the function g to the set of non-positive reals. 
(b) Pick any distinct 21,22 € Domain g|._ = Rj. Suppose x; < x9. Then Ip (ny) = [oi|= 
0 0 

Lied = |15|= Dp (22). 

0 
We’ve just shown that Dp is strictly decreasing. Hence, | p- is also one-to-one. 

0 0 

(c) Write y = Ihe (x) =|z|. Do the algebra: x = +y. 

0 


Since x € Domain g|,, = Rj), we may discard the positive value and be left with x = -y. 
Rj 0 


=| -1 
So, Op is one-to-one and its inverse is the function Ile Rj > Ro defined by p- (y) =-y. 


(d) Yes. In fact, there are infinitely many such sets S. For example, S could be any subset 
of Rj or any subset of Rg. It could also be [-2,-1]u [3,4] (why?).°” And as mentioned 
in n. 194, S could also be @. 


A132(a) No, h is not one-to-one because for example, the element 1 in the codomain is 
“hit” twice (by 0 and 2): 





©The range of Tes y 
“7The range of dl,» 


U[3.4] is [1,4] U[9, 16] and each element in this range is “hit” exactly once. 
-1u[3,4] is [1,2] U[3,4] and each element in this range is “hit” exactly once. 
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— and iO 
(O21) (2-1) 


1=h(0) = 
(b) The restriction of the function g to the set of real numbers greater than one. 
(c) Pick any distinct 2,22 € Domain Pl 00) = (1,00). Suppose x1 < 2. Then Fl 00) (tee 


1 1 
—5 > -——__ =A ©). 
(a,-1)° (@.-1)° a (™) 


We’ve just shown that h| aS is strictly decreasing. Hence, h| (1,00) is also one-to-one. 
d) Write y =h t)=1/(1-2)’. Do the algebra: 7 = 1+1/,/y. 
(1,00) 


Since x ¢€ Domain hj , ot i (1,00), we may discard x = 1-1/\/y(< 1) and be left with 
g=1+1/,/y. 


So, Peo) is one-to-one and its inverse is the function PMs) : Rj > (1,00) defined by 


-1 

Peo) (y) =e 1/vV/y. 

(e) The restriction of the function g to the set of real numbers less than one. 

(f) Pick any distinct 71,22 € Domain ce i (—oo,1). Suppose x] < x. Then Pee 1) (2) = 
1 1 . | 

———, < — =A x2). 

(x1 = iD (x9 - 1)” Visca ( ) 


We’ve just shown that Nee 1) is strictly increasing. Hence, h| (a0) is also one-to-one. 


(g) Write y = Nhe 1) (x) =1/(1-2)°. Do the algebra: « = 141//¥. 
Since x € Domain Ile o.1) = (-o0,1), we may discard x = 1+1/,/y(> 1) and be left with 
x=1-1/,/y. 


So, hho 1) is one-to-one and its inverse is the function As 1! Rj > (-c0, 1) defined by 


Pl oot) W) = 1 =U V9. 


(i) Yes. In fact, there are infinitely many such sets S. For example, S could be any subset of 
(1, 00) or any subset of (—00, 1). It could also be (0, 1)U(2,3) (why?).8 And as mentioned 
in n. 194, S could also be @. 





°8The range of h is (1,00) U(1/4,1) and each element in this range is “hit” exactly once. 


(0,1)U(2,3) 
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149.16. Ch. 25 Answers (Asymptotes and Limit Notation) 


XXX 
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149.17. Ch. 26 Answers (Transformations) 


A133(a) We already graphed y = 2f (x) +1 in the above example. Simply reflect that in 
the x-axis to get the graph of y = -2f (#-1). 


‘ y=27 (£)+1 


IX 
Y 





ee 2 
~—_$._-- 


y =-2f (2) =1 








(b) We already graphed y = 2f (x) +1 in the above example. Simply reflect that in the 


y-axis to get the graph of y = 2f (-x) +1. 


\ wal) (a) +1 
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A133(c) We already graphed y = 2f (x+1) in the above example. Simply reflect that in 
the x-axis to get the graph of y = -2f (x+1). 


y = 2f (x +1) 











(d) We already graphed y = 2f (x +1) in the above example. Simply reflect that in the 
x-axis to get the graph of y = 2f (-7 +1). 


y=2f (-a+1)' 
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A133(e) We already graphed y = f (2x) +1 in the above example. Reflect that in the 


x-axis to get y=—-f (2x) -1. 


Then translate upwards by 2 units to get the graph of y = —f (2x7) + 1. 











(f) We already graphed y = f (2x) +1 in the above example. Reflect that in the y-axis to 


get y = f (-2x) +1. 


y = f (-2x) +1 \ 


y=f (27)+1 
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A133(g) We already graphed y = f (2x+1) in the above example. Reflect that in the 
z-axis to get y=-f (2x +1). 






y=-f (27+1) 











(h) We already graphed y = f (2x +1) in the above example. Reflect that in the y-axis to 
get y = f (-2a +1). 


Y 
y = f (-24+1) ' 
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A134(a) First compress horizontally by a factor of 2 to get y = f (22). 
Then stretch vertically by a factor of 2 to get y =2f (22). 


Now reflect the portion below the z-axis in the z-axis to get y = |2f (27)]. 


y = |2f (22) 





aN 








(b) First translate 1 unit rightwards to get y = f(#-1). 


Next reflect the right portion in the y-axis to get the left portion of y = f (|x - 1)) 
Now translate 2 units upwards to get y = f (| —1]) +2. 


aN 


pe Fte<apae |* 
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A135. 











. Start with the graph of y = 1/z. 

. Translate rightwards by 2 units to get y = 1/(a#- 2). 

. Compress horizontally by a factor of 5 to get y = 1/ (5a - 2). 
. Reflect in the x-axis to get y = —-1/ (5x - 2). 

. Translate upwards by 3 units to get y=3-1/ (5x -2). 


oo FW Ww FR 
ee ee 


149.18. Ch. 28 Answers (ln, exp, and e) 


(This chapter had no exercises.) 
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149.19. Ch. 43 Answers (O-Level Review: The Derivative) 


A184. f is both continuous everywhere and differentiable everywhere. m g is continuous 
everywhere but not differentiable everywhere. But it is differentiable everywhere except 
at « = 0. m@ A is neither continuous everywhere nor differentiable everywhere. But it is 
continuous and differentiable everywhere except at x = 0. 





d d 
A185(a) se = 2a, 50 *. =2(0) =0. 
x=0 
(b) oF = 152-82 +7, so Y =3-0-8-04+7=7. ® 
L Olean 





d d 
(c) qa be +3244) = 2x +3 and — (32° - 4a? + 7x - 2) = 1504 - 80 +7. Thus, 
XL 


SHS (20 +3) (32! — 4x? + 7x -2) + (x? + 3x +4) (1524 - 8x +7) 
= 217° + 54x° + 6004 - 16x? - 15x? + 6x + 22. ® 
And so, dy = 22. 
az 5 





A186. Use the product rule for (a) and (b); and the quotient rule for (c)—(e). 
dy x ,1 


d d 1 il 
(a) ao and q, meat Thus, rea oe Ing =e” (= +In2). ® 


i! 
(b) <.? = 27 and <e' Inz =e” (= + Inst. Thus, 





























x 1 
cb S get (= +Inz) +2xe"Ing = ae" (1 +2Ing +2Inz). ® 
a 
(c) dy = o£ — sine _xcosx-sing cosx_ sing © 
dx x 7 x £ ge 
(a) dy . cos#42# ~ sin re 2 cos*xt+sin2x 1 © 
dx cos? x cos? x ~ Cos? a 
d ot _d d 
(e) = —de___de = _ = oe = . By the way, this is called the Reciprocal Rule. 
mn z 
(f) dy (e) dsina/dx cosa © 
dx sin?x Ss sin’? 
(g) dy (e) dcosa/dx  -sing — sing © 
: dx cos?a ss cos?x~— cos? 
(h): dy (e) dtanz/dr_——I/cos*e@ I © 
dx tan?x sin? x/cos?a sin?” 
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A187(a) wy 7 4y . de-N +1)) 
d{x -In(z+1)} d[x-In(r+1)] 
d|x -In(x+1)] dx 


=2[r-nw+1)] [2 7 antes 3] 


dln(x+1)d(x+1) 
~ d(x+1) i | 





P2fe-mn(e+1)]/1 





=2[2-In(e+1)](1- <1] 








He 22 
=2[z-1 1)] —~ =[z-1 1 
[x-In(x+ ree: [x-In(@+ ere © 
dy 0 
—|} =2[0-1 1) = 2 (0 = = 0. 
So a. , [0 -In(0+ are [0-0] x0=0 
(b) Let 2 =14+[x-In(x+1)]. Then, 
dy dsin?q dsin?d> «ade. av 2G - a 
dr dx d& de zdev 2 2 
@ v2-a{[e-In(r+1)]24 
= cos — — 
a 
( 2 ) pene yT eee Oh) 
= { cos ——__,, | —— 
1+[x-In(«+1)]° {1+ [x -In(x+1)P} ° 


where in the last step we simply plugged in z = 1+[z-In(x+1)]’. (We could do a little 
more algebra to simplify a little further, but we wouldn’t get far.) 








Observe that Z| om 1+[0-In(0+1)]?=1. 
Thus, dy = eo ed 
dx I 
x=0 
d 
A188(a) F = — (mv). 
dt 
(b) By definition, a= a4 
dt 
; dm 
If mass is constant (i.e. mass is not changing over time), then ae = 0. 
d 3d d 
Altogether, F = a (mv) = v— + m— =0+ma=ma. 
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d i d 
A189(a) qt (exp x) + or (— expe). 
d 
(b) Since exp is defined to be the inverse of In, we have In (exp x) = x. Thus, re In (exp x) = 
d 


a= = 1. 
dx 


(c) Putting our answers in (a) and (b) together, we have 


1 ( d ) d 

—-{|—expz]=1 or — exp ® = exp Zz. 
expx \dzx da 

A190(a) The only turning point is (0,1). 

(b) The only turning point is (0, 1). 

(c) For every k € Z, the points (2k7,1) and ((2k+1)7,-1) are turning points. (Hence, 

there are infinitely many turning points.) 

(d) The only turning point is (0,1). 

A191. At A, F’, and H, the graph is decreasing. So A, F’', and HA are not stationary points. 

Thus, they cannot be turning points either. 


B and D are “kinks” at which the derivative doesn’t exist. So B and D cannot be stationary 
points. They are thus not turning points either. 


At C and G, the derivative is zero. Thus, C’ and G are stationary points. However, they 
are not turning points (because the derivative isn’t changing from negative to positive or 
positive to negative). 


At E, the derivative is zero and changing from positive to negative. Thus, F is both a 
stationary and a turning point. 
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Stationary 





























Turning 





A192. For convenience, we reproduce the graph of f from Example 659: 


Figure to be 


inserted here. 





(a) False. Points A and F (in the above graph) are a maximum and a minimum point, but 
not stationary points. 


(b) False. Again, points A and EF are a maximum and a minimum point, but not turning 
points. 


(c) False. Point D is a stationary point, but is neither a maximum point nor minimum 
point. 


(d) By Definition 58, it is true that every turning point is an extremum and therefore either 
a maximum or a minimum point. 


(e) By Definition 58, it is true that every turning point is a stationary point. 


(f) False. Point D is a stationary point, but is not a turning point. 
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149.20. Ch. ?? Answers (O-Level Review: Trigonometry) 


A137(a)(i) 7. (a) (ii) (n - 2) 7. (b)(i) (5 - 2) 7 = 37. 

(b) (ii) By symmetry, the pentagon’s five interior angles are equal. Hence, 2 BAF = 37/5. 
(b) (iii) Since AABE is isosceles, 2 ABE = (n- 2 BAE)/2=n/5. 

cA ec ec 


(b) (iv) cos = =cos 2ABE = BI = ; ; 


A 





(c)(i) 4PAQ = 4 BAE-(2BAP + 2QAE) = * (2.2) a: 


(c)(ii) Both triangles already share the angle 2 AQP. In addition, 2 ABQ = : =2 PAQ, 
Since the two triangles share two equal angles, they are similar. 

(c) (iii) Since AAPQ is isosceles, so too is AABQ. Hence, |BQ| = |AB| = 1. 

|AB| * |AQ| 

AQ] [PQ] 

PlGe ABI St, 1AGl =a. and (Pole Pola 1BPlS iat wet - é 


(c)(iv) Since AABQ and AAP@ are similar, 








La 


(c)(v) Rearrange = to get 1-2 =x? or 2?7+2-1=0. Solving, 


=[4s/P=4(1) (<1): =1av5 


nr () ne 


Since x > 0, we can discard the negative value and conclude: x = (-1 + v5) es 


(c)(vi) |BE| = |BQ|+|QE|=1+2=- = 
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IBE|_ 14+V5 
y- - 4° 





(c)(vii) From (b) (iv), cos = : 


m 1+V5_ |BR| 
d 
eee a. = AB 


_ mt |AR|_ \/ |AB/ ale (4) — oe _. 
Hence, sin 5 are 


10-275 
4 


= |BRI. 








a? +b? -¢? 


A138(a) By the Law of Cosines, c? = a? +b? - 2abcosC. Rearranging, cos C = Sar 
a 


(b) Recall that (a + b)” =a? +b? +2ab and (a-b)” 2 a? +b? — 2ab. 


Cara Meera 1 (a+b) -¢ 





Now, 1+cosC'=1+ 





2ab 7 2ab 2ab 
a+P-2 ab-a-B+e 2 e-(a-b) 
A d 1 = ( = 1 = = = 
ae il 2ab 2ab 2ab 


(c) Recall that 2? - y* = (x+y) (x-y). 

So, (a+b) -2 =(at+b+c)(atb-c) = 2s (2s + 2c) =45(s +0). 

And, c = (a-b) = (c+a-—6)(c-a+b) = (2s — 2b) (28 — 2a) = 4(s —b) (s—a). 

(d) By the First Pythagorean Identity, sin? C + cos? C = 1 or sin? C' = 1-cos*C or sinC = 
+V1 - cos? C. 


Since C’ € [0,71], we have sinC > 0. So, we can discard the negative value and conclude 


sinC = V1 -cos?C. 
(e) Area = sabsinC o sav — cos? C' = a (1 + cosC) (1 -cosC) 


2 2 
2 0 b (av) ee ey 5! ab [(a+b)’-][e2 - (a-b)’] 
4s(s+c)4(s—b)(s-—c) =Vs(s—a)(s-—b)(s-c). 
A139. Since 2QPR=A-B and |PR| =1, we have 
|QR| = sin (A - B) and |PQ| = cos(A- B). 
1. 4PTU and 2QPT are alternate; hence, 2 PTU = 2QPT = A. Moreover, |PT| = cos B. 


Thus, |PU| = sin Acos B and |TU| = cos Acos B. 


2. 2 SRT is complementary to 2 S7'R, which is in turn complementary to 2 PTU. Hence, 
ZSRT = 4 PTU =A. Moreover, |RT| = sin B. 


Thus, |ST| = sin Asin B and |RS| = cos Asin B. 
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Q sin(A - B) cos Asin B 
a 





sin Asin B 
cos(A - B) 
rT 
cos Acos B 
A-B 
sin Acos B U 


We now have PQ=UT+TS or cos(A- 8B) =sin Asin B + cos Acos B. 


And, QR = PU - RS or sin(A- B) =sin Acos B - cos Asin B. 


1800, Contents www.EconsPhDTutor.com 


A140(a) sin2A = sin Acos A +cos Asin A = 2sin Acos A. 
(b) cos 2A = cos Acos A - sin Asin A = cos” A - sin? A = cos” A - (1 - cos” A) = 2cos” A - 1. 
Also, cos 2A = cos? A - sin? A = (1 — sin? A) ~sin? A=1-2sin? A. 


Al4l(a) sin3A =sin(A+2A) 
= sin Acos2A + cos Asin2A (Addition Formula) 
= sin A (1 — 2sin? A) +cosA(2sin Acos A) (Double-Angle Formulae) 
= sin A-2sin®? A +2sin Acos”? A 
= sin A-2sin® A+2sin A(1-sin? A) (sin? A + cos? A = 1) 
= 3sin A -4sin® A. 


(b) cos3A =cos(A+2A) 
= cos Acos2A -sin Asin2A (Addition Formula) 
= cos A(2cos? A-1)+sin A(2sin AcosA) (Double-Angle Formulae) 
= 2cos? A-cos A+2sin? Acos A 
= 2cos® A - cos A + 2(1- cos” A) cos A (sin? A + cos? A = 1) 
= 4cos® A -3cos A. 


A142. Take the hint and apply the Addition and Subtraction Formulae: 


P+Q P-Q P+Q P-Q P+Q 
5 _— cos ——— +c 








sin 


= sin 5 5 Os 5 a. 








sin P = sin( 


ie) 

















sin@ = sin( 


P+Q P- 

















P+Q —*) _P+Q P-Q P+Q .P-Q 
2 


cos P = cos ( 

















5 5 + sin 


cosQ = cos(—S* - =") Ped) P-Q P+Q. P-Q 
2 2, 9 an 


You can easily verify that the four S2P or P2S Formulae now follow. 


Pp P- 
A143. Let 2x7 = ae And bes a 











2 
Then [ein Sa ee 
2 2 
P P- 
And, = 1G 4 = 2-80 = ~Ba, 


And so, by the P2S Formulae, we have 
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sin7z+sin(-37) _ sin7x-sin3x 








in 2a cos 5x = _ ‘ 

(a) sin 2x7 cos 5x ; ; 

(b) cos 27 sin bz = See Zt (=3:) ee ie tg + sin 3a ae 
2 2 

(c) cos 2x cos 5x = cos 7x + cos (~32) = cos 7x + cos 3a % 
2 D 

(d) sin 2rsin 5s = Sos T= cos (30) = Sos — cos Te - 


A144. Domain tan = R\ {x:cosxz =0}=R\ {k7/2:k is an odd integer}. 


Codomain tan = R. 


in(A+B 
A145. tan(A+ B) = i (By definition of tangent) 
_ sin Acos B + cos Asin B By Add. and Sub. 
~ cos Acos B ¥ sin Asin B Form for sin and cos 
sin A/cos A + sin B/cos B 8 
Ey NES GEE <a a eo Divide b Acos B #0 
1 + (sin Asin B) / (cos Acos B) Ace ean 
tan A+tan B 





er See rete (By definition of tangent). 


tanA+tanA 2tanA 


A146. tan2A = —_—_ = ——___. 
= l-tanAtanA 1-tan?A 


A147. 


tan3A =tan(A+2A) 


tan A+tan2A ae 
Gar (Addition Formula) 


tan A + 2tan A/(1- tan” A) 


= —______—__~__~—__ (Double-Angle Formula 
1-tan A[2tan A/(1- tan” A)] ( ) 


tan A (1 — tan” A) +2tanA 
1(1- tan? A) - tan A (2tan A) 


A-tan* A 
Pelee (Multiply by 1 - tan? A) 














1 =3tan? A 

A148(a pone a+b. snAsinB asinAsinB+bsinAsinB ; asinAsinB+asinBsinB _ 
“a-b a-b “sin Asin B ~asinAsinB-bsinAsinB asin AsinB-asinBsinB ~ 
sin A +sin B 
sin A-sin B’ 
(b) By Fact 75 (S2P or P2S Formulae), 

a+b sinA+sinB 2sin4¢2cos452 sin $2/cos4$2 tan 432 

a-b sinA-sinB  2cos4¢ sin as ~ sin 453/ cos 452 ~ tan 458 
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(As stated on p. 344, “Whenever you see a question with trigonometric functions, put 
MF 26 (p. 3) next to you!”) 








: is x x is x x s x 
r . sing | 2 sin 5 COs 5 _singzcosy sing _ v 
149(c)(i) = 8 = 2 tan. 
l+cosx 1+2cos eo COs" 5 COS 5 2 
l-cose 1-(1-2sin’#) sin’ 4 sin? £ a 
(c) (ii) ———_ = ——.—- = < = ae SRE 3 == — = tan —. 
sin x 2sin = cos = sin=cos=  cos% 2 
2 2 2 2 2 

x sin?2 1-cosz x 1-—cosz 
(c) (iii) tan? = = = = ———.. So, tan = = +, / ———. By Corollary 13, 

2 cos 5 1+cosx 2 1+cosx 


a0 ip 7 L es 7 
= > —= canes ae _ Per . 
tan > >0 if e|0,7) and tan > <0 if 5 €( =. 0| 


1-cosx £ Tl 
a \ ; for 0,5), 
Thus, tan — = oe 
2 /1-—cosz x ( 1 | 
-\ / ————__, for ~€{-—,0}. 
1+cosz 2 Z 


(Note: Since x/2 is not an odd integer multiple of 7/2, 1+ cosx # 0, so that there is no 
danger that the denominator in the surd equals zero.) 


A154. The sole error is in Step 6. 


Since x € [-1,1), we have A = cos! z € (0,7] and hence —A ¢€ [-71,0). So, by Fact 99(b), 
cos’! x = cos"! (cos (—A)) # —A and Step 6 is incorrect. 
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149.21. Ch. 38 Answers (Factorising Polynomials) 


16x+3 
A156(a) In the expression —— the dividend is 16x + 3 and the divisor is 5x — 2. 
G — 


Terms: xz! x 
we 

5a-2/16r +3 
16x -6.4 

9.4 


The quotient is 3.2 and 9.4 is the remainder. We have 


1 3 9.4 
is =. 


: at 
5x — 2 5a - 2 








_ 447-3741 = — oe 
(b) In the expression 45 the dividend is 4x%° —- 3x + 1 and the divisor is 7+ 5. 
qi 
Long division: 


Terms: 7° £ x 





+5/4e? -32 +] 
4a? +202 

—232 +] 

-23x4 -115 

116 








The quotient is 4x — 23 and 116 is the remainder. We have 


Age 4 116 
ae % 
g+5 g+5 





2 +043 


, the dividend is «?+2+3 and the divisor is —x? — 2x + 1. 
x? -2r¢+1 


(c) In the expression 


Long division: 





The quotient is —1 and -x+4 is the remainder. We have 


2 —-7+A4 
Mba oa be ie t+ % 


rs ie ge 3p 
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A159(a) (2x7 + 7x? - 3x +5) +(x-3) leaves 2-3°+7-3?-3-34+5= 113. a 
(b) (204 + 3x? - 72 — 1) + (x + 2) leaves -2- (-2)* +3-(-2)? -7-(-2) -1=-7. ae 


A160(a) I'll use the SSGACM: 
(QQ =3) (@+1) 2227 =¢=3.% 
Aiyah, sian. Now try again, but switch -3 and 1: 
(27 + 1) (2-3) = 227 -52r-3.V Yay! Done! 


(b) I'll use the quadratic formula. We have b?-4ac = (-19)’-4 (7) (-6) = 361+168 = 529 > 0 
and 529 = 23. Thus, 








= 19 +2 
72? - 190-6 =7(2-= =)\(«- oes 
14 14 


)=7(0+=) (@-3) Kt OG 8), 
(c) I'll use the SSGACM: = (24 +1) (84-1) = 62? +4-1. Vv Yay! Done! 


(d) [ll start by using the FTGACM. Since the constant term is -14 = —2 x 7, let’s try 
plugging in 2: 


p(2)=2-2°-2?-17-2-14<0.x 


Aiyah, sian. Doesn’t work—by the FT, x — 2 is not a factor for 2a? - x? — 17x - 14. 
Let’s instead try —2: 


2a (Oy 39) 17-2) = 14 = - 16 4 a 1S, 7 
Yay, works! By the FT, x +2 is a factor for 27° — x7 - 17x - 14. 
Now, as usual, write 2z2°-2?-17x-14=(2+2) (ax? + bx + c) ; 
The coefficients on the cubed and constant terms are a = 2 and 2c = -14. And so, c = -7. 


To find b, look at the coefficients on the squared term, which are 2a+b=-1 and so b=-5. 
Thus, ax? + bx + c= 24? — 5x -7. 


To factorise 2x? - 5x —7, I'll use the SSGACM: 
(Q2=7) @41) =22? =5¢= 7.7 
(Wah! So “lucky”! Success on the very first try!) 


Altogether, we have 22° - 2? -17x-14= (x+2) (22-7) (x+1). % 
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A163(a) By (i) and the RT, p(1) =a+6-31+3+3=a+b-25=5. And so, b=30-a. 
1 a. & Bl .3 

B ii): = = _ 
y (ii) 0 (5) ‘are ri 08 

a 6b 138, a 30-a_ 13 

= + _— = + 

16 8 4 16 8 4 

_ 60-a 13 x16 


-— = 60-a-52=8-a. 
ig ji 60-a-52=8-a 











Thus, a= 8 and b= 22. And we have 
p(x) = 8+ + 222° - 3127 + 32 +3. 
(b) Observe that p(0) > 0. Given also (iii) p(-1/3) < 0, the IVT says there must be some 


-1/3<c<0 such that p(c) =0. 
So, let’s try the FTGACM, by plugging in -1/4: 


: (-) +22 (-+) -31 (-1) +3(-z)+3-0.7 


Yay, works! By the FT, x +1/4 or 4(4+ 1/4) = 42 +1 is a factor of p(z). 
From (ii), we also already knew that x - 1/2 or 2(4- 1/2) = 2x-1 is a factor of p(z). 


So write p(x) = 8x* + 222° - 3127+ 32+3 
= (2x-1) (4x +1) (dz? +ex + f) 
= (8x? - 2x -1) (dx? + ex+f). 
The coefficients on the 4th-degree and constant terms are 8d = 8 and —f =3. And so, d= 1 


and f = -3. To find e, look at the coefficients on the linear term, which are -2f -e = 3. 
And so, e=—-2f —3=3. Thus, dz? +er+ f =27+32-3. 


To factorise this last quadratic polynomial, we observe that b?—4ac = 37-4 (1) (-3) = 21> 0. 
And so, by the quadratic formula, 


Tec _-38-V21\( -3+V21)_ btval 3oval 
a + 3-3 =|x-—— ]|z-—— J = [2 > r 5 , 














) 21 3-V2l 
Thusp (0) = Sr « 24° 314? + 3043 = (Ber 1) (40+ 1)( + ety es vt) x 
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149.22. Ch. 39 Answers (Solving Systems of Equations) 


A165. Let A, B, and C be Apu, Beng, and Caleb’s ages today. Let k be the number of 
years between today and the day Apu turned 40. 


The second sentence says that A -k 240 and B-k22 (C-k). 
The third sentence says that A 3 2B and C £28. 

Plug 3 into + to get 2B -k 2 40, 

Plug 4 into 2 to get B-k 22(28-k) =56-2k or B+ k 256. 
Take 2 + £ to get 3B = 96 or B = 32. 

So, Beng turns 32 today. And from 3 Apu turns 64 today. 


A166. At the instant they turn, Plane A is 300km northeast of the starting point, while 
Plane B is 600km south of it. The angle formed by their flight paths is 37/4. So, by the 
Law of Cosines (Proposition 7), at this instant, the distance between the two planes is 


\/3002 + 6002 — 2 (300) (600) cos (37/4) » 442 km. 


Thereafter, this distance shrinks at a rate of 100 + 200 = 300kmh™. So, they’ll collide in 
another 


442 km 


——_ 3 1 ATh. 
300 km h-! 


300 km 


g 


600 km x 442 km 
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A168(a) From the given information, we have this system of equations: 


244-124+b-1l+c=at+bte, 
5 2 a-32+b-34+c=9at3dt+C, 


93¢-624+b-6+c=36a+6b+c. 


Take 2 - 4 to get 8a+ 2b =3 or b 41.5 -4a. 

Plug * into + to get a+ 1.5-4a+c=2 or c23a+0.5. 

Now plug * and 2 into 2 to get 36a + 6 (1.5 - 4a) +3a4+0.5 =9 or 15a = -0.5. 

Hence, a = -1/30, b = 49/30, and c= 0.4. So 
(b) First, 2 -a+bte. 


Next, recall that the strict global minimum point of a quadratic equation occurs at x = 
—b/2a. Hence, b = 0 and 


-o(-2) +0(-2)+e2 BF vene-H=0 
a pio " 2a 2a 4a 2a 4a ~ 


So, c=0. Now from é we also have a= 2. % 





A169(a) We'll use Method 2 from the last example—we’ll rewrite the two equations as: 


y=u?-x?+2- 


| 
1+ fx’ 
then graph this equation on our GC. It looks like there are no x-intercepts. We thus 
conclude that this system of equations has no solutions and its solution set is ©. % 


(b) Again, use Method 2 and rewrite the two equations as: 


1 3 . 
= ——. - x” - sing, 
YT @ 
then graph this equation on our GC. It looks like there is only one x-intercept. Using the 
“zero” function, we find that it’s at x » -1.179. 


Plug this value of x back into either of the original equations to get y » —2.563. We conclude 
that the (unique) solution is ~ (-1.179, —2.563). t 


(Answer continues below ...) 
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(... Answer continued from above.) 


A169(c) Recall that x? + y? = 1 describes the unit circle centred on the origin. Recall also 
that to get the T184 to graph it, we must break it up into two equations: 


y=V1-2? and y=-V1-2?. 


1. Enter the three equations y = V1-2?, y=-V1-2?, and y=sinz. 
2. Press (QiNNSs) to graph the three equations. 
3. Zoom in by pressing YARN), (2), then CAwBRE. 


It looks like y = sinx intersects the circle x? + y* = 1 at two points. To find what these 
two points are, we will use the “intersect” function. 


4. Press MD), then CALC to bring up the CALCULATE menu. Then press (5) to select 
the “intersect” function. 


As usual, the TI84 asks, “First curve?” Ill first look for the intersection that’s to the 
right of the y-axis. So, 

5. Press (NIN35) to confirm that we’re selecting y; = V1-—.x? as our first curve. 
The TI84 now asks, “Second curve?” For our second curve, we want to select y3 = sin (2). 
To do so, simply: 

6. Press the down arrow key once. 


7. Now as usual, use the left and right arrow keys to move the cursor to approximately where 
we think the intersection point is. In my case, I’ve moved it to (x, y) » (0.745, 0.678). 


8. Now press (2QU8515). The TI84 now annoyingly now asks, “Guess?” So press UBININSIN 
once more to learn that the intersection point is (x,y) * (0.739, 0.674). 


By repeating Steps 4-8 (and making the necessary changes), you should be able to find 
that the other intersection point is (x,y) » (—0.739, -0.674). (Alternatively, you can save 
yourself some time by observing the symmetries here and immediately infer that this is the 
other intersection point.) 


We conclude that this system of equations has two solutions: (+0.739...,+0.673...). 


After Step 1. After Step 2. After Step 3. After Step 4. 


After Step 5. After Step 6. After Step 7. After Step 8. 
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149.23. Ch. 40 Answers (Partial Fractions Decomposition) 
A170(a) Since x? + 2-6 = (x-2) (x +3), write 


8 A 7 B  A(#+3)+B(a-2) (A+B)x£+3A-2B 
a+e-6 2-2 2+3 (x-2)(4+3) (4-2) (x4+3) © 








Comparing coefficients, A+ B=0 and 3A-2B=8. So, A=8/5 and B=-8/5. 


8 8 8 
a+e-6 5(x-2) 5(x+3) 





Thus, 


(b) Since 32? - 82 - 3 = (32 +1) (x - 3), write 


l7z-5 A : B  A(«-3)+B(3r+1)_ (A+3B)¢-3A+B 
322-82-3 34+1 2-3 (3@4+1)(x-3) = (3441) (2-3) 





Comparing coefficients, A+ 3B +17 and -3A+B2-5. From 3x ++ 2 we get 10B = 46 or 
B=4.6 and then also A = 3.2. 








ce ae a i : i7 = a 
A171. To factorise the denominator x? - x? —- x +1, try plugging in 1: 
p(1) =1-1-14+1=0. "A 
Yay, works! By the Factor Theorem, x - 1 is a factor for p(x). So, write 
w—x?-x+1=(2-1) (ax? +br+c). 
Comparing coefficients, a = 1, -c = 1, and c-b=-1. So, c= -1 and b = 0. Hence, 


az? +ba+c=2?-1=(4+1)(x-1). Thus, 


ve —2?-2+1=(r-1)(x4+1) (2-1) =(r4+1)(x-1)’. 





Now, write 
7 ial he : B P C _ A(@-1)' + B(a+1)(a-1)+C(#+1) 
a—a¢2-7+1 +1 2-1 (x-1) (x+1)(x-1) 
Ae =2Art At Be = B+eCatC (At Bj)’ +(-2A4+ Cat A=Bec 


(r+1)(x-1) (r+1)(x-1)’ 


Comparing coefficients, A+ B = 2, -24+C =-1, and A-B+C =7. Summing up these 
three equations, we get 2C' = 8 or C =4 and then also A = 5/2 and B = -1/2. 


Qo —2+7 5 1, 4 
e8—g?-g¢+1 2(e+1) 2(e-1) (¢-1)* 








Thus, 


A172. To factorise the denominator x° — x? — x +1, try plugging in 1: 
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p(1)=1-2+4-8<0. xX 


Aiyah, sian, doesn’t work: By the Factor Theorem, x - 1 is not a factor for p(2). 


Second try—plug in 2: 
p(2) = 23 -2-27+4-2-8=0. J 
Yay, works! By the Factor Thm, x - 2 is a factor for p(x). Next, write 
x? — 2x? + 4x -8 = (a - 2) (ax? + br +c). 


Comparing coefficients, a = 1, -2c = -8, and c—- 2b=4. So c=4 and b=0. 


The quadratic polynomial ax? + br + c = x? + 4 has negative discriminant and cannot be 
further factorised. 


Thus, x? - 2x? + 4x -8 = (a - 2) (x? +4). Now, write 


32? +5 A , BarG _ Aa’ +4) + (Br +0) (e-2) 
—Q024+4r-8 2-2 z2+4 © (x — 2) (x? + 4) 
_ Az? +4A+ Ba*+(C-2B)2-2C (A+B)a?+(C-2B)x4+4A-2C 
7 (x — 2) (x? + 4) 7 (x — 2) (x2 +4) 


Comparing coefficients, A+ B 2 -3, C-2B 2 0, and 4A -2C 35. From 2x 2+ 2 we get 
24+C 4-6. Next, 2x 4+ 2 yields 8A =-7 or A= -7/8. Now we also have B = -17/8 and 
C =-17/4. 


377 +5 of, Tle +2) 
a3 —In?+4¢-8 8(x-2) 8(x2 +4)’ 








Thus, 
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149.24. Ch. 41 Answers (Solving Inequalities) 


A173(a) (22 + 3) /(-%+7) <9 <=> 9+(244+3)/(a@-7)>0 <> (9x - 63+ 27+3)/(x-7) > 
0 <> (11-60) /(x-7)>0 <> (11r-60,2-7>0 OR 1lx-60,2-7<0). a % 
lle -60,2-750 <> (x>60/11AND2>7) <> 2>7. 

lle -60,2-7<0 <> (x<60/11AND2<7) <> 2<60/l1. 


ZN 















60 
11 








(b) (-4a +2) /(a+1)>13 <=> (-40+2- 1382-13)/(4+1)>0 — > (-17x2-11)/(4+1)> 
(<> <17e=1Leods 0 OR 172 =11.¢41< 6: a % 


-W7e-1je+130 <> (4<-1/17ANDx>-1) <> xe (-1,-11/17). 
-17x-11,2+1<0 <=> (4>-11/17 ANDx<-1) <= Contradiction. 


uN 







Altogether then, 


—47 +2 
r+1 





11 
>i13 ze(-1,-7). 
17 
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A175(a) Since a < 0, this is a n-shaped quadratic. Since b?-4ac = 17-4 (-3) (-5) = -59 < 0, 
the graph doesn’t touch the z-axis at all and is completely below the x-axis. Hence, there 
are no values of x for which this inequality is true—the solution set is 2. af 


Y 


ZN 








LED 


(there are no values of x 
for which —327 + x -5>0) 





(b) Since a > 0, this is a U-shaped quadratic. Since b? - 4ac = (9-40) Gl 28s 6 
the graph intersects the x-axis at two points—these are simply given by the two roots of 
the quadratic, which are x = (2 + v8) /2=1+ /2. Hence, x? - 2x7 -1>0 is true “outside” 


those two roots—the solution set is R \ [1 a ae v2]. af 





y=a?-Qe-1 


\ 








i732 














A176(a) |x -4|< 71 <— > -71<a4-4<71 <— > -67<2< 75. The solution set is [-67, 75]. 
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(b) [5-2] >13 <— > (5-2>130R5-2<-13) — (-8>20OR18<z2z). The solution 
set is (—00, -8) U (18, co) or R \ [-8, 18]. t 
(c) Sketch y = |-32 + 2|- 4 and y=a-1. Observe |-32+2/-4>a2-1 <= 1 is to the left 
or right of the two intersection points P and Q. P is given by -37+2-4=a2-1 or -1=42 
or x =-1/4. Q is given by 32 -2-4=a2-1 or 2x =5 or x = 5/2. Thus, the solution set is 
(—o0, -1/4] U[5/2, 00) or Rv (-1/4, 5/2). a 





y =|-32 + 2|-4 








(d) Sketch y = |x +6] and y = 2|2x - 1]. Observe that |x + 6| > 2|27-1] <= z is between 
the two intersection pointsP and Q. P is given by -~-6 = 2(2x - 1) or -4 = 5a or x = -4/5. 
Q is given by 7 +6 =2(2x-1) or 8 = 32 or x = 8/3. Thus, the solution set is (—4/5, 8/3).% 


aN 
























y = |x +6 ! 
! 8 ; 
3 
: > 
1 2 
ager, 2 7el_ 24. 1 oy _ ke, wy y el | 
1+|r+y| 1+|x+y| 1+ al+|y| 1+ |al+|y| 1+ 2z|+|y|~ 1+ |e] 1+(y| 


1 2 
where <uses the Triangle Inequality and < uses 1 + |x| +|y| > 1+ |x| and 1+ |x] +|y| > 1+ yl. 
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A178(a) N=27+2x2+1=(r+ 1) is positive everywhere except at x = —-1, where it equals 
zero. So, the inequality is equivalent to 7? -372+2>0 AND x ¢-1. 


Observe that 2?-3a+2 = (x -1) (x -2) is a U-shaped quadratic which intersects the z-axis 
at 1 and 2. Thus, 2?-32+2>0 < >zeR\[1,2]. 


Thus, the inequality’s solution set is R\[1,2]\{-1} or (-00,1)uU (-1,1) U (2, ). % 











x € (—00,-1) U (-1, 1) U (2, 0) 


1 xit+2e+1 
solo. —$—$=—=$—— > 
‘ fee Bp 
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A178(b) N =2?-1=(x2+1)(x-1) is positive if x ¢ R\[-1,1] and negative if x « (-1,1). 
D=x?-4=(x+2) (x -2) is positive if x ¢ R\[-2,2] and negative if x ¢ (2, 2). 
The given inequality is true if N,D>0 or N,D <0. We have 


N,D>0 <— gceR\[-1,1] AND zeR\[-2,2] <> zeR\[-2,2]. 
N,D<0 <> we(-1,1) AND ve (-2,2) <> we(-1,1). 


Thus, the inequality’s solution set is R\[-2,2]U (-1,1) or (-00,-2) U(-1,1)U (2,0). 








(—oo - 2) U(-1,1)U(2, 0) 


S108 
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A178(c) N = 2? - 32-18 = (x-6) (x +3) is positive if x ¢ R\[-3,6] and negative if 
x € (-3,6). 


D=-2x? + 9x -14=-(x-2)(x-7) is positive if x € (2,7) and negative if x ¢ R\ [2,7]. 
The given inequality is true if N,D >0 or N,D <0. We have 


N,D>0 <> weR\[-3,6] AND re (2,7) <> ze (2,7)\[-3,6] = (6,7). 
N,D<0 <> xe(-3,6) AND ze R\[2,7] <=> xe (-3,6) \ [2,7] = (-3,2). 


Thus, the inequality’s solution set is (-3,2) U (6,7). 


NM 
(-3, 2) U (6, 7) 
x? —3x-18 


lves —.————_ > 0. 
iad —1? +9x-14 








xe? — 32-18 
—7? + 9x -14 





Y= 





V 
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A179(a) Rewrite the inequality as 2? -a2?+2-1-e*>0, then graph y = 2° -2?+2-1-e¢" 
on your T184. It looks like the graph may be a little above the x-axis near 3, so let’s zoom 
in. But we'll zoom in not through the ZOOM function, but by adjusting Xmin and Xmax 
in the WINDOW menu (I’ve adjusted them to 0 and 4). 


And now as usual, we can “simply” find the two roots using the ZERO function. They are 
x ® 3.047, 3.504. 


Altogether then, based on these two roots and what the graph looks like, we conclude: 


g—a?t+ae-1l>e® —a2?-2?+2-1-e>0 <— re (3.047...,3.503...). at 


After graphing. After zooming in. The two roots. 


(b) Rewrite the inequality as //x -cosx > 0, then graph y = \/z — cos on your TI84. It 
looks like there’s at least one x-intercept near the origin, maybe more. So let’s zoom in. 


Alright, it looks like there’s only one x-intercept. We can as usual find it using the ZERO 
function—zx * 0.642. 


Altogether then, based on this one root and what the graph looks like, we conclude: 





Jx>cosxr — > V/xz-cost>0 —> x > 0.642. x 


After graphing. Zoom in. The only root. 


(c) Rewrite the inequality as 1/ (1 -x?)-23-—sin x > 0, then graph y = 1/(1-2)-2 
on your TI84. It looks like there’s only one x-intercept near x = -1. 


So let’s simply find it using the ZERO function—zx » -1.179. 
Altogether then, based on this one root and what the graph looks like, we conclude: 





3_sing 





>a+sing <> -g?-sing>0 <> xe (-00,-1.179...)U(-1,1). * 


1-2? 1-2? 





After graphing. The only root. 
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149.25. Ch. 42 Answers (Extraneous Solutions) 


A180. We verify that 2 20 and x = 71/2 satisfy =, while « 27 and x = 7/2 do not and are 
extraneous solutions. 


As usual, the squaring operation in Step 1 is an irreversible step. That is, this implication 
is true: 


2 
sinz + cosz =1 => (sinz +cosz)° = 1?. 
; 2 
But its converse is not(sin x + cos x)" = 1? > sing + cosz = 1. 


For example, (sin7t+ cos 7) = (-1)° = 1, but sin7t+ cos7 = -1. 


And so, we may (and indeed do) introduce extraneous solutions in Step 1. 


A181. We verify that x 2 1 satisfies = but x 2 64 does not and is an extraneous solution. 


Plugging x 2 1 back into the original equation 2 we have 
12714+12+V¥1+1=14+14+14+1=4#0, 


so that x a4 does not solve z 


The error is in Step 3, where raising to the power of 6 is an irreversible step (similar to 
squaring). That is, this implication is true: 


But its converse is not: a® = b® > i= 0D. 


For example, (-1)° = 1° but -1 #1. 


And so, we may (and indeed do) introduce extraneous solutions in Step 3. 


A182. We verify that x «1 does not satisfy + and so is an extraneous solution. 


The error is in Step 3: Plugging 3 into = is an irreversible step. 


It is true that 2 implies . (More explicitly, if there exists x € R that satisfies 3 then any 


such x also satisfies 2) 


However, the converse may not be true and indeed isn’t. There may exist x € R that satisfies 
2 (and there does, namely x = 1), but such x may not (and indeed does not) satisfy Z. 


And so, we may (and indeed do) introduce extraneous solutions in Step 3. 
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149.26. Ch. 44 Answers (Conic Sections) 


A193. Translate x?/a? + y?/b? = 1 leftwards by c units to get (x +c)” /a? + y?/b? = 1. Then 
further translate downwards by d units to get (x +c)? /a? + (y +d)’ /b? = 1. t 





(w+e)” | (ytd) _ ! : 


a? b2 : 













(-c,b-d) is a strict 
global maximum. 


Line of symmetry 
y=-d 


Line of symmetry 
L=-C 


Se ee 


(-c,-b +d) is a strict a 
global minimum. 





So,(x +c)’ /a?+(y +d)’ /b? = 1 is the exact same ellipse as x2/a? +y?/b? = 1, but now centred 
on the point (-c,—-d) (instead of the origin). 

x? /a? + y?/b? = 1 had two turning points—the strict global maximum (0,b) and the strict 
global minimum (0,-b). Since (x +c)? /a? + (y+d)° /b? = 1 is simply 2?/a? + 2/0? = 1 
translated c units leftwards and d units downwards, (x +c)” /a? + (y +d) /b? = 1 again has 
two turning points—the strict global maximum (-c,b-d) and the strict global minimum 
(-c, -—b - d). 

By observation, there are no asymptotes. 

By observation, there are two lines of symmetry y = -d and x = -c. 


(Answer continues below ...) 
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(... Answer continued from above.) 


Following the hint, to find the y-intercepts, plug in x = 0: 














(O+ c) (y+ d)° (y+ d) OD gia 
2 + 2 =|] <=> 3 =|- 3 = 5 
b ji oe ; : 
2 wad b 2_ 2 
So, if |a] > |c|, then the y-intercepts are (0 —d- we and (0 —d+ wee 
a 


If ja| = |c|, then the (only) y-intercept is (0,-d). (Either the leftmost or rightmost point of 
the ellipse just touches the y-axis.) 


And if |a| < |c|, then there are no y-intercepts (the ellipse doesn’t touch the y-axis). 


Similarly, to find the x-intercepts, plug in y = 0: 


(nto) (+d)? _ 











a? b> : e BP PR 
r+c +Vb-d? av b? — d? 
—>> = —>_— « T=-CH+ 
a b b 
ae) Jpeom 
So, if |b] > |d|, then the x-intercepts are (-- wo) and (-c+ of 


If |b| = |d|, then the (only) x-intercept is (0,-d). (Either the topmost or bottommost point 
of the ellipse just touches the x-axis.) 


And if |b| < |d|, then there are no z-intercepts (the ellipse doesn’t touch the z-axis). 
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32 +2 4 
_3t+2 


A194 Do the | division: = 3- —.. 
(a) Do the long division: y 5 5 





1. There are two branches—one on the top-left and another on the bottom-right. 
2. Intercepts. Plug in x = 0 to get y = 2/2 = 1. So, the y-intercept is (0,1). 

Plug in y = 0 to get 32 + 2 =0 or x = -2/3. So, the x-intercept is (-2/3,0). 
3. There are no turning points. 


4. Asymptotes. The value of x that makes the denominator 0 is -2—hence, the vertical 
asymptote is x = —2. 
The quotient in the long division is 3—hence, the horizontal asymptote is y = 3. 


(Note that since the two asymptotes x = —2 and y = 3 are perpendicular, this is a 
rectangular hyperbola.) 

5. The hyperbola’s centre (the point at which the two asymptotes intersect) is (-2,3). 
(These coordinates are simply given by the vertical and horizontal asymptotes.) 

6. The two lines of symmetry may be written as y = «+a and y = -% + and pass 
through the centre (—2,3). Plugging in the numbers, we find that a = 5 and 6 = 1. 
Thus, the two lines of symmetry are y=2+5 and y=-x+1. 





Line of symmetry 
y=r+d __- 


Line of symmetry 
~. vece +1 


- 


~ 
~ 


- 


---~ Horizontal asymptote 








Vertical asymptote 
z=-2 
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eo tae 


A194(b) Do the long division: y = ~ 
94(b) Do the long division: y ee a 








1. There are two branches—one on the top-right and another on the bottom-left. 

2. Intercepts. Plug in x = 0 to get y = -2/1 = -2. So, the y-intercept is (0,-2). 
Plug in y = 0 to get x-2=0 or x =2. So, the z-intercept is (2,0). 

3. There are no turning points. 

4. Asymptotes. The value of x that makes the denominator 0 is 1/2—hence, the vertical 
asymptote is x = 1/2. 
The quotient in the long division is -1/2—hence, the horizontal asymptote is y = -1/2. 


(Note that since the two asymptotes x = 1/2 and y = -1/2 are perpendicular, this is a 
rectangular hyperbola.) 

5. The hyperbola’s centre (the point at which the two asymptotes intersect) is (1/2, -1/2). 
(These coordinates are simply given by the vertical and horizontal asymptotes.) 

6. The two lines of symmetry may be written as y = +a and y = -% +8 and pass 
through the centre (1/2,-1/2). Plugging in the numbers, we find that a =-1 and 6 =0. 
Thus, the two lines of symmetry are y = x -1 and y = -2. 








Line of symmetry 
Ris apc 


Line of symmetry 
al alt Or ll 


- 
- 












Horizontal asymptote 


1 


ai ole 
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-3c+1 3. 11/2 
= + 


A194(c) Do the long division: y = ae ae 
ae te 








1. There are two branches—one on the top-right and another on the bottom-left. 

2. Intercepts. Plug in x =0 to get y=1/3. So, the y-intercept is (0, 1/3). 
Plug in y = 0 to get -3% + 1 =0 or x = 1/3. So, the x-intercept is (1/3,0). 

3. There are no turning points. 

4. Asymptotes. The value of x that makes the denominator 0 is -3/2—hence, the vertical 
asymptote is x = -3/2. 
The quotient in the long division is -3/2—hence, the horizontal asymptote is y = —3/2. 


(Note that since the two asymptotes x = —3/2 and y = —3/2 are perpendicular, this is a 
rectangular hyperbola.) 

5. The hyperbola’s centre (the point at which the two asymptotes intersect) is (-3/2, -3/2). 
(These coordinates are simply given by the vertical and horizontal asymptotes.) 

6. The two lines of symmetry may be written as y = «+a and y = -7+ and pass through 
the centre (-3/2,-3/2). Plugging in the numbers, we find that a =0 and 6 =-3. Thus, 
the two lines of symmetry are y = x and y= -x -3. 


IN 
Vertical asymptote a 
a 
L=-= 
2 


Line of symmetry 


Line of symmetry 
‘—~ Ye -t—d 


~ 


~ 








‘ooo Horizontal asymptote 
i ee 3 
Centre ee 
3 3 Oia. 
73) om 








1824, Contents www.EconsPhDTutor.com 


e? +2641 ee 25 
—<——__ Si ; 

xz—-A g—4 
Intercepts. Plug in x = 0 to get y = 1/(-4) = -1/4. Thus, the y-intercept is (0,-1/4). 
Plug in y = 0 to get x? +22 +1=0, an equation which has one (real) solution « = —1. Thus, 
the (only) x-intercept is (-1,0). 





A195(a) Do the long division: x 


Asymptotes. The vertical asymptote x = 4 is given by the value of x for which x - 4 = 0. 
The oblique asymptote y = x + 6 is given by the quotient in the long division. 


The centre’s x-coordinate is given by the vertical asymptote x = 4. For its y-coordinate, 
plug x = 4 into the oblique asymptote to get y=4+6=10. Hence, the centre is (4,10). 


You should be able to sketch the two lines of symmetry and the two turning points.” 
(The two lines of symmetry run through the centre and bisect an angle formed by the two 
asymptotes. And there is one turning point for each branch.) 













In this example, the x-intercept 
(-1,0) coincides with the 
maximum turning point. 


7 


oi = (1+ V2)2+6-4V2 


7 
7 
7 
7 
7 








d 2 cs 

To find the turning points, write 7 = < (« +6+ ~.) =1-25(2-4)” £0 <> (a — 4)? = 25. 
xL x LR 

So z =-1,9. The corresponding y-values are -1 +6 + 25/ (-1-4) =0 and 9+ 6+ 25/ (9-4) = 20. Thus, 

the two turning points are (-1,0) and (9,20). (If necessary, we can also show that these are respectively 


the strict local maximum and minimum.) 





1825, Contents www.EconsPhDTutor.com 


BD cap 2 
A195(b) Do the long division: oe Se 
ee ee 





ae 


Intercepts. Plug in x = 0 to get y = -1/1 = -1. Thus, the y-intercept is (0,-1). Plug in 
y = 0 to get -x?+a-—1-=0, an equation for which there are no (real) solutions. Thus, there 
are no x-intercepts. 


Asymptotes. The vertical asymptote « = —1 is given by the value of x for which «+1 = 0. 
The oblique asymptote y = —x + 2 is given by the quotient in the long division. 


The centre’s x-coordinate is given by the vertical asymptote 7 = —-1. For its y-coordinate, 
plug x = -1 into the oblique asymptote to get y = —(-1)+2 = 3. Hence, the centre is (-1,3). 


You should be able to sketch the two lines of symmetry and the two turning points.°” 
nN 
_-a +n-1 y 
gt] 







y=(-1-V2)o+2+V2 


ssa. (-1+ V3,3-2v3) 


~ 
. 

















d d 3 : 
®°To find the turning points, write - ae (-2 <2 —) =-1+3(£+1)” £0 <> (x+1)? =3. 


So « =-1+ V3. The corresponding y-values are 1 ¥ V3 + 2- 3/ (-1 24/3 + 1) = 3+2\/3. Thus, the two 
turning points are (-1 +/3,3F 2/3). (If necessary, we can also show that these are respectively the 


strict local maximum and minimum.) 
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ae 





207 = — 
Al Do the | ivision: ———————— = —1 . 
95(c) Do the long division ae x 0+ 


Intercepts. Plug in x = 0 to get y = -1/4. Thus, the y-intercept is (0,-1/4). Plug 
in y = 0 to get 27? - 2x —-1 = 0, an equation for which there are two (real) solutions: 


Le (2 + v12) /4= (1 4 v3) /2. Thus, there are two z-intercepts: ((1 as v3) /2, 0). 
Asymptotes. The vertical asymptote « = —4 is given by the value of x for which «+4 = 0. 
The oblique asymptote y = 2x — 10 is given by the quotient in the long division. 


The centre’s x-coordinate is given by the vertical asymptote x = —4. For its y-coordinate, 
plug x = —4 into the oblique asymptote to get y = 2(-4) - 10 = -18. Hence, the centre is 
(-4,-18). 


You should be able to sketch the two lines of symmetry and the two turning points.™! 


zn 





y= (2+V5)0-10+4V5,. 











> 











®!To find the turning points, write dy = a (20 -10+ a =2-39(4+4)° Li aay (x +4)” = 39/2. 
dx dx +4 


So « = -4+/39/2. The corresponding y-values are 2 (-4 + 39/2) - 10+ 39/ (-4 + 4/39/2 + 4) =-18+ 
2/78. Thus, the two turning points are (-4 + \/39/2,-18 + 2V 78). (If necessary, we can also show that 


these are respectively the strict local maximum and minimum.) 
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149.27. Ch. 45 Answers (Simple Parametric Equations) 


A349(a) As stated in the above example, at time t = 0, particle P is at (x,y) = (cos0,sin0) = 
(1,0). In contrast, particle Q is at (x, y) = (sin0,cos0) = (0,1). 

(b) At ¢ = 0, Q is at (x,y) = (sin0,cos0) = (0,1). A little after t = 0, x = sint will have 
grown a little while y = cost will have shrunk a little. That is, the particle Q will have 
moved a little to the right and a little to the south. Thus, Q travels clockwise. 


(c) Every 27ts, each particle travels one full circle. Therefore, at t = 66471, each particle 
will be at its starting point. And 7ts later, each particle will have travelled an additional 
half-circle. Thus, at ¢ = 66571, particle P will be at (x,y) = (cos7t,sin7) = (-1,0) (at the 
left of the circle), while particle Q will be at (x,y) = (sin 71, cos7t) = (0,-1) (at the bottom 
of the circle). 


(d) The two particles are at the exact same position whenever (cost, sint) = (sint, cost). 


Thus, they are at the exact same position whenever cost = sint. By inspecting the graphs 
of cos and sin (see e.g. p. 372), we see that this occurs at t = (k + 1/4) 7, for every k € Z)5. 


(e) For the particle Q, we have 


d d d d? du; d? 
U_ = —~ = cost, vy = =-sint, Ay = ee Sg yo __ 
dt dt 











A350(a) For the particle R, we have 


Vv ee asint es eee a dv, _ die acost, a my dy bsint 
v dt a pe de de ane: ne: ae 











(b)(i) At t=7/4, (a,y) = (acos (7/4), bsin (7/4)) = (av2/2,bV2/2), 
(Uz, Vy) = (-asin (7/4) , bcos (71/4)) = (-av2/2,bV2/2), 
(dz, dy) = (—acos (7/4) ,-bsin (71/4)) = (-av2/2, -bV/2/2). 


The particle R starts at (a,0) and travels anticlockwise. At ¢ = 7/4, R has completed 
one-eighth of the full revolution and is now at the top-right of the ellipse; it is travelling 
leftwards at aV/2/2ms"! and upwards at b/2/2ms"!; and it is accelerating leftwards at 
aV/2/2ms~ and downwards at bV2/2ms~?. 


(b) (ii) At ¢ = 77/2, (x,y) = (acos (7/2) , bsin (71/2)) = (0,0), 
(Uz, Vy) = (-asin (7/2) , bcos (7t/2)) = (-a,0), 
(dz, @y) = (—acos (7/2) ,-bsin (7/2)) = (0, -0). 


At t = 7/2, R has completed one-quarter of the full revolution and is now at the top of the 
ellipse; it is travelling leftwards at ams (and not upwards at all); and it is accelerating 
downwards at bms~ (and not rightwards at all). 


(Answer continues below ...) 
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(... Answer continued from above.) 
A350(b) (iii) At t=27, (x,y) = (acos 27, bsin 271) = (a, 0), 
(Uz, Uy) = (-asin 271, bcos 27) = (0,0), 
(dz, dy) = (-acos 27, —bsin 27t) = (-a,0). 
At t = 27m, R has completed one full revolution and is back at its starting position; it is 


travelling upwards at bms™! (and not rightwards at all); and it is accelerating leftwards at 
ams” (and not upwards at all). 


7 aN 7 

Att=— = 

5" Y At t A 
(0.9) = (0.0), ep ele 
(Uz; Vy) ~ (—a,0), aloes oS 2° 


(dz, ay) = (0,-5). 











(Q,0;) = (-«S. ca ; 







At t= 2H, 


(x,y) es (a,0), 
(Qe. iy) = (0, b), 
(x,y) = (-a,0). 


v2 y? 
U={@): +451] 


eee ee eer Arrows indicate instantaneous 


direction of travel. 





(c) At t=0, (,y) = (acos0, bsin0) = (a,0) and (vz, v,) = (-asin0,bcos0) = (0,6). Hence, 
(i) If a,b <0, R starts at the left of the ellipse. It also starts by moving downwards and is 
thus moving anticlockwise. 


(ii) If a>0, b<0, R starts at the right of the ellipse. It also starts by moving downwards 
and is thus moving clockwise. 


(ii) Ifa<0,b>0, R starts at the left of the ellipse. It also starts by moving upwards and 
is thus moving clockwise. 


A197(a) An instant after t = 1.571, the particle magically reappears “near” “bottom-right 
infinity” (00, -00). 

(b) During t € (1.571,2.57t), the particle moves upwards along the right branch of the 
hyperbola. At t = 271, it is back to its starting position (1,0). And as t > 2.571, it “flies off” 
towards “top-right infinity” (00, 00). 
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A351(a) x=tant, y=sect => y?-27=1. 

(b) At t = 0, (x,y) = (tan0,sec0) = (0,1)—the particle B is at the midpoint of the top 
branch of the hyperbola. 

(c) v, = dx/dt = sect is always positive and so the particle is always moving rightwards. 
(d)(i) At t = 0, B starts at the midpoint of the top branch of the hyperbola. During 
t €[0,0.57t),B travels rightwards along the right portion of the top branch. As t > 0.57, B 


“flies off” towards the “top-right” infinity; its position, velocity, and acceleration in both 
the x- and y-directions approach oo. 


(d)(ii) An instant after t = 0.57, B magically reappears “near” “bottom-left” infinity. 
During ¢ € (0.57, 1.570), it travels rightwards, along the bottom branch of the hyperbola. 


Specifically, during t € (0.571, 71), it is on the left portion of the bottom branch. At t = 7, it 
is at the midpoint of the bottom branch. And during t € (7, 1.57t), it is on the right portion 
of the bottom branch. 


As t > 1.57, B “flies off” towards the “bottom-right” infinity; its position, velocity, and 
acceleration in the x-direction approach oo; and in the y-direction, they approach —oo. 


(d) (iii) An instant after t = 1.57, B magically reappears “near” “top-left” infinity. During 
t € (1.57, 2.571), it travels rightwards, along the top branch of the hyperbola. 


Specifically, during t € (1.571, 27t), it is on the left portion of the top branch. At t = 27, it 
is back to its starting position—the midpoint of the top branch. And during ¢ € (271, 2.571), 
it is on the right portion of the bottom branch. 


Ast > 2.57, B again “flies off” towards the “top-right” infinity as it did when ¢t approached 
0.57t; its position, velocity, and acceleration in both the x- and y-directions approach oo. 


(e) At t=0, B is at the midpoint of the top branch—hence, By. 


Since 1 € [0,0.57t) » [0,1.57), at t= 1, B must be on the right portion of the top branch— 
hence, B,. 


Position | Ba | By | Be 
Timet| 5 ]0,>1]2/3/ 4 














Since 2,3 € (0.571, 7) » (1.57, 3.14), at t= 2 and t = 3, B must be on the left portion of the 
bottom branch. Since B is “near” “bottom-left” infinity an instant after ¢ = 0.57 and t = 2 
is earlier than ¢ = 3, it must be that t = 2 corresponds to By and t = 3 corresponds to Be. 


Since 4 € (7, 1.571) » (3.14,4.71), at t= 4, B must be on the right portion of the bottom 
branch—hence, By. 


Since 5 € (1.57, 27t) » (4.71,6.28), at ¢ = 5, B must be on the left portion of the top 
branch—hence, Bg. 
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A198(a)(i) Rewrite x = t-1 ast =2+1 and plug this into y = In (t+ 1) to get y = In(x +2). 


Noting that t>O0 <— > t+12>1 <— > y=lIn(t+1) 20, we can rewrite the set as: 


A= {(x,y):y=In(x +2) ,y 2 Of. 


(ii) The graph of y = In(# +2) is simply the graph of y = Ina shifted leftwards by 2 units. 
Note the constraint y > 0—the particle A travels only along the black graph and does not 
travel along the grey portion. a 


At t=e-1#1.72, 
(x,y) = (2 ~ 2, 1) a (0.72, 1) 


| 
x) = L_———= = 1, : 
(v2, 0) = (1, } » (1,07) 










—Y 





Starting Pa 


point 
Att=0, . 


(x,y) = (-1,0) 
(ve, Vy) = (1, 1) 





A=i (ig) er=t—1Lay=ln(i +1), 720} 
={(7,y):y=\in(2+2),y>0} 





d d 1 
(iii) As usual, compute v, = on = land v, = 7) aa 


At t =0, A starts at the position (x,y) = (0-1,ln(0+1)) = (-1,0), and is moving right- 
wards 1ms~! and upwards at 1/(0+1)=1ms"t. 

A’s rightwards velocity stays fixed at 1ms', while its upwards velocity decreases towards 
zero. As time progresses, A travels steadily towards “top-right infinity”. 
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A198(b)(i) Rewrite x = 1/(t+1) ast = 1/x-1. Plug into y = ??+1 to get y = (1/x-1)° +1. 


i 
Noting that? >0 <> ¢4+12>1 — gre a € (0, 1], we can rewrite the set as: 
+ 


B= {(y):y=(2-1) +L26(0.1}} 


(ii) Using your graphing calculator, we see that the complete graph of y = (1/x - 1) +1 
has two branches. % 


However, we have the constraint x € (0,1]. And so, particle B travels only along the black 
graph and does not travel along the grey portion. 


1 
B= [e=—— yah’ 1,20] 
{(o.9) w= syst +1, 


- {ew su= (2-1) +126 00.1] 











ay 1 d 
(iii) As usual, compute v, = — = -———, and v, = oy, 
dt = (t+1) 
At t = 0, B starts at the position (x,y) = (1/ (0 +1) ,0? + 1) = (1, 1), and is moving leftwards 
at 1/(0+ i =1ms"™ and is upwards at 2-0=Oms!. That is, B is initially not moving in 
the y-direction. 
As time progresses, B move leftwards and upwards. Its leftwards velocity decreases towards 
zero, while its upwards velocity increases towards infinity. 
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A198(c)(i) Rewrite x = 2sint-1 as (x +1)/2=sint and y =3cos*t as y/3 = cos*t. By the 
identity sin? t + cos?¢ = 1, we have [(x +1) /2]? + (y/3) =1, or, 


x+1\? g2 +2741 3 3. «O@¢dW 
=— 1- = 1- =—_— 2 
y s| ( 5 ) | 3] Zi | Tha a a 


Noting that t>0 = > sinte[-1,1] <= x =2sint-1€ [-3,1], we may rewrite the set as 
C= { (a, y):y = 0.752? — 1.54 + 2.25, 2 € [-3, 1}}. 











(ii) The graph of y = -0.75x? - 1.52 + 2.25 is simply a n-shaped quadratic, with turning 
point at x = -1 and roots x = -3,1. But note the constraint x € [-3,1]—the particle C 
travels only along the black graph and not along the grey portion. te 


za 


C= 1(@,9)* f= cae 1y= 3cos’ t,t > 0} y 
3 9 
2 
nee ee, ree 1 
={(.9); ? 2° aa’ eels, } Starting 


point 












— 





At t=0, 


(x,y) = (-1,3) 
(vx, Vy) 7 (2, 0) 





37t 
At t = — 
pa 


(x,y) = (-8, 0) 
(vz, Vy) = (0, 0) 


og 


7 
At t=— 
a? 


(x,y) = (1,0) 
(tig, Oy) = (0,0) 


S > 






At t= 27, 
C' returns to its 
starting position. 








d 

<7 O _6 costsint. 

dt 
At t = 0, C starts at (2 sin 0 - 1,3 cos? 0) = (-1,3) (the maximum point of the parabola) 
and has velocity (v;,v,) = (2cos0,-6sin0cos0) = (2,0). That is, it is moving rightwards 
at 2ms/ (and not moving in the y-direction). 


dx 
(iii) As usual, compute v, = rh = 2cost and v, = 


During t € (0,0.57t), it moves rightwards along the parabola. At t = 0.57, it is at the 
rightmost point of the black graph and (vz, v,) = (0,0). 


It then does a U-turn—during t € (0.571, 1.571), it moves leftwards along the parabola. At 
t = 7, it is again at the maximum point of the parabola. And at t = 1.57, it is at the 
leftmost point of the constrained parabola and again (v;, vy) = (0,0). 


It then does a U-turn—during t € (1.57, 271), it moves rightwards along the parabola. And 
at t = 271, it is again at the maximum point of the parabola. 


The particle has completed one period and will during t € [ 271, 471] repeat exactly the same 
movement made during t € [0,27t]. And so on. 
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150. Part II Answers (Sequences and Series) 


150.1. Ch. 46 Answers (Sequences) 


A199(a) Define a: {1,2,3,4,5,6,7, 8,9, 10} > R by a(n) =n. 

(b) Define b: {1,2,3,...,100} > R by b(n) = 3n-1. 

(c) Define c: {1,2,3,4,5,6,7} > R by c(n) =n’. 

(d) Define d: Z* > R by d(n) = 2n for n odd and d(n) = 3n for n even. 


(e) There is no obvious pattern here. So simply define e : {1,2,3} > R by e(1) = 5, 
e(2) =0, and e(3) = 99. 


(f) Define f:Z* > R by f(n) =1x2x---xn=nl. 


(g) Observe that the Ist, 3rd, 6th, 10th, 15th, and 21st terms are 1, while all the other 
terms are 0. Thus, define g: Z* > R by g(n) = 1 ifn isa triangular number (i.e. 1,3,6,...) 
and g(n) = 0 otherwise. 


A200(a) (cn) = (-1,-3,-5,-7,-9,.-.). 

(b) (dp) = (1,8, 27,64, 125,...). 

(c) (cy + dn) = (0,5, 22,57, 116,...). 

(d) (@,=d,) = (2 Al): -71, 134... ). 
(e) (Cndy) = (-1,-24, -135, -448, -1125,...). 


(f) () = (-1-3-3.-a- | 
dy 8 27° 64’ 125 

(g) (ken) = (-2,-6,-10,-14, -18,...). 

(h) (kdn) = (2, 16,54, 128, 250,...). 





150.2. Ch. 47 Answers (Series) 


(This chapter had no exercises.) 
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150.3. Ch. 48 Answers (Summation Notation ©) 


7 
A201(a) Do nl! =14+2+6+4 24+ 120+ 720 +5040 = 5913. 


n=1 


8 
(b) >) (8n-1) =2+5+4+84+114 144+ 17+ 20+ 23 = 100. 


n=1 


Ln 3 5 i u 
(c) Merrit peas ese aud (d) ¥) (9-n) =8+74+64+54+4+3 =33. 


2 2 2 


n=1 


6 
A202(a) Do (n+1)!=1+2+6+ 24+ 120 +720 + 5040 = 5913. 


n=0 


i 
(b) >) (8n4+2) =24+54+84+114+ 14+ 17+ 20+ 23 = 100. 
n=0 


6 1 i} 5 
(c) Y (Se 5)-pelegeeegeseg-u4 (d) >) (8-n)=84+74+6+54+4+3 =33. 
n=0 


2 2 2 2 


n=0 


A203(a) s(2 Oy 20-1) 0-37 40-3) 0-2) 


i=1 


POs (Gly ey a1 14 16 16. 


17 
(b) ¥> (4* +5) = (4-16 +5) + (4-17+5) = 69 +73 = 142. 
*=16 


(c) > (x -3) = (31-3) + (32-3) + (33-3) = 28 + 29+ 30 = 87. 


A204(a) DE De eee eae aa Osh 


(b) Yn - 1) = ¥ (8n-1) =24+54+84114144+174+204+23+. 


n=l 


(©) 5 = Yea pele see osatae... 


(d) 3 (9—n) =F (9-n)=847464544434... 


A205(a) Do (n+1)!=1+2+6+ 244 120+720+5040+... 
n=0 


(b) Y) (3n-+2) = 24+54+84+114+144+174+20+23+. 


n=0 


i 


©) > (5+ r5)a gels pret seaegen. (d) >) (8-n) =84+7+6454+44+3+... 
n=0 


2 2 z 
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150.4. Ch. 49 Answers (Arithmetic Sequences and Series) 
A206(a) a; = 2, ax = 997, and d=5. So, k = (997-2) /5+ 1 = 200. Thus, the sum is 
(a1 +x) 5 = (2+997) = 99 900. % 
(b) b; = 3, by = 1703, and d=17. So, k = (1703-3) /17+1=101 terms. Thus, the sum is 
(b; +by) 5 = (3 +1703) - 86153. ue 
(c) c, = 81, cp = 8081, and d=5. So, & = (8081 - 81) /8+1=1001 terms. Thus, the sum is 


k 1001 
(c1 + cx) 5 = (81+ 8081) —— = 4085081, SY 
KI 


150.5. Ch. 50 Answers (Geometric Sequences and Series) 
A207(a) ai =7, ax = 896, and r = 2. By Corollary 20, the sum of this series is 


= a), 
Gy ~ TAK _ 7 896 _ 1785. % 
l=r 1-2 





(b) 6; = 20, by = 5/8, and r = 1/2. By Corollary 20, the sum of this series is 





anh a8 -o/ 53) = 40-2 





1.1 
rT = 8B (1-2) = ea 
ley l=, 2 729 243 
A208(a) a; =6 and r = 3/4. Thus, the sum of this series is 6/ (1 - 3/4) = 24. t 
(b) 6; = 20 and r=1/2. Thus, the sum of this series is 20/ (1 - 1/2) = 40. * 
(c) c, = 1 and r =1/3. Thus, the sum of this series is 1/ (1 - 1/3) = 3/2. 
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150.6. Ch. 51 Answers (Rules of Summation Notation) 


100 100 4 
A209(a) )°1= )°1- 5>1=100-4=96. *# 
n=5d n=l n=l 
100 100 4 
(b) )in= >)  n- Y)n=5050- 10 = 5040. a 
n=d n=1 n=l 
100 100 100 
(c) ) (nt+1)= dont ¥1=5040+ 96 = 5136. *# 
n=5d n=5 n=5 
100 100 100 
(d) >) (8n+2)=35)>n+2)°1=3-50404 2-96 = 15312. 
n=d n=d n=d 


100 100 
(e) Ving =2 > n=5040z. 
n=d n=5 











A210(a) Let Ss =1+2x2+3a7 + 4a? + 52%. 
Then, eS5 = x2 +207 +303 + 4x4 + 5°. 
1-2 
And so, S.=05,=(1<2) S521 beer’ po? 4g = 52? = 7 5a" 
~2 
tae be pba 16a? Soe" 
7 1-2 7 Ly 
i= 5 6 
Thus, Ss a os = 
(1-2) 
(b) Let S_= 1422 +307 +423 +--+ 4+ kat, 
Then, cS, = 0 +207 +322 +4et+---+ ke". 
— 
And so, Sp-aSp= (1-2) Sp = ltt a? ead taht — kat = 2 * kat 
~x 


_ bea ake the Jake le eke 


1-2 1-2 





_1-(k4+1)a* + kak 


Thus, S, 
° (l-2) 
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150.7. Ch. 52 Answers (Method of Differences) 


A211(a) Observe that 3 =4-1=2?-1, 8=9-1=3?-1, 15=4?-1, etc. 








i] 
Hence, the nth term is —— 
(n+1)°-1 
999 
And, eee sen eee : =e : 
3.8 15 24 35 999999 “4 1 (n+1) = 


Take the nth term, factorise its denominator, then do the partial fractions decomposition: 


Oe ee ee 
(n+1)?-1 (n+1-1)(n+14+1) n(n+2) 


A B _ A(n+2)+Bn_ (A+B)n+2A 
nm ont2 n(nt+2) — n(nt+2) ~ 





Comparing coefficients, A+ B=0 and 2A =1. Hence, A= 1/2 and B = -1/2 and 





999 1 
Thus, go 
mari) =i 
tow i! 1 1 
==tiot+m+—+—te + 
3 8 15 24 35 999 999 
5(; I: ot aE Od dd 1 1 1 i) 1 =): 
Se Se a ee eS Bie a Ss ee SS 
2\1 3 2 4 3 5 4 6 997 999 998 1000 * 999 1001 


Observe that all the terms with denominators 3 through 999 will be cancelled out. 














— 1 it 2 A 1 3 1 i 
Thus, sy s—= ( pe — ) =—- — = 0.749... x 
S(n+1-1 2412 1000 1001) 4 2000 2002 
More generall 3 : : : : Ss 
y; = ~ _ 7 
mi(nt1)?-1 4 2(k+1) 2(k+2) 
H Ee ee = lim : 
a 3° 815 24.35 kot (ne 1-1 





1 
3 1 1 3 
4 2(k+1) 2(k+2)) 4 
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(b) The nth term is simply: lg ue po lgn-Ig(n+1). 
nt 


= n 1 2 2 999 
ie ep ip eae i 
a Qe gt gt eat Bro 
=Ig1—-Ig2+lg2-Ig34+lg3-lg4+---+1g¢g999 — lg 1000 
=lg1-lg1000=0-3=-3. % 
n 
More generally: Pe era =lg1-lg(k+1)=-lg(k+1) 
n=1 n 
H (Sie oie nik >] "= lim (-Ig(k+1)) = a 
ann of es! “ee en ee ral 


That is, the infinite series diverges. 





1 
c) The nth term is . Rationalise the surds: 
(c) (nt+1)J/n+nV/n4+1 
1 (n+1)J/n-nJ/nt+1_ (nt+1)J/n-nvn+1 


(n+1) f/ntnJ/nt+1(n+1)J/n-nvn+1 (n+1)?n-n2(n+1) 
_— (ntlj)J/n-nvn+l (n+l) Jn-nvnt+1_ Vn _ vat 


n3 +2n?+n-(n? +n?) nn n ntl 





a 1 i i 1 
2, (n+1)J/ntnJV/nt+1 2/141V2 3/2+2V3 100\/99 + 99/100 


V1 v2, V2 v3 V99 100 
— _ + = eps esse ta = 


Thus, 


























1 2 2 3 99 100 
1 1 1 
ee ee s 
1 100 100 
k il Vk+1 il 
More generally: =] = = 12 : 5S 
. ae a Temes k . 5 
Hence lim a re lim (1-4) =1 
k>oo fai (n+1)JS/ntnJ/n+1 ko Vk+1 
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(d) First, observe that (n + 1)*-n* = 4n? + 6n? +4n +1. Hence, 


Ss [(n+1)*-n4] = S* (4n3 + 6n? + 4n +1) 


n=1 


n+ 


a 


k k 
1 
n=l i=l i=l =1 


On the other hand, we also have 


n=l 
=(k+1)°-142 k4 4 4k? + 6k? 4 4k. 


Putting = and 2 together, we have 


k 
4° n?+k(k+1)(2k+3) +k =k* + 4k? + 6k? + 4k 


n=1 


k 4 3 2 _ 3 2 4 5 0 2 2 
_ eee + 4k? + 6k? + 4k — (2k + 5k +4hk) _ kk +2k3 +k? _ ik (k +1) | 
n=l 


4 


4 
And so, we have in particular: 
100, 1002(100+ 1)” 10000(10201 
pg UN 
n=1 4 
The corresponding infinite series diverges. That is, 
i k2(k +1)? 
17427 +394+---= lim \ n= lim ueUAeon = 00 
k-oo 1 k-0o 4 
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k 
(et ont) jot a 1 eta 2 oe =o et Pel) Se 


4 


ae 


151. Part III Answers (Vectors) 


151.1. Ch. 53 Answers (Introduction to Vectors) 

















A212. Tail | Head | Length | This vector carries us ... 
AG= (4,7) = (*) =a| A G V65_ | 4unit(s) E and 7 unit(s) N 
BA(-3,-4)-(“ op B| A 5 |38 ©“ We4 © 8 
BG= (1,3) -(1} -« B G V7i0 |1 “ E*3 * N 
Gi=(1-7-(1) =a G A V65 14 “ W* 7 * S 
Gi =(-1,-2)-(“)}=« G B Vid, |1 * we3 * S 














A213. Here is one possible counterexample (yours may be different but still correct). 


Let u = (1,0) and v = (0,1). Then |u| = |(1, 0)| = V1? + 0? = 1 and |v| = |(0, 1)| = V0? + 1? = 1, 
so that |u| +{v| = 1+1=2. However, u+v = (1,1), so that jut v| =|(1,1)| = V1? +12 = V2. 
Hence, |u + v| # jul + |v}. 


(As we’ll learn later, the correct assertion is this: Ju + v| < |u| +|v|, with |u + v| = Jul + |v] if 
and only if u and v point in the same direction.) 


A214. Given the vector (4, -3): 

(a) If its tail is (0,0), then its head is (0,0) + (4, -3) = (4, -3). 
(b) If its head is (0,0), then its tail is (0,0) - (4,-3) = (-4,3). 
(c) If its tail is (5,2), then its head is (5,2) + (4,-3) = (9,-1). 
(d) If its head is (5,2), then its tail is (5,2) - (4,-3) = (1,5). 


A215. AC'+CB = AB, DC +CA= DA, BD + DA = BA, AD-CD = AD + DC = AC, 
-DC - BD = CD + DB =CB, and BD + DB = BB = (0,0) =0. 


A216(a) cv = (cv), cv2). 


(b) |ev| = |(cv,, cv2)| = (cv)? + (cv2)* = \/c2u? + v2 = \/c2 (v? +. v2) = |cly/v? + v2 = |e| |v]. 
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A217(a) AB = (1,3), AC = (4,2), BC = (3,-1). 

2AB = (2,6), 3AC = (12,6), 4BC = (12, -4). 

(b) |24B| = V2? +6? = V40 = 2V10, |AB| = V2 +3? = V10, and so |2AB = 2| AB. 
s4C| = 12? + 62 = 180 = 3/20, |AC| = V4? +2? = 20, and so sc = 3|AC|. 


4BC| = 12? + (-4)" = V160 = 410, |BC| = 3? + (-1)” = V10, and so |4BC| = 4| BCI. 


A218. The vectors b and c point in the exact opposite directions because c = —3b. 


The vectors b and d point in different directions because b + kd for any k. 
A219. By Facts 121 and 122, |c| = |c||¥| = |c]- 1 = |e. 


A220. The unit vectors of AB = (1,3), AC = (4,2), and BC = (3,-1) are, respectively, 


> (1,3) 1 ( i 32 
AB= ——_ = —(1,3) =-|——], 
V12 +32 in| ) 10 V/10 
ae (4, 2) 1 (= 7] 
AC= ———— = —=(4,2) =[—~,—], 
V4? + 22 m0 | ) V5 V5 
3 = 





—_  — > —_— > 
Of course, the above are also the unit vectors of 2AB, 3AC, and 4BC, respectively. 


A221. For v = (3,2), first write 3 = la +38 and 2220 +48. 
2 minus 2x + yields -26 = —4 or 6 = 2 and hence a =-3. Thus, 


(3). 


For w = (-1,0), first write -1 A lat 38 and 0 2 20 +46. 
2 minus 2x + yields -26 = 2 or 6 = -1 and hence a =2. Thus, 


(a)o(2) b= 
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A222. Since a + b, any vector can be written as a linear combination of a and b. 


For i= (1,0), first write a+ 76 +1 and 3a+ 58 20. 
2 minus 3x = yields -168 = -3 or 8 = 3/16 and hence a = —5/16. Thus, 


(3) =e(3) ie(3) 


For j = (1,0), first write a+ 76 +0 and 3a+ 568 21. 
2 minus 3x = yields -168 = 1 or G = -1/16 and hence a = 7/16. Thus, 


H(i) 26) 2) 


For d = (1,1), first write a+ 76 +1 and 30+ 5B 21. 
2 minus 3x = yields -168 = -2 or 8 = 1/8 and hence a = 1/8. Thus, 


1 1/1 1 eg 

d= == += ; 
A223. The position vectors of P, Q, and R are 

6a+5b 6/1 9 | 3 1 
p = DOC + — - 
59+6 11\ 2 11\ 4 11 

a+ 5b 1/7 1 o{ 2 

q = = + — 

5+1 6\ 4 6\ 3 
3a+2b 3/ -1 oa es il 
fT = = + — = — 
2+3 5\ 2 5\ -4 5 


2k: Be 11 19 3. 2 
Thus, P= (=.5), G) = (= =) and R= (= -=). 
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151.2. Ch. 54 Answers (Lines) 


A224(a) (1,3) (or any scalar multiple thereof). (b) (1,-2/7) (ditto). (c) (1,0) (ditto). 
(d) (0,1) (ditto). 


A225(a) The line described by -5a+y+1=0 contains the point (0,-1) and has direction 
vector (1,5). Thus, it can also be described by r = (0,-1) + A(1,5) (AER). A=-1, A=0, 
and A = 1 produce the points (-1,-6), (0,-1), and (1,4). 


(b) The line described by x - 2y-1=0 contains the point (1,0) and has direction vector 
(2,1). Thus, it can also be described by r = (1,0) +A (2,1) (A€ R). A= -1, A=0, and \=1 
produce the points (-1,-1), (1,0), and (3,1). 


(c) The line described by y- 4 = 0 contains the point (0,4) and has direction vector (1,0). 
Thus, it can also be described by r = (0,4) + A(1,0) (AER). A=-1, A=0, and A\=1 
produce the points (-1,4), (0,4), and (1,4). 


(d) The line described by x-4 = 0 contains the point (4,0) and has direction vector (0,1). 
Thus, it can also be described by r = (4,0) + A(0,1) (AER). A= -1, A=0, and A\=1 
produce the points (4,-1), (4,0), and (4,1). 


A226. If v =0, then the “line” would not be a line, but the single point P: 
{R:r=p+ Av (Ac R)}={R:r=p}={P}. 


And so, we impose the restriction that v # 0 to rule out the above trivial (or degenerate) 
case. 


A227(a) Write out x 2-1+) and y23-2h. 

Then 2 plus 2x 2 yields: y+ 2x2 =1 or y=-2r+1. 

(b) Write out x 25+7A and y 26482. 

Then 7x 2 minus 8x = yields: 7y - 8x =2 or y= =a + = 

(c) Write out x = 3d and y 2-3. 

This is a horizontal line. We can discard + and be left with the single equation y 23. 
(d) Write out x 21 and y21+2A. 


This is a vertical line. We can discard 2 and be left with the single equation x £1, 








| 7 
A228(a) — Jy D rape ped: 








=) 
z-5 y-6 8 2 

b = as nee 

y= 2G a 

(c) y=-3. 

(d) 21. 
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151.3. Ch. 55 Answers (The Scalar Product) 


A229. wae (T)-( 5] = 1047 =29, % 


w-x=[ —~ |-[ 8 | =-3240= 39, a 
0 7 


Since the scalar product is commutative, w-v =—-8, x-v = 23, and x- w= -32. 
Since the scalar product is distributive, w:-(x+v) =w-x+Ww-v=-32-8 =—-40, 
Also, (2v)-x =2(v-x) = 2-23 = 46 and w- (2x) = (2x)-w=2(x-w) = 2(-32) = -64. 


A231. lal = |(-2,3)| = \/ (-2)7 + 32 = /(-2,3)- (2,3) = Vaca. 


Ib] = \(7,)| = V7?24+12 = VJ(7,1)-(7,1) =Vb-b. 
Ic] = |(5,-4)| = 52+ (-4)? = V(5,-4) - (5,-4) = Vb-b. 
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151.4. Ch. 56 Answers (The Angle Between Two Vectors) 


A232(a) The third side of the triangle corresponds to the vector u - v. 
(b) |ul, |v], and [uv 

2 2 2 ¥ u=¥ 
(c) ju-v| =|ul” + fv] - 2 |u| {v|cos@. 


(d) We'll actually use distributivity twice: a 


(u-—v)-(u-v) =(u-v)-u+(u-v)-(-v) 
=U-U-U-V—-U-'VtFtV-V=U-Ut+V-V—-2U:V. 


(c) lu—vi?=|ul2+\v2—2Iul|v|cos@ (By (c)) 
=> (u-v):-(u-v) =u-ut+v-v-—2|ul|v|cosé (By Fact 131) 
<— u-ut+v-v-2u:v=u-utv-v—2|ul|v\cosé (By (d)) 
— —2u-v = -2|ul|v| cosé 
— u-v = |u||v|cosé 
= §= cost —— 
ju [v| 


A233(a) The angle between u = (2,0) and v = (0, 17) is 


_, (2,0) - (0,17) 7 2-0+0-17 _ 1 
coc CO get egies ™ % 
I(2,0)||(0, 17)| S242 V0? +17? 2 
7 is right. u and v are perpendicular and point in different directions. 
(b) The angle between u = (5,0) and v = (-3,0) is 
(5,0) - (-3, 0) _ 83) e020) £3 


@=cos | ~~ 2 = Os => = cos" (-1) =7. t 
(5, )11(-3,0)] \/52+02,/(-3) +02 


7 is straight. u and v are parallel and point in exact opposite directions. 


(c) The angle between u = (1,0) and v = (1, v3) is 


6 = cos” 


1 (1,0) - (1, V3) a4 1-1+0- Rey : 1 1 
———_-__—_ = cos os =cos —=—. 
(2,0)|](, V3)| 12402) 12 4 JE On f+ (V3) 2 3 


7 is acute. u and v are neither parallel nor perpendicular and point in different directions. 
(d) The angle between u = (2,-3) and v = (1,2) is 


2 12 2-1 2 —4 
6 =cos"! (2,-8) «(1 ye os! + (=8) = cos)! ———— »w 2.0899. ae 


(2,-3.2) \/22 + (3° V2 V13-V5 


7 is obtuse. u and v are neither parallel nor perpendicular and point in different directions. 
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A234. 


A235. 


ju tv] =(ut+v)-(u+v) 


=uUu:'uTtTU'VtV'UTV:V 


=u-u+2u:v+v:-v 
= |ul’ +2u-v+|v/ 


= ful’ +0+Iv/ 


= ful’ + [vf 


< |u|’ + 2ful|v| + vp 


= (Jul + |vl)*. 


Fact 131) 


Distributivity) 


Fact 131 again) 


u-v =0 because u | v) 


jutv|> = ul’ +2u-v + |v} 


( 
( 
(Commutativity) 
( 
( 


(Cauchy’s Inequality) 


(Complete the square) 


We’ve just shown that ju + vl < (jul + Iv|)°. And so, taking square roots, we also have 


ju + v| < Jul + |v}. 


A236. 


(b) (4,2) 


(c) (-1,2) 











1 1 4 4 2 —] —] 
v-direction cosine | ———— = —— | ————— = —— = — | —_____ = __ 
V1? +3? 10 | V42+22 V20 5 (eee. 9) 
Ae eet : a 5) 2 2 1 2 2 
y-direction cosine | ——— = —— | ——— = —— = — | ——__— = — 
V1? + 3? 10 | V42 4 22 20 V5 [(-1y 4g Vd 
2 


Unit vector 
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151.5. Ch. 57 Answers (The Angle Between Two Lines) 

















wt =1,.1) (2,8) 1 |-5 1 9 
A237(a 1 (-1,1)-@,-8)| = zt = i ~ 0.197 % 
“ es ° ae aes 

_, |C1,5)- (8, 1)| 7 [13 24 
b cos! (1, : = cos) | ——~ = cos"! wx 1.249 % 
() 5G, DI °° Vases °° Vig 

_,12,6)-(3,2)] [18 , 9 
Cc eg E = cogs! ——— = cos! — x 0.661 % 
©) (2,618.2) °° vans °° Vaso 


151.6. Ch. 58 Answers (Vectors vs Scalars) 


(This chapter had no exercises.) 


151.7. Ch. 59 Answers (Projection Vectors) 


A238(a) Since (33,33) || (1,1), the projection of (1,0) on (33, 33) is equal to the projection 
of (1,0) on (1,1). Hence, 








(,0)eCisl)|_ 11407 2. 
@y il «= oo = 


(b) The length of the projection of (33,33) on (1,0) is 


[Proj 33,33) (i, 0)| = [Proja.1) (1,0)| = | 


lprojcs.o) (33, 33)| = |(33, 33) - 71, 0)| 33. % 





ae — 334+0_ 
f (G0, foot 


(c) We just showed that [Proj (33 33) (1,0)| # [Proj (1.9) (33,33)|, 


Therefore, the given statement is false. In general, the projection of a on b is not the same 
as the projection of b on a. 
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151.8. Ch. 60 Answers (Collinearity) 
A239(a) The unique line that contains both A = (3,1) and B = (1,6) is 
r= OA+XAB =(3,1)+X(-2,5) (AER). 


If this line also contains C’, then there exists \ such that 


ae 
o-({%)-(3)sa2) or 088-28 
-] 1 a -1=1+5,. 

From a we have \ = 1.5. But this contradicts 2. So, there is no solution to the above 


vector equation (or system of two equations), meaning our line does not contain C’. Thus, 
A, B, and C are not collinear. 


(b) The unique line that contains both A = (1,2) and B = (0,0) is 
r= OA+)\AB =(1,2)+X(-1,-2) (AER). 


If this line also contains C’, then there exists \ such that 


Observe that \ = —2 solves the above vector equation (or system of two equations). Hence, 
our line does indeed contain the point C’' (it corresponds to \ = —2). Thus, A, B, and C are 
collinear. 
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151.9. Ch. 61 Answers (The Vector Product) 


A240(a) a x (b +c) =Q] (by + co) — ag (b; +C1) = 01 b9 — and; + A14C29 — A2C{ =axbt+axc. 


(b) axb= ayb = ab, = (azby = ab2) = (by a2 = boa) 2=bp* a: 


(c) axa = aya - aga, = 0. 


A241. ax b= (1,-2) x (3,0) =1-0-(-2)-3=0+6=6. 
axc=(1,-2)x (4,1) =1-1-(-2)-4=1+8=9. 

bx c= (3,0) x (4,1) =3-1-0-4=3-0=3. 

By anti-commutativity, bx a=-axb=-6,cxa=-axc=-9,andcxb=bxc=-3. 
By distributivity, ax (b+c)=axbt+axc=6+9=15. 


A242(a) |al = \/a? + a3, |b] = \/b? + 03, Ja x b| = |ayb2 - agbi|, and 
_ a-b ab) + agb. 
lal|b| Ja? + a2, /8? + 62 
(b) If 0€ [0,7], then sin 6 > 0. 
(c) The identity is sin? 6+ cos? @ = 1 (Fact 71). Rearranging, 


sin 6 = +V1—cos?96. 
But by (b), sin@ > 0—so, we can discard the negative value. Hence, 


sin @ = V1—- cos? @. 





2 
ay 2 _ (abi + abo) 
(d) sin@ = V1-cos 1 CEE NCET A 


(e) (aj + a5) (by + b5) - (aby + ayb2)° 
= ajb{ + atb3 + aby + a3b3 — (afby + a3b5 + 2ayazb1b2) 


= = aib? a arb: = 2a1a2b1b»2 = (a,b2 = ayb,)” : 


2 2 (a,b; + agbo)° 
(f) |al |b] sin @ = \/a7 + a3\/b7 + b2 “(area 
(a2 + a2) (b? + 63) — (ayb, + aab2)° 


Oe Gheaty aeceneee: 





x 
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151.10. Ch. 62 Answers (The Foot of the Perpendicular) 


AGA. PA=A=P=(41,0)=0,5)=C3, 3); Pe = B= PSG) HS): 


(-3,3) - (5, 1) _715+3 5 6/5 
projyPA = projis 1) (-3, 3) = an a 3G J) Bla 


By Fact 150 then, the feet of the perpendiculars from A and B to the line are 


P +proj,PA = 2 ae : ae 7 oe ° ’ 
= 13\ 1 13 \ -9 13\ 1 
=> 1 1 
P+proj,PB = @ + ah eon ee oul 
26 13\ 1 ee Gee 13 \ -9 
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A244(a) Let P = (8,3) and v= (9,3). 
Method 1 (Formula Method). First compute PA-= (7,3) - (8,3) = (-1,0) and 


: , (-1,0)-(9,3) [ 9 -9+0/ 9 1/9 
proj,PA = projg 3) (-1, 0) = 92 4 32 SW eorim eect: x» 








So, B = P+ proj, PA = (8,3) —0.1(9,3) = (7.1,2.7). 
And the distance between A and I is 








— —_ —-1)-3-0O0- 1 /1 
[AB = |PAx@ -|(-1,0) x22, (|e acl >| = joe 
J92 + 32 /90 Js Vi0 1 


Method 2 (Perpendicular Method). Let B = (8,3) + (9,3). Then, 
AB = B- A= (8,3) +\(9,3) - (7,3) = (9 + 1,3). 
Since B is the foot of the perpendicular, we have AB 1 lor AB Lv or AB: (9,3) =0; 
0 = AB- (9,3) = (9\+1,3A) -(9,3) = 9(9\+ 1) +3(3A) = 90149. 
Rearranging, \ = —9/90 = -1/10 = -0.1 and so, 
B = (8,3) + X(9,3) = (8,3) — 0.1 (9,3) = (7.1, 2.7). 
And the distance between A and / is 


AB = |B — Al =|(7.1,2.7) - (7,3)| = |(0.1,-0.3)| = 0.1|(1,-3)| = 0.1V10. 


Method 3 (Calculus Method). Let R be a generic point on J. Then, 


AR=(9\+1,34) and —_—-|AR| = (94+ 1)? + (3A) = V90N? + 18d +2. 





d 
Differentiate: 7 (90 + 18+ 2) = 180A + 18. 
~ 18 i! 
FOC: 180A +18)}, - =0 \=-— = -—. 
_ TONE TNs ad is0 10 
~ 18 18 1 
Alternatively, we could simply have used “—b/2a”: A=- ee =- 


And now, we can find B and |AB| as we did in Method 2. 
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A244(b) Let P = (4,4) and v = (6,11) - (4,4) = (2,7). 
Method 1 (Formula Method). First compute PA-= (8,0) - (4,4) = (4,-4) and 


ee (4,-4)-(2,7)(2\_ 8-28/2\ 2/2 
projyPA=projan(—4)= “aa \ 7 i= sa \ 7] oaal7f * 





—> 2 1 
So, B= P+ py, PA-00= = (2,7) = = (172, 72) 


And the distance between A and | is 








—> (2,7) eee 36 
AB| = = |(4,-4) x 24] = |——__+_ =| = —_.. x 
| | ( )* 92 + 72 53 J53 


Method 2 (Perpendicular Method). Let B = (8,3) + (9,3). Then, 
AB = B- A= (4,4) +\(2,7) - (8,0) = (24-4, 7 +4). 
Since B is the foot of the perpendicular, we have AB 1 lor AB 1 vor AB: (2,7) 20: 
0 = AB- (2,7) = (2\-4,74+4)- (2,7) = 2(2\-4) +7 (7144) = 53A +20. 
Rearranging, \ = —20/53 and so, 
B=(4,4)+(2,7) = (4,4) - = (2, 7) = = (172, 72). 


And the distance between A and I is 


36/53 36 
53 (4/53 





[AB] = IB- A= = (1 7\= (8,0)| - 5 ( 252 72)|- = (-7,2)|= 


Method 3 (or the Calculus Method). Let R be a generic point on J. Then, 


AR=(2\-4,7\+4) and [AR| = (24-4) + (7A +4)” = V5? +400 +32. 


d 
Differentiate: aa (53) + 40 + 32) = 106A + 40. 
~ AO 20 
FOC: 106A + 40)|, . =0 A =-— =-—. 
ee = 10653 
; ; x AO 20 
Alternatively, we could simply have used “—b/2a”: A= a a 


And now, we can find B and |AB| as we did in Method 2. 
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A244(c) Let P = (8,4) and v = (5,6). 
Method 1 (Formula Method). First compute PA= (8,5) - (8,4) = (0,1) and 


=> (0,1)- (5,6) {5 0+6/5 6 [5 
projyPA = proj(s.¢ (0,1) = a ; er ; Fi 6 | ae 





ay 1 
So, B= Ps proj, PA=@:4)4 = (5,6) = — (518, 280). 


And the distance between A and | is 





-|(0.9 «9, ix 





x 


Method 2 (Perpendicular Method). Let B = (8,4) + (5,6). Then, 
AB = B- A= (8,4) +.\(5,6) - (8,5) = (5A,6\- 1). 
Since B is the foot of the perpendicular, we have AB 1 lor AB Lv or AB: (5,6) =0; 
0 = AB- (5,6) = (5\,6A-1)- (5,6) = 5 (5A) +6 (6\- 1) = 61-6. 
Rearranging, \ = 6/61 and so, 
B = (8,4) +A(5,6) = (8,4) + = (5,6) = = (518, 280). 


And the distance between A and I is 


61 
[AB] = |B - Al = = (518, 280) - (8,5)|- mK (30, -25)| = (6 -5)| = = 


Method 3 (or the Calculus Method). Let R be a generic point on J. Then, 


AR=(5\,6\-1) and _—_—-|AR| = (5A) + (6A- 1)? = VOL? + 12d + 1. 


d 
Differentiate: (61\? + 12A +1) = 122\ +12. 
~ 12 6 
FOC: 122 4+12.)|, .= A =-—— =-=—. 
OC (122 + 12.) Lae 0 or ae TI 
; ; x 12 6 
Alternatively, we could simply have used “—b/2a”: A= Ns aay 


And now, we can find B and |AB| as we did in Method 2. 
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151.11. Ch. 63 Answers (Three-Dimensional Space) 


(This chapter had no exercises.) 


151.12. Ch. 64 Answers (Vectors in 3D) 


A245(a) AB = (0-2,1-5,1-8) = (-2,-4,-7). 
(b) OA = (2,5,8) and OB = (0, 1,1). (c) AA = 0= (0,0,0). 


A246. AB = (-2,-4,-7). So, |AB| = |(-2,-4,-7)] = y (-2)” + (-4)” + (-7) = V69. 
A247(a) A+ B is undefined. 

(b) A-B is the vector BA = (1-(-1),2-0,3-7) = (2,2,-4). 

(c) B+C is undefined. Hence, A+ (B+ C) is also undefined. 

(d) B-C is the vector CB = (-1-5,0-(-2) .7~3) = (-6,2,4). And so, A+(B-C) isa 
well-defined vector, namely A+ (B-C) =A+CB = (1,2,3) + (-6,2,4) = (-5,4,7). 
A248(a) u+ v= (1,2,3) + (-1,0,7) = (0,2, 10). 

(b) u—v =(1,2,3) - (-1,0,7) = (2,2, -4). 

(c) u+ (v+w) =u+v+w = (0,2, 10) + (5, -2,3) = (5,0, 13). 

(d) u+(v-w) =u+v-w = (0,2, 10) — (5, -2,3) = (-5,4,7). 

A249. AB = B- A= (3,6,-5) - (5,-1,0) = (-2,7,-5). 

AC =@=A=12,2,3)=(5,41,0) =(4;3;9). 

BO =O 002 9-6 eC 

AB = AC S( 957.5) 3-95 9) 04 2) = =8 C= 0e 

AB + BC = (-2,7,-5) + (-1,-4,8) = (-3,3,3) = AC. / 


A250(a) v and w point in the exact opposite directions because v = -1.5w. They are thus 
also parallel—v || w. 


(b) v and x point in different directions because x # kv for all k ¢ R.They are thus also 
non-parallel—v 4 x. 


(c) w and x point in different directions because x # kw for all k ¢ R.They are thus also 
non-parallel—w + x. 


(d) By our definitions (which covered only non-zero vectors), the zero vector 0 does not 
point in the same, exact opposite, or different direction as any other vector (including, in 
particular, u). 


Also, it is neither parallel nor non-parallel to any other vector (including, in particular, u). 
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ie 
A251(a) [al = |(1,2,3)| = V1? +2? +3? = V14. And 4 = re 





a 
(b) |b| = |(4,5,6)| = V42 + 52 +62 = V77. And b= ( a 


(c) |a- b] = |(-3, -3, -3)| = / (-3)? + (-3)? + (-3)? = V27 = 3V3. And, 


ee ($3, -3)> Ld d 
aa 3, —3, 3)_ (LY 











3/3 J3 
Le 12,3) 
d) |2a| = 2 i tains : 
(d) [2a| = 2V a 
we x (4,56) 
e) [3b] =3 77. And 3b=b = {2 : 
(e) [3b] = 3V Jt 


HitG-bieie mise 5-52 
J3 
A252. v =(9,0,-1) = 9i-k and w = (-7,3,5) = -7i+ 3j + 5k. 


A253. By the Ratio Theorem, the point P that divides the line segment AB in the ratio 
2:3 has position vector: 


11 
pat+Ab 3(1,2,3)+2(4,5,6) 1 
SS Oe 16 
A+ LL 24+3 i) 
21 


i 
Hence, the point is P = (11,16,21), 


A254(a) Informally, a vector is an “arrow” with two properties: direction and length. 


(b) A point and a vector are entirely different objects and should not be confused. Nonethe- 
less, each can be described by an ordered triple of real numbers. 


(c) Let A = (a),a2,a3) be a point and a = (a1, da2,a3) be a vector. We say that a is A’s 
position vector. 


(d) The vector a = (a1, a2,a@3) carries us from the origin to the point A = (a1, a2, a3). 
ns “1 
A255. OA=A-O = (a1, 42,03) =a=] ay |=a1i+ajt+ask= 7d. 
a3 
A256. A+B is undefined. 
A+ OB is the point (a, +b), a2 + b2, a3 + bs). 
OA + OB is the vector (a1 + bi, a2 + b2, a3 + bs). 
OA-OB is the vector BA = (a1 — by, a2 — be, a3 — 3). 
OA- BA is the vector OB = (by, bo, 3). 
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151.13. Ch. 65 Answers (The Scalar Product in 3D) 


A257. (1,2,3)-(4,5,6) =4+ 10+ 18 = 32. % 
(DAB) O20 3) = 62 a8. % 
A258(a) The angle between the vectors (1,2,3) and (4,5,6) is 

if (1,2,3)"(45,6) 


a2 
a4 
cos» —————_—_ = cos > ——— = 8 0.226. % 
|(1, 2,3)||(4, 5, 6)| J14/77 


Thus, the vectors (1, 2,3) and (4,5,6) are neither parallel nor perpendicular. Instead, they 
point in different directions. 


(b) The angle between the vectors (—2,4,-6) and (1, -2,3) is 


—2.4 -6)-(1,-2,3 —28 —28 —28 
cos’! (a2 heb) Ae) = ¢os >= = cos + — = cos | = 7. 5S 


I(-2, 4, 6)||(1, -2, 3)| V56V14 28 28 


Thus, the vectors (-2,4,-6) and (1,-2,3) are parallel (and point in exact opposite direc- 
tions). Actually, we could also have arrived at this conclusion by observing that u = -2v. 


A259(a) The vector (1,3,-2) has length v/ 12 + 32 + (-2)” = V14 
Hence, its unit vector is (1,3, -2) /V14. 
Its x-, y-, and z-direction cosines are 1/V14, 3/V14, and -2/V14. 


And the angles it makes with the positive x-, y-, and z-axes are 











1 3 —2 
=] -1 =A 
cos # 1.300, cos x 0.641, and cos x 2.135. 
V14 V14 V14 
(b) The vector (4,2,-3) has length \/ 42 +22 + (-3)? = 29 
Hence, its unit vector is (4, 2,-3) /V29. 
Its x-, y-, and z-direction cosines are 4/V29, 2/29, and -3/V29. 


And the angles it makes with the positive x-, y-, and z-axes are 


4 3 
cos} x 0.734, cos} ~ 1.190, and cos? x 2.162. 
V 29 V 29 


(c) The vector (-1,2,-4) has length \/ (-1)" +22 + (-4)* = V21. 
Hence, its unit vector is (-1,2,-4) /V 21. 
Its x-, y-, and z-direction cosines are -1/V21, 2/V21, and -4/V21. 


And the angles it makes with the positive x-, y-, and z-axes are 











—] A —4 
ai =i aff 
cos x 1.791, cos # 1.119, and cos 
J 21 V21 V21 











x 2.362. 
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151.14. Ch. 66 Answers (The Proj. and Rej. Vectors in 3D) 





. x - (25,21) Ci) 2+5-4 1 
A260. Plojpa = (a:b) b= 43, pep (11,4) = 18 (1,14) = g (11,4). aS 


1 1 
rejpa = a— projpa = (2,5,-1) - 5 Cl LA) = 5 (11, 29, -10). 


=2 
2.0.=1)2( 2.2, 1 
(-2)" + (-2)" +1? 
_-4-10-1 7 15 i) 3 
oe os =I a) = — & 
5 9 9 3 
rej,a =a — proj,a = (2,5,-1) - > (2,2,-N) 4 (-4,5,2). 1 1 
(rej,a)- b =~ (11,29, -10) -(1,1,4) = ¢ (11 +29 -40) =0. M4 
1 1 
(rejea) -¢ = = (-4,5,2)-(-2,-2,1) = 5 (8- 10+2)=0. ¥ 
4 4A 4 
oe ~\e _ (1,2,3)-(4,5,6) _4+10+18 _ 32 
EN eee el ae la 


_ 32 32 
roj,a| = —V77 = ——. 


3 
rejpa = a — projpa = (1, 2,3) - = (4, 5,6) = = (- 51,-6,39) =- ma (172,=13), 
rej,al = es [177 + 2? + (-13)"] = > /462 = 3, 77 = 3) Ae 
se 77 77 ia 
32 
proj,a = 77 b and so projya || b. Also, proj,a points in the same direction as b. 


us — (17,2,-13)- (4,5,6) = -= [17 4+2-5+(-13)-6]=0 and so rejpa 1 b. 


ia)» be= 
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151.15. Ch. 67 Answers (Lines in 3D) 


il =a ae 
A262(a) — a ae 7 This line is not perpendicular to any of the axes. 
T=—o Y=0. 2-1 
7 8 1 
e 2-1 ae eee ; ; 
(c) o—-4 and y = -3. This line is perpendicular to the y-axis. 











. This line is not perpendicular to any of the axes. 





(d) y=z=9. This line is perpendicular to the y- and z-axes. 


(e) 7 = 5 = > This line is not perpendicular to any of the axes. 


(f) c =1 and z=5. This line is perpendicular to the x- and z-axes. 


x-2/7 y-50/3 z 

5/7 ~~ 70/3—~S—«7/8 

So, this line may be described by r = (2/7, 59/3, 0) + A (4/7, 7/3, 7/8) (A€R). 
It is not perpendicular to any of the axes. 

(b) Rewrite the given equations: 1D = Te = Te 
So, this line may be described by r = (0,0,0) + A (7/2, 1/3, 1/5) (A€R). 

It is not perpendicular to any of the axes. 

e-4/17 y-1/3 z 

Vig =. ys 

So, this line may be described by r = (4/17, 1/3,0) + A (1/17, 2/3, 1/3) (A€R). 


It is not perpendicular to any of the axes. 








A263(a) Rewrite the given equations: 








(c) Rewrite the given equations: 








11 -—3 — 2/5 
(d) Rewrite the given equations: y = - and — an = x 


So, this line may be described by r = (3, 11/3, 2/5) + A (2,0, 7/5) (A€R). 
It is perpendicular to the y-axis. 


(e) The free variable is y. So, this line may be described by 
r = (65,0,1/2)+A(0,1,0) (eR). 


It is perpendicular to the x- and z-axes. 


5 =(=2 
(f) Rewrite the given equations: x = 5 and 2 ( ) 7 


So, this line may be described by r = (-=. -2,0] +A(0,5,1) (A¢R). 





It is perpendicular to the x-axis. 
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A264(a) If the two lines intersect, then there are real numbers \ and fi numbers such that 


A241, 1-423, and 14+A423+2f. 


= immediately contradicts 2. Hence, the two lines do not intersect and are not identical. 


The angle between them is ae 


1,-1,1)-(0,0,2 0+0+2 
eas ’ ’ ) ( gee I. os! 0+ 042 TUT | = cos! 


2 
(4, -1, 1)]|(0,0,2)| 4 /124(- i 412 02 + 02 + 22 J3-2 


The two lines are neither parallel nor perpendicular. And since they do not intersect 
either, they are skew. 





(b) If the two lines intersect, then there are real numbers \ and fi numbers such that 
-128f, 2+\2-3f, and 3252. 


z immediately contradicts 3. Hence, the two lines do not intersect and are not identical. 


The angle between them is % 


—1,0)-(8,-3,5 0+3+0 3 
re tae ede i eae = cos” "ca 


|(0, -1, 0)||(8, -3, 5)| V0? + 1? + 02\/8? + (-3)° +5? 


The two lines are neither parallel nor perpendicular. And since they do not intersect 
either, they are skew. 


(c) If the two lines intersect, then there are real numbers \ and ji numbers such that 
7+8\294+3f8, 34+3\23-4f, and 4+4\27-36. 


+ plus 2 yields 11+12\ = 16 or \= 5/12. And now from 3 we have ji = 4/9. But these values 
of \ and fi contradict 2 Hence, the two lines do not intersect and are not identical. 


The angle between them is % 
11(8,3,4)-(3,-4,-3)]_ (24-12-19 ip 
cos © ———_———_ = cos © ——____————_ = cos 0 = =. 
(8, 3, 4)| |(3, -4, -3)| (8, 3, 4)||(3, -4, -3)| 2 


The two lines are perpendicular (and hence not parallel). And since they do not intersect 
either, they are skew. 


(d) The two lines have parallel direction vectors because (-3, -6,-3) = -3(1,2,1). Hence, 
the two lines are parallel (and thus neither perpendicular nor skew). Since they are parallel, 
the angle between them is zero. 


The first line does not contain the point (1,0,0)—the only point on the first line with 
x-coordinate 1 is (1,2,3) (plug in A = 1). Hence, the two lines are not identical. Since the 
two lines are parallel and distinct, they do not intersect at all. 
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A265(a) The unique line that contains both A and B is 
—>-. — > 3 
r=OA+AAB=] 1 |+Al 5 (A€R). 


If the above line also contains C’, then there exists \ such that 


0=3-2i, 

0 3 2 
C=| -1 l=] 1 |+Al 5 I. or 1 248A. 
0 y) 3 : 
(2948). 


From Z we have \ = 1.5. But this contradicts 2. This contradiction means that there is no 
solution to the above vector equation (or system of three equations). 


Thus, our line does not contain C’. We conclude that A, B, and C are not collinear. 


(b) The unique line that contains both A and B is 


1 =| 
r=] 2 [+A] -2 (A €R). 
4 my 


If the above line also contains C’, then there exists \ such that 


oe 8 
3 1 = 

C=! 6 =| 2 1+Al 2 Lb or 622-21, 
10 4 = 

10 24-3) 


As you can verify, \ = —2 solves the above vector equation (or system of three equations). 


Thus, our line contains C’. We conclude that A, B, and C are collinear. 
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151.16. Ch. 68 Answers (The Vector Product in 3D) 


A266(a) Given u = (0,1,2) and v = (3,4,5), we have ae 
0 3 1x5-2x4 —3 
uxv=| 11x! 4 f=] 2x3-0x5 |=] 6 
2 OD 0x4-1x3 —3 


We next verify that ux v 1 u,v: 


(ux v)-u = (-3,6,-3)-(0,1,2)= 0+6-6 =0, Vv 
(uxv)-v = (-3,6,-3)-(3,4,5) = -9+24-15=0. Vv 


(b) Given w = (-1,-2,-3) and x = (1,0,5), we have ae 
-1 1 (-2) x 5-(-3) x0 —10 
wxx=] -2 |x] 0 |=] (-3)x1-(-1)x5 |=] 2 
—3 5 (-1) x 0-(-2)x1 2 


We next verify that w x x L w,x: 


(w x x)-w = (-10,2,2)-(-1,-2,-3) = 10-4-6 =0, Vv 


(wxx)-x = (-10,2,2)-(1,0,5) =-10+0+10=0. v 
ab3 — azbo ay 
(c) (axb)-a=] agbi-aibs |-| ae 
aybe — ab, a3 


= (agb3 = a3b2) ay + (a3by = a1b3) ag + (a,b2 = ab, ) ag 
= a a9b3 = ay boaz + byagaz = a) a9b3 = a, b2a3 = bya2a3 =(, 


agb3 = a3b5 by 
(axb)- b=] azbj-ayb3 |-] be 
a bo = ab, b3 


= (ayb3 — a3b2) by + (a3b1 — a1b3) by + (ab2 — agb) bg 
= by a2b3 = bi b.a3 a bi b2a3 = a, bob; F a, bob; = bab; =, 
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ay by + Cy ag (b3 + ¢3) — a3 (b2 + C9) 


A267(a) = ax (b+e) =] ay [x] bo +co [=] a3 (bi +c1) — a4 (b3 +03) 


a3 b3 + C3 ay (by + C2) — dz (by + €1) 


aob3 = a3b»5 + G2C3 — A3C9 


a3b1 a a, b3 + A3C1 — a,C3 
aybo = ab; + Q1C2 — A2C{ 


azb3 — agbo a9C3 — A3C2 
=| asb, -—a,b3 | +] agcy-—aicg J=axbtraxc. 
aby — agby a14C2 — A2Cy 


(b) In Example 907, we already showed that (1,2,3) x (4,5,6) = (-3, 6, -3). 


4 1 3°5-2°6 3 
We now have 5 |x =| 1.6-3-4 |=] -6 
6 a 2-4-1-5 3 
4 1 1 4 
So that indeed: 5 |x =-| 21x] 5 
6 3 a 6 


To prove that a x b =—-bxa, simply write them out: 


agb3 = a3b5 boas = ba 
axb= a3b1 — ayb3 and bxa= b3a, — by a3 
a 1b - agb, by d2 — bga4 


(c) (a1, a2, a3) x (a1, a2, a3) = (a2a3 — agag, a3a1 — 4143, 2142 — dea) = (0,0,0) = 0. 
(d) Let a = (a1, @2,a3) and b = (by, bg, 63). Then da = (day, day, daz) and 


dagb3 — da3bo a9b3 — a3b5 

(da) x b=] dazb,-dayb3 |=4| a3b, -a1b3 | =d(axb), 
dayby — dab; a b5 — aoby 
aodbs = ag3dby aob3 = a3bo 

ax (db) =| agzdb; —aidbs; |= 4] ab) -aib3 | =d(axb). 
a dbo = apdb; ab = ab, 
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A268(a) jal = \/a? + a2 + a, |b] = \/b7 + b5 + b3, 


lax bl 





(agb3 = agby)” + (a3b, = oe + (a by = ayb,)°, 


a-b a1b4 + agby =F a3b3 


lal {b| Jae + a2 + a2, /0? + 02 + Be 


cos @ = 


(b) Since 6 € [0,7], it must be that sin@ is non-negative, i.e. sin > 0. 


(c) The identity is sin?@+cos?@ = 1. Rearranging, we have sin@ = +V1-cos?6. Since 
sin @ > 0, we can discard the negative value. Altogether, 


sin = V1—-cos? @. 


2 
(4) sind= VT reo =a]1-( 2B) _,| 1 (arbi tarda + abs)” _ 
|a| || (a? + a3 + a2) (b7 + b3 + B32) 


(e) As per the hint, fully expand each of LHS and RHS: 


LHS = (a2 + a2 +02) (b? +62 + b2) — (ayb) + aabe + agb3)” 
3 3 


= ath? + a2b3 + a2b2 + a3b? + a2b3 + a2b3 + abt + a3bs + a3be 


— (abi + a3b5 + a33 + 2ay.a2b1by + 2aja3b1b3 + 2aza3b2b3) 
= athe + azb3 + aah? + a5b3 + a3b? + a2b5 -— (2ayagb,b + 2a ,a3b1b3 + 2aza3b2b3). 
RHS = (a9b3 — agb2)” + (a3b, — a1b3)” + (ayb2 — ab)” 
= Sane + azbs — 2a9a3b9b3 + abs + aD; — 2a)a3b1b3 + azb? + azbe - 241491 bo. 


“Clearly”, LHS = RHS. 


_ (a,b; + abe + a3b3)” 
(a? + a3 + a2) (b7 + b3 +b) 





(f) |a| |b| sin 0 = \/(a? + a3 + a3) (b7 + b3 + 63) 


(a2 + a2 + a2) (b2 + 62 + b2) — (ab, + agby + 4363)” 





(agb3 = agb)° +e (a3b, = ayb3)” =F (a,bo = ayb,)” = la x bl. 
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151.17. Ch. 69 Answers: The Distance Between a Point and a Line 


A269(a) Let P = (8,3,4) and v = (9,3,7). Let B be the foot of the perpendicular from A 
tol. Method 1 (Formula Method). First, PA = (7,3, 4) - (8,3,4) = (-1,0,0). So, 








——> — 9 
PB = proj, PA = proj —-1,0,0) = -—— (9,3,7). % 
By Fact 150: B =P + proj,PA = (8,3,4) - = (9,3, 7) = — = (1 031, 390, 493). 
And by Corollary 28, the distance between A and I is 
[AB| - = (-1,0,0) x eS |= % 
J92 4324+ 72| 139 





Method 2 (Perpendicular Method). Let B = (8,3,4) + (9,3,7). Write down AB: 
AB = B- A= (8,3,4) +\(9,3,7) - (7,3,4) = (QA +1,3A,7). 
Since AB 1 1, we have AB iv or, 
0 = AB- (9,3,7) = (9A +1,3A, 7A) - (9,3, 7) = 9 (9A +1) +3(3A) +7(7A) = 139149. 
Rearranging, \ = —9/139. 


And now, B = (8,3,4) - a 


O35 0 39 (1.081, 390, 493 
a5 (93,7) = 35 ( ). 


Lovely—this is the same as what we found in Method 1. And now, the distance between 


A and I is 
[AB] = y/ (9\+1)° + (3A) + (7A) = V139)2 + 181 +1 = 58/139. 


Method 3 (or the Calculus Method). Let R be a generic point on 1, so that AR = 
(9\ + 1,3A,7A +4) and the distance between A and R is 


|AR| = 1392 + 18A +1. 


+ (139A? + 18 + 1) = 278 + 18. 


~ 18 9 
F ° 2 1 —— = -—-— = —-—, 

OC (278A +18)|,;=0 or A=-3e=-7e 
Lovely—this is also what we found in Method 2. We can now find B and AB (omitted). 
Alternatively, we could simply have used “—b/2a”: \ = -18/ (2-139) = -9/139. 
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A269(b) Let P = (4,4,3) and v = (6,11, 5) - (4,4,3) = (2,7,2). Let B be the foot of the 
perpendicular from A to lI. 


Method 1 (Formula Method). First, PA= (8,0,2) - (4,4,3) = (4,-4,-1). So, 


= — Ze 
PB= projyPA = PLOj(2,7,2) (4, 4, =) 7 3, (2, t, 2) : at 


= 29 1 
By Fact 150: B= P + projyPA = (4,4,3) — = (2,7,2) = = (184, 74, 127). 


And by Corollary 28, the distance between A and I is 








a a 79 |.=10,36 1397 
aa = [PA xe] = [4,2 x 2E2)_| _ |t-10.98))_ 1g 
22 4.72 +22 J57 57 


Method 2 (Perpendicular Method). Let B = (4,4,3) +(2,7,2). Write down AB: 
AB = B- A= (4,4,3) + \(2,7,2) - (8,0,2) = (2\-4,71+4,2 +41). 
Since AB 1 1, we have AB Lv or, 
0 = AB-(2,7,2) = (2\- 4,7) +4,2\ +1) - (2,7,2) = 570 +22. 
Rearranging, \ = —22/57. 
Ze (1 
And now, B= (4,4,3) - = (2,7,2) = = (184, 74, 127). 


Lovely—this is also what we found in Method 1. And now, the distance between A and | 
is 


|AB| = (21-4) +(Th+4)’ + (2i+1)" = Vo7¥ + 44h +38=y/ 


Method 3 (or the Calculus Method). Let R be a generic point on 1, so that AR = 
(2\ -4,7A + 4,2 +1) and the distance between A and R is 


|AR| = V57\2 + 44) +33. 
d 2 
oo (57A? + 44) + 33) = 114) + 44. 
FOC: (114\+44)|,,=0 or \=-——=-=. 


Lovely—this is also what we found in Method 2. We can now find B and AB (omitted). 
Alternatively, we could simply have used “—b/2a”: \ = —44/ (2-57) = -22/57. 
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A269(c) Let P = (8,4,5) and v = (5,6,0). Let B be the foot of the perpendicular from A 
tol. Method 1 (Formula Method). First, PA = (8,5,9) - (8,4,5) = (0,1,4). So, 


—> —> 6 
PB= proj,PA = PTOJ(5,6,0) (0, lis 4) 7 61 (5, 6; 0) ‘ 


— 1 
By Fact 150: B= P+ proj,PA =(8,4,5) + = (5,6,0) = — (518, 280, 305). 


And by Corollary 28, the distance between A and I is 














[AB| = |PAx@ = |(0,1,4) x (5,6,0) | _ |(-24,20,-5)| _ /1001 ‘ 
V5? + 62 +02 J/61 61 


Method 2 (Perpendicular Method). Let B = (8,4,5) + \(5,6,0). Write down AB: 
AB = B- A=(8,4,5) +\(5,6,0) - (8, 5,9) = (5A, 6A — 1, -4). 
Since AB 1 1, we have AB Lv or, 
0 = AB-(5,6,0) = (5\,6A - 1,-4) - (5, 6,0) =5 (5A) +6 (6\-1) + 0(-4) = G1\-6. 
Rearranging, \ = 6/61. 
6 i 
And now, B= (8,4,5) + = (5,6,0) = = (518, 280, 305). 


Lovely—this is also what we found in Method 1. And now, the distance between A and | 


1S 
[AB] = \/ (5A) + (6A-1) + (-4)” = VOIX = 12 +17 = y j= 


Method 3 (or the Calculus Method). Let R be a generic point on 1, so that AR = 
(5A, 6A - 1,-4) and the distance between A and R is 


[AR 6/610) ur 


* (61\? - 12\ + 17) = 122A - 12. 


~ 12 6 
A= = 


FOC: (122\-12)|_,;=0 or = ai 


Lovely—this is also what we found in Method 2. We can now find B and AB (omitted). 
Alternatively, we could simply have used “—b/2a”: \ = — (-12) / (2-61) = 6/61. 
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151.18. Ch. 70 Answers (Planes: Introduction) 


(b) (0,2,0) and (0, 2,1). 
(c) (0,0,3) and (0, 1,3). 
(d) (0,2,3) and (1, 2,3). 
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151.19. Ch. 71 Answers (Planes: Formally Defined in Vector Form) 


A271. B is on the plane q, but A and C are not: 


OAS = 64 j= 5213S Soe x 
OB (5A) = 0,6) C58) = Sais 2 Sok r 
OC 457 3) C83 0 aa =. 1 x 


A272. No, ! contains the point (7,3,1), which isn’t on q because (7,3, 1)-(4,-3,2) # -10. 
A273. a = (2,-2,2) and c= (-v2, V2, -V2) are parallel to (1,-1,1), while b = (2,2, -2) 
is not. Hence, a and c are normal to q, but not b. 


AWA(s) AB = 82 A=(23, 030419) = (2 44) and AC = 0 = A= 
(1,-1,2) = (-1,0,-1). And so, a normal vector of q is 


n = AB x AC = (3,4, -2) x (-1,0,-1) = (-4,-1,4) a 


(b) n-OA = (—4,-1,4)-(1,-1,2) = -4+1+8 = 5. So, q may be described by r-(-—4,-1, 4) =5. 
(c) Another normal vector of g is m = 2n = 2 (-4,-1,4) = (-8, -2,8) 
(d) Hence, the plane g may also be described by r- (—8, -28) = 10. 


A275. Each of the plane q and the line / is a set of points. So, in particular, / is not a 
point. Similarly, the vector v is not a point. 


Hence, each of (a), (d), (e), and (h) (ii) is only ever strictly speaking incorrect, but 
nonetheless written by us anyway (because we’re sloppy and lazy). 


In contrast, each of (b), (c), (f), and (g) (ii) could be perfectly correct. 


A276. Only b is perpendicular to q’s normal vector (see below). Hence, only b is on q. 


a-(8,-2,1) = (3,7,-5)-(8,-2,1) = 24-14-5= 5 x 
b-(8,-2,1) = (1,6,4)-(8,-2,1) = 8-12+4 = 0 "4 
c-(8,-2,1) = (3,10,1)-(8,-2,1) = 24-204+1= 5 x 


A277. Method 1. First find B = A+ AB = (1,4,-1) + (7,3,-2) = (8,7,-3). Then show 
that B does not satisfy the plane’s vector equation and is thus not on q: 


OB +(7,-1,3)=(8,7,=8)«(7,=1,3) =56=-7-9419: * 
Method 2. Simply check if AB 1 (7,-1,3): 
ABs 49) = 3.2) AS39620, £ 


So no, AB / (7,-1,3). Hence, by Fact 163, B ¢ q. 
A278. Since n { a,b, we know for sure that n cannot be a normal vector of q. 


It is true that m 1 a,b. However, observe that a || b and so Corollary 31 does not apply. 
That is, we are unable to conclude that m 1 g. ‘The answer is thus, “m may be a normal 
vector of g, but we don’t know for sure.” 
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151.20. Ch. 72 Answers (Planes in Cartesian Form) 


A279(a) «+ 2y+3z=17 doesn’t contain the origin. 
(b) -x - 2z =0 contains the origin. 


(c) -2y + 5z =-3 doesn’t contain the origin. 


A280(a) Rewrite +5 =17y+z as ©-17y-z=-5. Reading off, the plane may also be 
described by r-(1,-17,-1) = -15 and doesn’t contain the origin. 


(b) Rewrite y+1=0 as Ox +y+0z=-1. Reading off, the plane may also be described by 
r-(0,1,0) =—1 and doesn’t contain the origin. 


(c) Rewrite r+ z=y-2asxu-yt+z=-2. Reading off, the plane may also be described by 
r-(1,-1,1) =-2 and doesn’t contain the origin. 


A281(a) The plane described by r-(0,0,1) = 32 or z = 32 contains the points (0,0, 32), 
(1, 0,32), and (0,1, 32). It does not contain the points (1,0,0), (0,1,0), or (0,0, 1). 


(b) The plane described by r- (5,3, 1) = -2 or 54+ 3y+z = -2 contains the points (-1,1,0), 
(0,0,-2), and (0,-1,1). It does not contain the points (1,0,0), (0,1,0), or (0,0, 1). 


(c) The plane described by r- (1,-2,3) = 0 or x - 2y + 3z = 0 contains the points (0,0,0), 
(2,1,0), and (-3,0,1). It does not contain the points (1,0,0), (0,1,0), or (0,0, 1). 


A282(a) (2, 1,0), (3,0,-1), and (0,3,2). (ce) (4,0,-1), (4,1,-1), and (4,2, -1). 
(b) (3,-5,0), (1,0,-5), and (0,1,-3). (d) —(1,0,0), (0,0,1), and (1,0,1) 
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151.21. Ch. 73 Answers (Planes in Parametric Form) 


A283(a) (i) x+2y+3z=4. 

(ii) Since u- (1, 2,3) = (2,-1,0)- (1, 2,3) =0 and v- (1, 2,3) = (-3,0,1)- (1, 2,3) =0, both 
u and v are on the plane. Also, u ++ v because they aren’t scalar multiples of each other. 
(iii) Since w- (1, 2,3) = (-1,-1,1)- (1, 2,3) =0, the vector w is on the plane. 


The vectors u and v are on the same plane and u |} v. Hence, by Theorem 19, we should 
be able to write w as the LC of u and v, as indeed we can: 


w = (-1,-1,1) = (2,-1,0) + (-3,0,1) =u+v. 


(iv) Since (1,1,1)-(1,2,3) # 0, we have (1,1,1) { (1,2,3) and so (1,1,1) is not on the 
plane. Hence, by Theorem 19, it cannot be written as a LC of u and v. 


(b) (i) T= 2 = 0. 


(ii) Since u-(1,0,-1) = (0,1,0)-(1,0,-1) =0 and v-(1,0,-1) = (1,0,1)-(1,0,-1) =0, both 
u and v are on the plane. Also, u | v because they aren’t scalar multiples of each other. 


(iii) Since w- (1,0,-1) = (1,-1,1)-(1,0,-1) =0, the vector w is on the plane. 
The vectors u and v are on the same plane and u + v. Hence, by Theorem 19, we should 
be able to write w as the LC of u and v, as indeed we can: 


w =(1,-1,1) = (1,0,1) - (0,1,0) =v-u. 


(iv) Since (1,1, 1)-(1,0,-1) = 0, we have (1,1,1) 1 (1,0,-1) and so (1, 1,1) is on the plane. 
And hence, by Theorem 19, it can be written as a LC of u and v: 


(1,1,1) = (1,0,1)+(0,1,0) =utv. 


(c) (i) Or+yt+z2=-5. 


(ii) Since u- (9, 1,1) = (0,1,-1)- (9,1, 1) =0 and v- (1,0,-1) = (-1,9,0)-(9,1,1) =0, both 
u and v are on the plane. Also, u + v because they aren’t scalar multiples of each other. 


(iii) Since w- (9, 1,1) = (-1, 10, -1) - (9, 1,1) = 0, the vector w is on the plane. 


The vectors u and v are on the same plane and u | v. Hence, by Theorem 19, awe should 
be able to write w as the LC of u and v, as indeed we can: 


w = (-1,10,-1) = (0,1,-1) + (-1,9,0) =u+v. 


(iv) Since (1,1,1)-(9,1,1) # 0, we have (1,1,1) { (9,1,1) and so (1,1,1) is not on the 
plane. Hence, by Theorem 19, it cannot be written as a LC of u and v. 
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A284(a) The plane r- (-1,2,5) =5 has cartesian equation: -x + 2y+5z=5. 


It contains the point (—5,0,0) and the non-parallel vectors (2,1,0) and (5,0,1). And so, 
it may be described by the following parametric equation: 


r = (-5,0,0) + A(2,1,0) + w(5,0,1) = (-5 + 2d + 5p, A, pw) (A,u€R). 


(b) The plane r- (0,0, 1) =0 has cartesian equation: z = 0. 


It contains the point (0,0,0) and the non-parallel vectors (1,0,0) and (0,1,0). And so, it 
may be described by the following parametric equation: 


r= (0,0,0)+A(1,0,0) + #(0,1,0)=(,4,0)  O,ueR). 


(c) The plane r- (1,-3,5) = -2 has cartesian equation: x - 3y + 5z = -2. 


It contains the point (—2,0,0) and the non-parallel vectors (3,1,0) and (0,5,3). And so, 
it may be described by the following parametric equation: 


r = (-2,0,0) + A(3,1,0) + w(0,5,3) = (-2+ 3A, A + 5p, 3p) (A, ue€R). 


A285(a) This plane contains the vectors (4,5,6) and (7,8,9). And so, a normal vector is 
(4, 5,6) x (7,8,9) = (-3,6, 3). a 


Since (-3,6,-3) || (1,-2,1), a normal vector of the plane is (1,-2, 1). 


The plane contains the point (1,2,3). Since (1,2,3)-(1,-2,1) = 1-4+3 =0, the plane may 
be described in vector and cartesian forms by r- (1,-2,1) =0 and x-2y+z=0. 


(b) First, rewrite the given parametric equation as: 
r= (\-,4+ 5,0) = (0,5,0) +A (1, 4,0) + w (-1, 0, 0) (A,u€R). 
Thus, this plane contains the vectors (1,4,0) and (-1,0,0). And so, a normal vector is 
(1,4,0) x (-1,0,0) = (0,0, 4). % 


Since (0,0,4) || (0,0,1), a normal vector of the plane is (0,0, 1). 


The plane contains the point (0,5,0). Since (0,5,0)-(0,0,1) =0+0+0=0, the plane may 
be described in vector and cartesian forms by r-(0,0,1) =0 and z =0. 


(c) First, rewrite the given parametric equation as: 
r=(1+y,1+A,A\+y) =(1,1,0) +A(0,1,1) + (1,01, 1) (A, we R). 
Thus, this plane contains the vectors (0,1,1) and (1,0,1). And so, a normal vector is 
(0,1, 1) x (1,0, 1) = (1,1,-1). % 


The plane contains the point (1,1,0). Since (1,1,0)-(1,1,-1) = 1+1+0 = 2, the plane may 
be described in vector and cartesian forms by r-(1,1,-1) =2 and x+y-z=2. 
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A286(a) The plane that contains the points (7,3,4), (8,3,4), and (9,3,7) also contains 
the vectors (8, 3,4) — (7, 3,4) = (1,0,0) and (9,3, 7) - (7,3, 4) = (2,0,3). 


Since (1,0,0) + (2,0,3) , the plane may be described in parametric form by 


7 1 y) 7T+A4 2p 
r=] 3 |+Al 0 [+z] 0 [= 3 (A, ue€R). 
4 0 3 44+3p 
It has normal vector (1,0,0) x (2,0,3) = (0,-3, 0). * 


It thus also has normal vector (0,1,0). Compute (7,3,4)-(0,1,0) =0+3+0=3. So, this 
plane may be described in vector or cartesian form by 


r(0,1,0) =3 or y 


I 
- 


(b) The plane that contains the points (8,0,2), (4,4,3), and (2,7,2) also contains the 
vectors (4, 4,3) — (8,0,2) = (-4,4,1) and (2,7,2) - (8,0,2) = (-6,7,0). 


Since (-4,4,1) + (-6,7,0), the plane may be described in parametric form by 


8 4 6 8-4 - 6u 
r=! 0 |+A] 4 [+e] 7 [=] 04447 (A, € R). 
2 1 0 2+A 
This plane has normal vector (—4, 4,1) x (-6,7,0) = (-7,-6, -4). Sa 


It thus also has normal vector (7,6,4). Compute (8,0, 2)-(7,6,4) = 56+0+8 = 64. So, this 
plane may be described in vector or cartesian form by 


r-(7,6,4)=64 or Tx+6y+4z = 64. 


(c) The plane that contains the points (8,5,9), (8,4,5), and (5,6,0) also contains the 
vectors (8,5,9) — (8,4,5) = (0,1,4) and (8, 5,9) — (5,6,0) = (3, -1, 9). 


Since (0,1,4) + (3,-1,9), the plane may be described in parametric form by 


5 0 3 5 + 3u 
r=] 6 [+A] 1 |+e#] -1 [=] 64+A-p (A, we R). 
0 4 9 Ad + 9 
This plane has normal vector (0,1,4) x (3,-1,9) = (13, 12, -3). Sf 


Compute (5,6, 0)- (13, 12,-3) = 65+ 72 +0 = 137. So, this plane may be described in vector 
or cartesian form by 


r-(13,12,-3)=137 or = 13a + 12y-32 = 137. 
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151.22. Ch. 75 Answers (The Angle Between a Line and a Plane) 


A287(a) The angle between a line with direction vector (-1,1,0) and a plane with normal 

vector (3, 4,5) is 

-1 (1, 1,0) - (3, 4, 5)| = sin! [1| 
I(-1, 1, OIG, 4, 5)| V2y/50 

(b) The given line has direction vector (-1,4,9)-(-1, 2,3) = (0,2,6) or (0,1,3). The given 

plane has normal vector (—3,1,0) x (0,5, -3) = (-3, -9, -15) or (1, 3.0 % 


Hence, the angle between the line and the plane is 


= sin! 0.1 + 0.100. % 





sin 


sin”! (0, 1, 3) - (1,3, 5) = sin7! [18] w 1.295. aS 
(0, 1,3)|](1, 3, 5)| /10\/35 


(c) The given line has direction vector (0, 11, 11)-(-1, 2,3) = (1,9,8). The given plane also 
contains the vector (1.5,0,0) - (0,0, 1.5) = (1. 5,0,-1. 5) or (1, 0, —1); hence, it has normal 
vector (4,-1,0) x (1,0, Ve (1,4, 1). * 


Hence, the angle between the line and the plane is 





-1 (1, 9,8) - (1,4, 1)| = sin-! [45] 
(1,9, 8)][(, 4, 1)| V146\/18 
A288(a) The line / has direction vector v = (2,3,5) and the plane q has normal vector 
n = (-10,0,4). Now, v-n = (2,3,5)-(-10,0, 4) = -20+0+20=0. So, vin and thus / || g. 


The point (4,5,6) is on / but not on qg because (4,5,6)-(-10,0,4) = -40+0+24 = -16 # -26. 
Hence, / and q do not intersect at all. 


(b) The line J has direction vector v = (5, 5,6) — (3, 2,1) = (2,3, 5). 

The plane q has normal vector n = (2,0,5) x (2,1,5) = (-5, 0, 2). xe 
Since v-n = (2,3,5)-(-5,0,2) = -10+0+10=0, we have v 1 n and thus / || ¢. 

Compute (3,0, 1)-(—5,0,2) = -15+0+2 =-13. So, q has vector equation r-(—5,0,2) = -13 


The point (3,2,1) is on J and is also on q because (3, 2,1) -(-5,0,2) = -15+0+2 = -13. 
Hence, the line lies entirely on the plane. 


(c) The line / has direction vector v = (6,8, 11) - (4, 5,6) = (2,3, 5). 


The plane q contains the vector (2, 1,-2) - (2,0, -2) = (0,1,0) and thus has normal vector 
n = (0,1,0) x (3,0, 10) = (10, 0,-3). % 


Since v-n = (2,3,5)-(10,0,-3) = 20+0-15=5 #0, , we have v { n and thus / | q. Hence, 
/ and q share exactly one intersection point. 


Compute (2, 0,-2)- (10,0, -3) = 20+0+6 = 26. So, q has vector equation r-(10,0,-3) = 26. 


To find the intersection point, plug a generic point of / into q’s vector equation: 


w# 1.071. ss 





|(4,5,6) + A(2, 3,5)] - (10, 0, -3) =-26 <> 2245-26 <=> d=08. 


Thus, / and q intersect at: (4,5,6) + \(2,3,5) = (4,5,6) + 0.8(2,3,5) = (5.6, 7.4, 10). 
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151.23. Ch. 76 Answers (The Angle Between Two Planes) 


A289(a) The angle between planes with normal vectors (—1,-2,-3) and (3,4, 5) is 


—1,-2,-3)-(3,4,5 —26 
6 =cos eee) RACE = cos! ale ~ 0.186. a 

|(-1, -2, -3)||(3, 4, 5)| V 14/50 
(b) The first plane has normal vector (1,-1,0) x (3,5,-1) = (1,1,8). t 
The second plane has normal vector (0, 1,0) x (10, 2,3) = (3,0, -10). % 
And so, the angle between the two planes is 

Le a : —1 _ 

cos’! \( ’ ,8) (3,0, 0)| a -l | 77| 0 133. G 





= cos” > —==—_— 
(1, 1, 8)| (3, 0, -10)| V66\V/109 
(c) The first plane contains vectors (3,0,0) - (1,1,0) = (2,-1,0) and (3,0,0) - (0,0,1) = 
(3,0,-1). And so, it has normal vector (2, -1,0) x (3,0,-1) = (1, 2,3). at 
The second plane contains vectors (1,0,-1)-(1,-1,0) = (0,1,-1) and (0,3,1)-(1,-1,0) = 
(-1,4,1). And so, it has normal vector (0, 1,-1) x (-1,4,1) = (5,1,1). at 


Thus, the angle between the two planes is 


-110,2,3)-G1, Dl oor LOL 937. ” 


6 = cos > ————_——"—~_ = cos * ———— 8 
(1, 2, 3)||(5, 1, 1) J14\/27 


A290(a) Since (4,9,3) + (1,1,2), q@ and q are not parallel and intersect along a line 
with direction vector (4,9, 3) x (1, 1,2) = (15, -5, -5) or (-3,1,1). t 


To find an intersection point, plug x = 0 into their cartesian equations to get 9y + 3z +61 
and y+2z 219. + minus 9x 2 yields -15z = -110 or z = 22/3. And so, y = 13/3. Thus, their 
intersection line has vector equation r = (0, 18/3, 22/3) + A(-3,1,1) (A€ R). 


(b) qi has normal vector (1,-1,0) x (1,-1,1) = (-1,-1,0) or (1,1,0). % 
qo has normal vector (6,-1,0) x (8,0,-1) = (1,6,8). 5 
Since (1,1,0) + (1,6,8), q and q are not parallel and intersect along a line with direction 
vector (1,1,0) x (1,6,8) = (8,-8,5). * 


Compute (1,3,-2)-(1,1,0) =1+3+0=4 and (2,3,5)-(1,6,8) =2+18+40=60. Hence, q 
and q2 have cartesian equations 7+ y= 4 and x + 6y + 8z = 60. 


To find an intersection point, plug in x = 0 to get y + 4 and 6y + 8z 2 60. So, y= 4 
and z= 4.5. Thus, their intersection line has vector equation r = (0,4, 4.5) + (8, -8, 5) 
(A€R). 
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A290(c) q: contains the vectors (7,7,0) - (1,1,6) = (6,6,-6) or (1,1,-1) and (5,3,3) - 
(1,1,6) = (4,2,-3). And so, it has normal vector (1,1,-1) x (4,2,-3) = (-1,-1,-2) or 
(L152), ae 
q2 contains the vectors (7,3, 1) - (5,5,1) = (2,-2,0) or (1,-1,0) and (5,5,1) - (3,5,2) = 
(2,0,-1). And so, it has normal vector (1, -1,0) x (2,0,-1) = (1,1, 2). *t 
Clearly, gq; and qj are parallel. 

To check if they intersect at all, we’ll pick any point on q;—say (7,7,0)—and check if it’s on 
qo. Compute (5,5, 1)-(1, 1,2) =5+5+2 = 12—hence, q has vector equation r-(1, 1,2) = 12. 
Since (7,7,0)-(1,1,2) =7+7+0=14# 12, the point (7,7,0) is not on q. Thus, q, and q 
do not intersect at all. 


(d) q; contains the vectors (5,3, 2)—(1, 5,3) = (4, -2,-1) and (10, 0, 1)-(5, 3, 2) = (5, -3,-1). 
And so, it has normal vector (4,-2,-1) x (5,-3,-1) = (-1,-1,-2) or (1,1, 2). * 
q2 contains the vectors (8, 8, -2)-(5, -1, 4) = (3, 9, -6) or (1,3, -2) and (8, 8, -2) -(3, 5, 2) = 
(5,3,-4). And so, it has normal vector (1, 3,-2) x (5,3, -4) = (-6,-6,-12) or (1,1, 2). 
Clearly, the two planes are parallel. 

To check if they intersect at all, we'll pick any point on q;—say (10,0, 1)—and check if it’s on 
qo. Compute (3, 5,2)-(1,1,2) =3+5+4 = 12—hence, q has vector equation r-(1,1,2) = 12. 
Since (10,0,1)-(1,1,2) = 10+0+2 = 12, the point (10,0,1) is on the second plane. Since 
gq, and q are parallel and share at least one intersection point, they must be identical. 


(e) Since (7,1,1) 4 (1,1,2), q and q are not parallel and intersect along a line with 
direction vector (7,1,1) x (1,1,2) = (1, -18,6). * 
To find an intersection point, plug x = 0 into their cartesian equations to get y+ z + 42 and 


y+ 2z 26. 2 minus 2 yields z = -36. And so, y = 78. Thus, their intersection line has 
vector equation r = (0, 78, -36) + A(1,-13,6) (Ae R). 


(f) Since (0,1,3) + (-1,1,3), q@ and @ are not parallel and intersect along a line with 
direction vector (0,1,3) x (-1,1,3) = (0,-3, 1). * 
Observe that if here we try plugging x = 0 into their cartesian equations, then we get 
yt3z +0 and yt+3z 2 2, which are contradictory. This contradiction tells us that the two 
planes have no intersection point with x-coordinate 0. 


So, let’s instead try plugging in y = 0 to get 3z 30 and -a+3z 42. Solving, z = 0 and x = -2. 
Thus, their intersection line has vector equation r = (—2,0,0) + A(0,-3,1) (A €R). 
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151.24. Ch. 77 Answers (Point-Plane Foot and Distance) 


A291(a) Write B = A+kn = (7,3,4) +k(9,3,7). Since B €q, we have 
OB -(9,3,7)=109 or [(7,3,4)+k(9,3,7)]-(9,3,7)=109 or 100+139k = 109. 


So, k = 9/139 and B = A+ kn =(7,3,4) +3(9 3,7) = ay (1 054, 444,619). 


And the distance between A and q is 


i (9,3,7)| = a5 V 185 = oe ae 


[AB = pen] = in = | Ji 


139 


(b) Write B = A+kn = (8,0,2)+(2,7,2). Since Be q, we have 
OB-(2,7,2)=42 or — [(8,0,2)+k(2,7,2)]-(2,7,2)=42 or 20+57k = 42. 


22 1 
So, k = 22/57 and B= A+ kn = (8,0,2) + 57 (2, 2) = 57 (500, 154, 158). 
And the distance between A and q is 


AB| = |kn| = |k| [n = =l2.7.21= 5 Vit =, Ss 
[AB] = in| = [A | = |J=]|(2, 7, 2)] = az 


(c) Write B = A+kn = (8,5,9) +k(5,6,0). Since B eq, we have 
OB-(5,6,0)=64 or  [(8,5,9)+k(5,6,0)]-(5,6,0)=64 or  70+61k=64. 


So, k = -6/61 and B= A+kn= An =(8,5,9) -=(6, 6,0) = = (458 269,549). 


And the distance between A and q is 


= 6 6 6 , 
[AB] = len = [A In| = =| (5,6,0)] = 5; VOL = 
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A292. Since B is the foot of the perpendicular, we have AB 1 q or AB || n or for some 
k #0, kn=AB=OB-OA. 


So, (b) |AB| = |kn| = |A||n|. Also, OB = OA + kn or (a) B= A+kn. 
We now show that / is as claimed in Fact 178. Since B € q, 


OB mad or (OA+kn)-n=d or OA none. 





d-OA-n d-OA-n 
Rearranging, k= = | p 
n-n n 
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A293. By Fact 178, the distance between the origin O and the given plane is 
ld-O0-n| _ld| 
[n| In| 


And hence, if |n| = 1, then the distance between the plane and the origin is simply |d]. 


A294(a) Perpendicular Method. Let A and B be the feet of the perpendiculars from 
Sand T tog. Write A=S+kn and B=T +n. 


Since A, B € q, we have 
OA-(5,-3,1)=0 or  [(-1,0,7)+k(5,-3,1)]-(5,-3,1) or 2+35k =0, 
OB-(5,-3,1)=0 or  [(3,2,1)+1(5,-3,1)]-(5,-3,1)=0 or 10+351=0. 


Solving, we have k = —2/35 and / = -10/35 = —-2/7. Thus, 


2 1 
A= = (-1 = =—-(8,=3,1) = —(—45, 6; 248 
S+kn ( 50,7) | ’ y] ) or y] v] y, 
2 1 
B= T+im= (3,2,1) - -3(5,-3,1) = 7 (11,20,5). 


Formula Method. First compute |n| = \/5? + (-3)” + 12 = V35. Then compute 


= d-O$-m _ 0-(-1,0,7)-(5,-3,1)_0-(-5+0+7) 2 





in? 35 35 35° 
)_d- OT n _ 0-(3,2,1)-(5,-3,1) _0-(15-6+1)_ 2 
InP 35 - 35 —— 


So, the feet of the perpendiculars from S and T’ to q are, respectively 
S+kn = (-1,0,7) - me <3, 1) + (R45 6, 243) 
~ gg 35 ’ ’ ~ 35 bet) ’ 
2 i! 
T+In= (3,2,1) - ~7(5,-3, 1) = 7 (11,20, 5). 


(b) The distances between qg and the points S and T are, respectively 


2 2 | 
k| In| = —-V35 = ——. % 
|| [xa 35 Ea 

2 2/5 
[||n| = — -./35 = —“E. % 
[Z| [na] 7 77 


(c) Since the origin is on q, the distance between the origin and q is 0. 
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151.25. Ch. 78 Answers (Coplanarity) 


A295. The line AB is described by r = (1,0,0) +A (1,-1,0) (A € R) and does not contain 
C' or D. Hence, A, B, and C are not collinear and neither are A, B, and D. 


The line CD is described by r = (0,0,1) +A(1,1,-2) (A € R) and does not contain A or B. 
Hence, C’, D, and A are not collinear and neither are C’, D, and B. 


A296(a) Let q be the plane that contains A, B, and C. The non-parallel vectors BA = 
(3,2,4) and BC = (5,8,4) are on q. 

Method 1 (Vector Form). qg has normal vector BAxBC = (3,2,4)x(5, 8,4) = (-24,8, 14) 
or (-12, 4,7). % 
Compute OA- (-12,4,7) =(0,1,5)-(-12,4,7) =0+4+35 = 39. Hence, q may be described 
by r-(-12,4,7) = 39. 

Now check if D €q: OD: (-12,4,7) = (6,6, 1) -(-12, 4,7) = -72+24+7+#39. Nope, it isn’t. 
So the four points aren’t coplanar. 


Method 2 (Parametric Form). The plane g may be described in parametric form as: 


0 3 5 3d + 5p 
r=] 1 ]+A] 2 [+e] 8 [=] 14+2d\+8p (A, €R). 
5 4 4 5+4A +4 
Now check if D € q: 
6 3+ 5p 623A +5p, 
6 |=] 14+2\+8p or 6214204 8p, 
1 5+ 4 +4y 1325444 4p. 


2x 2 minus 2 yields 11 = 12-3 or pu < 110, Lyos 2 minus = yields 3 = 1.5+7p or p= 3/14, 
which contradicts 4. So, D ¢q and the four points are not coplanar. 

(b) Let q be the plane that contains A, B, and C. The non-parallel vectors AB = (1, -3, 5) 
and BC = (6,1,-2) are on q. 

Method 1 (Vector Form). gq has normal vector AB x BC = (1=3,5) *. (6; 1.2) = 
(1, 32,19). 4 
Since q contains the origin, it may be described by r- (1,32, 19) =0. 

Now check if D € gq: OD- (1,32,19) = (4,7,-12) -(1,32,19) = 4 +224 -228=0. Yup, it is. 
So D €q and the four points are coplanar. 


Method 2 (Parametric Form). The plane g may be described in parametric form as: 


0 1 6 d+ 6u 
r=] 0 J+A] -3 [+e] 1 f=] -3A4+p (A,u€R). 
0 5 2 5A — 2p 


Now check if D € q: 
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4 \+ 6p 42+ 6, 
7 (=|, =3)4% or 72-34 p, 
~12 BA -2Qu ~12 25)\-2p. 


Qx 2 plus 3 yields 2 = —A or X = —2 and hence pw =1. These values of A and pu also satisfy Z. 
So, Deégq and the four points are indeed coplanar. 


(c) The line AB may be described by r = (0,1,2)+A(1,1,1) (A€ R). By picking \ = 2, we 
observe that AB also contains the point C' = (2,3,4). Hence, A, B, and C are collinear. 
And so by Fact 179, the four points are coplanar. 


The plane containing them contains the non-parallel vectors AB = (1,1,1) and AD 
(19,-1,-7). Hence, it may be described by 


r= (0,1,2)+A(1,1,1)+4(19,-1,-7) (A, R). 


A297(a) Since (3,2,1) 4 (5,6,7), the two lines are not parallel. Now write 


8 3 1 5 8+3\=14+5f, 
1]+Al 2 ]=] 2 |+2A] 6 or 149) 22467 
5 1 3 i 5+\ 2347p. 


2x = minus (+ + >) yields -11 = 0, a contradiction. So, the two lines do not intersect. 


Thus, the two lines are skew and not coplanar. 

(b) Since (3,9,0) || (1,3,0), the two lines are parallel. They are also distinct because, for 
example, the point (1,1,1) is on the second line but not on the first. 

Thus, they are coplanar and do not intersect. Compute (0,0,6) - (1,1,1) = (-1,-1,5). 
So, the (unique) plane that contains both lines is r = (0,0,6) + A(1,3,0) + w(-1,-1,5) 
(A, e€R). 

(c) Since (1,0,1) + (0,1,1), the two lines are not parallel. Now write 


6 1 9 0 6+) 29, 
5 |+Al 0 [=] 3 [+2] 1 or 5 234A, 
5 1 6 1 5+\ 264+p 


From z \=3. From 2 ji = 2. These values of \ and [i satisfy 3. Plugging these back in, 
the two lines intersect at (6,5,5) +3(1,0,1) = (9,3,6) + 2(0,1,1) = (9,5,8). 


Thus, the two lines are coplanar and the (unique) plane that contains them is r = (6,5,5)+ 
d(1,0,1) +2(0,1,1) (eR). 


(d) Since (-5,0,1) || (10,0,2), the two lines are parallel. Indeed, they are identical 
because the point (9,3,6) which is on the second line is also on the first (to see this, plug 
A = -2 into the first line’s vector equation). 


Since they are identical, they intersect at every point along either line. 


They are coplanar and there are infinitely many planes that contain both lines. 
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152. Part IV Answers (Complex Numbers) 


152.1. Ch. 79 Answers (Complex Numbers: Introduction) 


A298(a) True because (i) every integer and rational non-integer is rational; and (ii) every 
rational is either an integers or rational non-integer. 

(b) True because nothing is both rational and irrational. 

(c) False because 0€ R, so0¢C\R={xeCANDzeR}. 

(d) True because nothing is both imaginary and impure. 

(e) False because for example 1 is an integer but isn’t imaginary. 

(f) False because Zn {Imaginary numbers} = {0}. 

(g) False because ({Imaginary numbers} U {Impure numbers}) 9 R = {0}. 





























A79. 2.5 | lt+in | l+m |1-V6| 1-81 | V5+0i| -200i 
Complex "A J v Jv v Jv J 
Imaginary J 
Impure J "A 
Real J v J A 
An integer 
Positive J "A 
Negative "A J 
Rational J 
Irrational J J J 























A299. Rationalise any denominators with surds and write out the sine or cosine values: 


V3_ V2 


pees | 


v2 V2 


SS 


2 2 


2 


By/ 2. 42 


ene ae 


2 


2 


Comparing the real and imaginary parts, we see that only c= d. 


A300(a) z = (33, 33e). 
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(b) w = (237 +7,3 - v2). 


V3 


V2 


Gate) 


2 


2 


(c) w= (p,q). 
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152.2. Ch. 80 Answers (Some Arithmetic of Complex Numbers) 
A301(a) z+w = (-5 + 2i) + (7+ 3i) =24+5i, z-w = (-5 + 2i) - (7+ 31) = -12-1. 
(b) z+ w = (3-i)+ (114 2i) = 14+i, z-w = (38-i) - (11 + 2i) = -8 - 3. 
(c) z+w = (1+2i) +(3- V2i) =4+(2- V2)i, 2-w = (1 + 21) - (3 - V2i) = -2+ (2+ V2)i. 


A302(a) zw = (-5 + 2i) (7+ 3i) = -35 - 1bi+ 141 + 622 = 41-1. G 
2 = (-5 + 2i)” = 25 +2(-5) (2i) -4 = 21 - 201. G 
23 = (-5 + 21)? (-5 + 2i) = (21 - 20) (-5 + 2i) = -105 + 42i + 100i + 40 = -65 + 142i. G 
(b) zw = (3-i) (11+ 2i) = 33 + 6i- 11i - 2i? = 35 - 5i. G 
2 = (3-i)? =9+2(3)(-i)-1=8-6i. G 
2 = (3-i)° (3-i) = (8 - Gi) (3- i) = 24 - 8i- 18i- 6 = 18 - 261. G 
(c) zw = (1 + 2i) (3 - V2i) = 3 - V2i + 6 - 2V2?? = 3 + 2V2 + (6- V2)iL % 
2 = (140i) 31490) Cija4s-3 44 G 
2 = (142i) (1+ 2i) = (-3 + 4i) (1 + 2i) = -3 - i+ 41-8 = -11 - 21. G 
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A303. zw = (a+ib) (c+ id) = ac + iad + ibe + i2bd = (ac — bd) + i(ad + bc). 


A304. (2+i)2=4+42(2)(i)-1=3+44i. 
(2+i)? = (2+i)(2+i)? = (2+ i) (3+4i) = 6+ 81+ 3i-4=24 111. 


Hence, az® + bz? +3z-1=(2+i)%a+(2+i)7b+3(2+i)-1 
= (2+1li)a+(3+4i)b+3(2+i)-1 


= 2a+3b+5+i(1la+4b+3). 


Two complex numbers are equal if and only if their real and imaginary parts are equal. So, 


9a+3b+520 and 11a+4b+320. 


Take 3x 2 minus 4x 4: 3(1la+ 4b +3) —4(2a+3b45) =25a-11=0. 





it 
So, a= =0.44 and b= oe —1.96. 
25 3 
. : 1 1 ae | ; | 
A305(a) z* =-5-2i. So So Boage” 56 a 5G (-5,-2) =-—-—i 
1 1 1 
(b) w* =3+i. So = 3 pw WO) I0 ae 1) =0.3+0.1i. 


i! 
2422” 





1 
*=1-21.8 = =—(1-2i)= 
(c) w i. So — = ( i) 


: 
) 
A306(a) zz* = (a+ ib) (a—ib) = a? - (ib)? = a? - 170? = a? +B. 


lL i2 
b = =.= ——{} = 
( Zz 


e” (ay 2" 
zz* a2 +2 





zw* (1+3i)i 














A307(a) “ap 7 =-34+1. 
(b) z  zw*  (2-3i)(1-i) _ 2741-31-38 _-1-5i_ pe on 
w 12412 2 2 Oe 
E zw* (/2-mi)(3+V2i) 3/2+2i-3in+ V2n 347 
ak a em 
Ba (V9) 
z  gw*  (114+2i) (-i) , 
ay poe Seri. 
7 ee i 
(jt 3 OS os ori 
w 27+1? i) i) 
os oe AT HQ) (bei) joe 12 _33 1. 
w 52+ 12 26 7 26 26 26 
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(1,-2) = 0.2 - 0.4. 





G 


ae 
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152.3. Ch. 81 Answers (Solving Polynomial Equations) 
A308(a) x = (-1+ V-3) /2 = -1/2 + V3i/2. % 
(b) x = (-2+ V-4) /2=-14i. ts 
(c) x= (-3+ V-3) /6 = -1/2+ V3i/6. % 


A309. (-14 V3i) 22123 (1) 9/1431) (v3i) + 3V3i = -1 + 3V3i+ 9 = 3V3i = 8. 
J 


A310. Since (Ay +64 =0, one root is —-4 and z+ 4 is a factor for x? + 64. 


To find the other two roots, write 
xv? +64 = (x +4) (ax? + br +c) = ax? + (b+ 4a) a*+?x + de. 
Comparing coefficients, v3 +64 = (x +4) (2-47 +16). 


We can now find the other two roots using the usual quadratic formula: 


pi IPedge Ws /P=1) C6 
p= DENT Hae VEO) 94 VIRT6 = 24 VBI = 24 2V5. 
7 | 


Thus, the three roots of 2? +64 =0 are —4, 2+ 2\/3i, and 2 -2V3i. af 


A311(a) Since 1 is a root, write 
xv? +2?-2=(x-1) (ax? +bx+c) =ax*+ (b-a)a?+?r-c. 


Comparing coefficients, vi +a?-2=(x-1) (2? + 2x +2). 


We can now find the other two roots using the usual quadratic formula: 


Pe —b + Vb? - dac _ ses VV (-2)° - 4(1) (2) ee ee ee 


2a 2e1 





Thus, the three roots of 7? +2?-2=0 are 1 and -1+i. et 
(b) Since 1 is a root, write 

vt — x? - Ix +2=(x-1) (ax? + ba? + cx +d) = ax +(b-a) 23+ (c-b) 2?+?2 -d. 
Comparing coefficients, 24 -2*-2r+2=(x#-1) ie + x? - 2) 


But in (a), we already worked out the three roots of #? + x2? -2-=0. Thus, the four roots 
of «* -— 2? -27+2=0are 1, 1 (repeated), and -1 +i. ae 
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A312. Both equations have real coefficients and so the Complex Conjugate Root Theorem 
applies. That is, since 2 - 3i solves both equations, so too does 2 + 3i. 


Compute [x - (2-3i)] [x - (2 + 8i)] = (x - 2) - (31)? = 2? - de +18. 
(a) x4 6x? + 18x? - 142-39 = (2? - 4 + 13) (ax? + ba +c) = ax* + (b- 4a) v3 +2074? x + 18e. 
Comparing coefficients, ax? +bx+c= 27 -2Qx-3. 


By the quadratic formula or otherwise, we have x? — 2x — 3 = (2 +1) (x - 3). 
Thus, the four roots are 2 — 3i, 2+ 3i, -1, and 3. Ss 
(b) —2a* + 21x° — 93x? + 2292 - 195 = ax* + (b- 4a) 2° +?27+?@ + 13c. 


Comparing coefficients, az’? + ba + c= -2x47 + 13a - 15. 


We can now find the other two roots using the usual quadratic formula: 


—b+ Vb? -dac -13+,/13?-4(-2)(-15) 134/749 1347 
x ere CO CW’: eee eee 1 eee COS 
2a 2(-2) 4 4 


=1.5,5. 
Thus, the four roots are 2 — 3i, 2+ 31, 1.5, and 5. Ss 
A313. If 1-i solves x? + px +q-=0, then by Theorem 21, so too does 1+i. And so, 

a? +pxt+q=([x-(1-i)][x-(1+i)] = (@-1)?-? = 2? - 22 +2. 


Hence, p = -2 and q = 2. at 


152.4. Ch. 82 Answers (The Argand Diagram) 





A314. A 
y 
2i = (0,2) { stray) 
-1 = (-1,0) 2=(2,0) « 
@ @ > 
@) 
=H ere (1-3) 
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152.5. Ch. 83 Answers (Complex Numbers in Polar Form) 


A315. |2| = 2, |-1] = 1, |2i| = 2, |1 + 2i] = V12 + 22 = V5, |-1- 3i] = V/(-1)” + (-3)’ = V0. 


A316. Refer to figure on the previous page. We have arg 2 = 0, arg (-1) = 7, arg (2i) = 71/2 
and arg(1+2i) = tan’! (2/1) » 1.107. For -1- 3i, observe that @ = tan7!(3/1). Thus, 
arg (-1-3i) = 0 -m= tan! (3/1) — 7s -1.893. 





A317. A 
y 
lw| = \/ (-3)? + 22 = V13 
wea + ZI 9 
arg w =7-tan 3 w 2.004 
Ka 
> 
g270 =i 
Jz) = 224 (-1)? = V5 
11 
Ares ==TAL 5 x —0.464 





2 —1 0 
A318. arg2= cos"! = 0, arg (-1) = cos! — =n, arg (2i) = cos} — = 2/2. 





1 2 
arg (1+2i) = cos”! = #1.107,*  arg(-1-3i)=-cos! = ~ —1.893. %* 
arg z = arg(2-i)=-cos! ie w —0.464, arg w = arg (-3+ 2i) = cos? = aw 2.554. 
A319. Using the moduli and arguments found in the above answers, we have 
2 = 2(cos0+isin0), -1=1(cos7+isin7t), 2i = 2(cos™ +isin 7), 
1+2i* V5(cos1.107+isin1.107),  -1-3i* V/10 (cos—1.893 + isin -1.893), 
2-ix V5 (cos—0.464+isin-0.464),  -3 + 2is V/13 (cos2.554 + isin 2.554). 


152.6. Ch. 84 Answers (Complex Numbers in Exponential Form) 


A320. Using the moduli and arguments found in the above answers, we have 
2 = 2e% = 2e% = 2. Ase Vieve™ 1+2ia V5el 107, 1.23) a4/ We. 
w=2-in V5e 0464 z= 342i wx 130254, 
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152.7. Ch. 85 Answers (More Arithmetic of Complex Numbers) 


A321(a) |z| =|1|=1, argz =0, |w| = |-3] =3, and argw =7. 

Hence, |zw| = |z||w] = 3 and arg (zw) =argz+argw+t+ 2km=0+7+0=7. 

Thus, zw = 3(cos7+ isin 7) = 3e'” = -3. 

Next, |-2zw| = 2|zw| = 2-3 =6 and arg (-2zw) = arg (zw) -7=0. 

Thus, -2zw = 6 (cos0 + isin0) = 6e'™ = 6. 

(b) |z| = [2i] = 2, arg z = /2, wo] = |1 + 2i = V5, and argw = cos" (1/V5) » 1.107. 

Hence, |zw| = |z||w| = 2V5 and arg (zw) = arg z+ argw + 2k7 = 1/2 + 1.107 + 0 ® 2.678. 
Thus, zw = 2V5 (cos 2.678 + isin 2.678) = 2V/5e?"! w —4 + 2i. * 
Next, |-2zw| = 2|zw| = 2-2/5 = 4V5 and arg (-2zw) = arg (zw) — 1 2.678 — 1» —0.464. 
Thus, -2zw = 4V5 (cos -0.464 + isin -0.464) = 4V/5e0- 464 8 - 4i. 

(c) |z| = V10, arg z = - cos"! (-1/v10) w —1.893, |w| =5, and argw = cos”! (3/5) x 0.927. 
Hence, |zw| = |z||w| = 5V/10 and arg (zw) = arg z + arg w + 2k7t = -1.893 + 0.927 + 0 x -0.965. 
Thus, zw = 5V10 (cos —0.965 + isin -0.965) = 5V/10e~9 w 9 — 13:. % 
Next, |-2zw| = 2|zw| = 2-5V10 = 10V/10 and arg (-2zw) = arg (zw) +.» -0.965 +71 ® 2.177. 
Thus, -2zw = 10V10 (cos 2.177 + isin 2.177) = 10V10e7!"" w -18 + 26i. 

(d) |z| = V29, arg z = cos"! (-2/¥29] # 1.951, |w| =1, argw = 7/2. 

Hence, |zw]| = |z||w| = 29 and arg (zw) = arg z+ argw + 2k7t = 1.951 + 1/2 - 2» -2.761. 
Thus, zw = V29 (cos -2.761 + isin—2.761) = /29e7?764! w —5 — 2. x 
Next, |-2zw| = 2|zw| = 2V29 and arg (-2zw) = arg (zw) + m* -2.761 + 7 0.381. 

Thus, -22w = 2/29 (cos 0.381 + isin 0.381) = 2V29e84 w 10 + 4i. 

(e) |z| = V2, argz = - cos" (-1/v2) x —2.356, jw] = V5, and argw = - cos"! (-1/V5) nw 
—2.034. 

Hence, |zw| = |z||w| = 10 and arg (zw) = arg z + arg w + 2k7t = -2.356 — 2.034 + 27» 1.893. 
Thus, zw = V10 (cos 1.893 + isin 1.893) = V10e!°9 ~ -1 + 33. a 
Next, |-2zw| = 2|zw| = 2V/10 and arg (-2zw) = arg (zw) - m* 1.893 — 1 * -1.249. 

Thus, -2zw = 2/10 (cos —1.249 + isin -1.249) = 2\V/10e71 749! w 2 - Gi. 

(f) |z| = V34, argz = -cos? (-5/v34) w —2.601, |w| = 26, and argw = —cos"! (5/v26) ws 
—0.197. 

Hence, |zw| = |z||w| = V34V26 and arg (zw) = arg z+arg w+2k7 © —2.601-0.197+0 » -2.799. 
Thus, zw = V34V26 (cos -2.799 + isin -2.799) = V34V/26e 2.79% w -28 = 103. a 
Next, |-22w| = 2|zw| = 2V34V26 and arg (-2zw) = arg (zw) + 18 -2.799 + 7 0.343. 

Thus, -22w = 2/34/26 (cos 0.343 + isin 0.343) = 2V/34V/26e9345' w 56 + 20i. 
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A322(a) z=r(cos@+isin@) and w =s(cos¢@+isin®@). 
(b) zw =rs(cos@ +isin@) (cos¢@+isind) 
=rs(cosOcos¢+isin@cos¢+icos@sin ¢ — sind sin d) 


=ps[cos (0+) +isin (04-4). 
(c) Jew| = \/[rs cos (0+ #)/ + [rssin (0+ 6)] 

= rsy/cos? (0 +4) +sin? (0+) =rsVI=rs=|z|lul. Vv 
A823(a) |z| = 1, argz=0. So, |1/z| = 1/|z| =1, arg (1/z) = —argz = 0. 


1 : 
Thus, — = le“ = cos0+isinO = 1. 
z 


(b) |w| = 2, argw = 7/2. So, |1/w| = 1/|w| = 1/2, arg (1/w) = -argw = -7/2. 


a 1 —T at 1 
Thus, rs son = (cos Be +isin = = “5h % 


(c) |z| = 17, argz = 7. So, |1/z| = 1/|z| = 1/17. Note importantly that z <0, so that Fact 
194(b) does not apply here. We have, simply, arg — = arg z =7t. And, 
z 


= =e a (cos 7+ isin 71) = : 
7 “ay ae 





&R 
= 
~l 


(d) |w| =8, argw =—-m/2. So, |1/w| = 1/|w| = 1/8, arg (1/w) = -argw = 7/2. 


Thus, = =¢i7/? = : (cos a isin =| 2 si. % 


2 2 


(e) |z| = V29, arg z = cos"! (-2/v29) # 1.951. So, |1/z] = 1/|z| = 1/V29, arg (1/z) » -1.951. 











1 1 ol 
Thus, ay a cos -1.951 + isin-1.951) » —0.069 + 0.172i. % 
z y/29 /29 ( ) 

(f) |w| = V2, argw = -cos} (-1/v2) = ~3n/4. So, |1/w| = 1/|w| = 1/V2, arg (1/w) = 37/4. 
ae ae 1 i 4 

Thus, ee (cos on +isin =) =-~+-i. af 
w /2 re 4 4 2 2 

(g) |2| = V10, arg z = - cos" (1/V10) » -1.249. So, |1/2| = 1/|z| = 1/V10, arg (1/2) » 1.249. 

Thus ee (cos 1.249 + isin 1.249) » 0.1 + 0.3i. a 

z /10 J/10 


(h) |w| =5, arg w = cos! (3/5) * 0.927. So, |1/w] = 1/|w| = 1/5, arg (1/w) = — arg w = -0.927. 


1 1 i 
Thus, = we 0987 . (cos -0.927 + isin -0.927) » 0.12 - 0.163. at 
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A324(a) |z| = |1]=1, arg z =0, |w| =|3] = 3, and argw =7. Hence, |z/w| = |z|/|w| = 1/3 and 
arg (z/w) = arg z-argw+2kn =0-m+2n=7. Thus, z/w = 1/3 (cosm+isin7) = !/3e'™ = —1/3, 


(b) |z| = |2i| = 2, arg z = 1/2, |w| = |1 + 2i| = V5, and argw = cos (1/v5) « 1.107. Hence, 

|z/w| = |z|/|w| = 2/5 and arg (z/w) = arg z — argw + 2k = 1/2 - 1.107 +0 » 0.464. Thus, 

z/w » (2/V5) (cos0.464 + isin 0.464) » (2/V5) 4 w 0.8 + 0.41, ts 
21 2 1-21 2i1+4 


: = = 0.8 40.4: 
(29 (2012-8 Pe oe 





(c) |z| = |-1-3i| = V10, argz = -cos"! (-1/v 10) « —1.893, |w| = |3+4i| = 5, and argw = 


cos! (3/5) » 0.927. Hence, |z/w]| = |z|/|w| = V10/5 = 2/5 = V0.4 and arg (z/w) = argz - 
arg w + 2k7t = -1.893 —- 0.927 + 0 » —2.820. 














Thus, z/w x V0.4 (cos -2.820 + isin -2.820) » V/0.4e7?°" = -0.6 - 0.2i. at 
Sia3i 21-316 =4 S344201=12. 155i 
2 o AaiicaliRlacr = -~0.6 — 0.2i. 
344) 344i 3-4i 32 4 42 25 


(d) |2| = |-2+5i] = V29, argz = cos! (-2/V29) » 1.951, |w| = fi] = 1, and argw = n/2. 
Hence, |z/w| = |z|/|w| = V29 and arg (z/w) = arg z - argw + 2km = 1.951 -7/2+0% 0.381. 











Thus, z/w » V29 (cos0.381 + isin 0.381) # V29e9 8" ~ 5 + 2i. ae 
oa K a a 
ae ES 2 A 5h 
i i -i - 


(e) |z| = |-1-i| = V2, argz = -cos (-1/v2) x —2.356, |w| = |-1-2i] = V5, and argw = 


—cos} (-1/v5) « —2.034. Hence, |z/w]| = |z|/|w| = 2/5 = V0.4 and arg (z/w) = arg z - 
arg w+ 2k7t = —2.356 + 2.0384 + 0 # —0.322. 





Thus, z/w * V0.4 (cos -0.322 + isin -0.322) » V0.4e03"2! ~ 0.6 — 0.2i. at 
-1-i -1-i-1+2i 1-2i+i1+2 3-i 
= . - = 0.6 - 0.2i. 
2° 150i ee? 5 


(f) |z| = |-5 - 3i] = V34, argz = -cos (-5/v34) « —2.601, |w| = [5 -i| = V26, and argw = 


—cos} (5/26) « —0.197. Hence, |z/w] = |z|/|w| = 34/26 = 17/13 and arg(z/w) = 
arg z-argw + 2km = -2.601 + 0.197 + 0 » -2.404. 

















Thus, z/w = 17/13 (cos -2.404 + isin -2.404) = /17/13e 74. at 
—-§5-31 -5-3i15+i -25-51-151+3 -22-20i 1 10: 
- = = = — 1. 
9-1 59-1 5+i 52 + 1? 26 13 13 
1 1 1 
A325. |—| = -— 2 |2| - 2 |2|— = lel where = and 2 use Facts 193 and 194. 
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153. Part V Answers (Calculus) 
153.1. Ch. 86 Answers (Limits) 
A326. lim ¢ (2) =0, lim é (a )= 1, lima 4 (ay =a, lim ¢ Ga es? 
A327. lim f (a)=1 lim f (x) does not exist, and lim f (a= 2: 


Figure to be 


inserted here. 





A328. From the graph below, lim h (x) =0 and lim é Cas 


Figure to be 


inserted here. 





And so, by Theorem 23, lim hate) | =7x0=0, lim [h (a) +2(x)] =0+2 =2, lim [h(z)-i(z)] = 


1 1 h 0 
0-2 = -2, lim [h (x) i (x)] 20% 2 = 0) aa) aor hin = a lim k = 7, and 





lim x" = 0" = 0. 
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153.2. Ch. 87 Answers (Continuity, Revisited) 
A332. By Definition ??, 


sin i 1 cos 
tan = —., cosec = —., sec = ——, col = ——, 
cos sin cos sin 


Each of tan, cosec, sec, and cot is defined as the quotient of two continuous functions. And 
so, by Theorem 26, each of these functions is continuous. 


By Definition 99, tan™! is defined as the inverse of the continuous function tan restricted 


m 7 
to the interval (-5, *). And so by Theorem 27, tan7! is also continuous. 
A331(a) lim(x+1)*lime+lim1 rei, 
The cubing function is continuous at 1. And so by Fact ??, we can “move” the limit in: 
: 3 
lim (a + 1)" = ima (x + 1)| ate, 
ee ae 
(b) lima* = 0° =0. 
x0 
The cosine function is continuous at 0. And so by Fact ??, we can “move” the limit in: 
lim (cos x”) = COS (iim °) =cos0= 1. 
u—> xz->0 


The sine function is continuous at 1. And so again by Fact ??, we can “move” the limit in: 


lim sin (cos a) = sin im (cos °)| = sin 1. 
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153.3. Ch. 88 Answers (The Derivative, Revisited) 


A333(a) f is differentiable at -3, with f’ (-3) 


= —], as we now show: 


fi (-8) = tim £@)= £0) _ yy HIE 


x — (-3) a>-3 £+3 





-2-3 

= lim — (For all x “near” 3, x <0 and hence |z| = -2) 
z>-3 £+3 

= lim -1£-1. 


t>-3 


(b) f is differentiable at any a <0, with f’ (a) =—1, as we now show: 





rea) ren F(#)- F(a) _|, [e|-la 
7 (a) =lim = lim 
wa 86 EG za £-A 
s a +a oo ” 
= lim (For all x “near” a <0, x <0 and hence |z| = -2) 
ra £- a 
=lim-1£-1. 


w>a 


A334(a) First simplify the difference quotient: 


g(2)-9(-3)_2?-(-3_(w-3)[e-(-3))_ 
x — (-3) x — (-3) x — (-3) 


Now, 


PCat (x) - 9 (-3) _ 





. + . . oe 
z>-3 g-(-3) | a aie a ee 
Sas oN 
=3 = 


The derivative of g at -3 exists and equals -6. 


(b) First simplify the difference quotient: 


g(x) -g (0) _ 22-0? (w~0)(w+0) _ 
x-0 x—-0 x —-0 
Now, 


(0) = Him 969 79D) _ time E 
g' (0) = lim ai = lim x = 0. 


The derivative of g at 0 exists and equals 0. 


(c) First simplify the difference quotient: 


g(a) -g(a)_a2-a _(e-a)(e+a) _ 


r+aQ. 
r-a w—-a 


xL-a 
Now, 
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g' (a) acca bee 2) =lim (x +a) = lima + lima = 2a. 


The derivative of g at any a€ R exists and equals 2a. 


A335. Let ace R. 


(a) Simplify the difference quotient: J (a) = f(a) eos 0. 
Za La 


f(x) - f(a) _ 





limo £ 0. 


wa 


So, lim 
wa 


We’ve just shown that for any a € R, the derivative of f at a exists and equals 0. 
Hence, the derivative of f is the function f’:IR > R defined by f’ (x) =0. 
The derivative of f at 2 is f’(2) =0. 

g(x)-g(a) _ (5a+7)-(5a+7)  5a-5a _ 








(b) Simplify the difference quotient: = 5: 
L-a r-a r-a 
So, tim 962) - 9) _ ns Ss, 
wa G—- a wa 


We’ve just shown that for any a € R, the derivative of g at a exists and equals 5. 
Hence, the derivative of g is the function g’: R > R defined by g’ (x) = 5. 

The derivative of g at 2 is g' (2) =5. 

(c) Simplify the difference quotient: 


h(x)-h(a) (2x24+5x2+7)-(2a?+5a+7) 2x? - 2a? + 5x -5a 


v-a x-a L—-a 
2 42 _ ee _ 
9 pO gD) xs Ges 
L—-a x-a L—-a 
Geer 
2a 2a 5 





in BD - BO 


ie i Tt at ae 
= lim [2 (4 + a) + 5] = lim 2z + lim 2a + lim5 = 4a+ 5. 
wa G= wa wa w>a wa 


We’ve just shown that for any a e€ R, the derivative of h at a exists and equals 4a + 5. 
Hence, the derivative of h is the function h’: R > R defined by h’ (x) = 4a +5. 





652 At =, we can apply the Sum Rule because these three limits exist: 
1. lim 2x = 2a (by the Power and Constant Factor Rules for Limits) 


2. lim 2a = 2a (by the Constnat Rule for Limits) 
3. lim5 =5 (by the Constant Rule for Limits) 
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The derivative of h at 2 is h’(2)=4-2+5=13. 
A336. Write g(x) = \V/|2| = V Vx? to see that g is elementary. 


Figure to be 


inserted here. 





Observe that the tangent line of g at 0 is vertical—hence, g is not differentiable at 0 (and 
is not a differentiable function). 
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153.4. Ch. 89 Answers (Differentiation Notation) 


A183(a) The derivative of f is f : R > R (Newton) or f : R > R (Leibniz) and is defined 
2 


by F(a)= SF =0 


d 
The derivative at 2 is the number f (2) = — 


_ AF poy _ 
dx ec 





d 
(b) The derivative of g is g: R > R (Newton) or ~ : IR > R (Leibniz) and is defined by 
a 


. dg 
— 5 
g(a)=— 
The derivative at 2 is the number g (2) dg dg (225 
vativ S| = (2)=5; 
7 dx 3 dx 





(c) The derivative of h is h:R>R (Newton) or = : R > R (Leibniz) and is defined by 
e 


h(x) == 4045. 


The derivative at 2 is the number h (2) = a 


dh 
=) 2 =4.2 5= 13. 
dx a : 





A337. T (h) is the function whose domain and codomain are both R and whose mapping 
rule is x (x +1)? /2. And T (h) (2) = 9/2. 

T (7) is the function whose domain and codomain are both R and whose mapping rule is 
x (3x -1)° /2. And T (i) (2) = 25/2. 
S(j) is the function whose domain and codomain are both R and whose mapping rule is 
w+ (x+1)?/4. And $(j) (2) =9/4. 
S(k) is the function whose domain and codomain are both R and whose mapping rule is 
x (3a-1)7/4. And S(k) (2) = 25/4. 

A338(a) If a differentiable function maps each x to x, then its derivative maps each x to 
ox". 

(b) If a differentiable function maps each x to sina”, then its derivative maps each x to 


27 COS x. 





(b) If a differentiable function maps each x to Inz, then its derivative maps each x to 1/2. 
A339. The mistake is in Step 4. 


This is a common mistake made by students. To find the derivative of f at 2 (a number), 
we must first find the derivative of f (a function)—then plug in 2. 


The mistake here is to plug in 2 first, getting f (2) = -2, then differentiating the constant 
—2 (a meaningless operation), which of course yields 0. 
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153.5. Ch. 89 Answers (Rules of Differentiation, Revisited) 


A341. Let ae R. Simplify the difference quotient: 


4_ 4 3 Dc 3 
i()-i(a)_& —a « (t-a)(@ +ax? +a t+@") 34 aetterea 
L-a La La 
So, 958 
a? a> a? a? 
: ( )-i(a) SSS OOF See OOS 
| ay, Ae 1: 3 ee oe yh oe es ae 2S in ee ee ae 
t (a) = lim ———— = lim (a + az* + ax + a") = lima” + limaz* + lima*z + lima’ = 4a’. 
rL>a T—-a xL>a z>a xL>a xL>a z>a 


We’ve just shown that for any a€ R, the derivative of i at a exists and equals 4a’. 
Hence, the derivative of i is the function i’: R > R defined by 7’ (x) = 4°. 
A342. Let ae R. Simplify the difference quotient: 








j (x) - 3 (a) _ a°-a% o (x —a) (ae! + a°?%a 4+ oo Sa? +--+ + cae? +a~!) eee ee 
r-a r-a L-a 

So, 

654 


j' (a) = lim lim (a! +a°?a+a°%a? +---+ 20°" +a") 
ra r-a ra 
= lime! + lim 27a + lim 2° 3a? +--+ + limza®? + lima! = ca! 
wa wa wa wa wa 
a ee Sass OEE 
acl acl acl acl acct 


We’ve just shown that for any a€ R, the derivative of j at a exists and equals ca". 





653 At = we can use the Sum Rule for Limits because these four limits exist: 


1. lim? =a? (by the Power Rule for Limits) 


lim ax? = a® (by the Power and Constant Factor Rule for Limits) 


«wa 


lim a?x = a® (ditto) 


za 


lim a? = a® (by the Constant Rule for Limits) 


«wa 


= oo IS 


654A¢ = we can use the Sum Rule for Limits because these c limits exist: 


1. lima®! =a" (by the Power Rule for Limits) 


za 


. lima®?a =a“ (by the Power and Constant Factor Rule for Limits) 


«wa 


< lime? "a" =a" (ditto) 


2 

3 La 

4.: 

5. lim ae =a’ * (ditto) 
6 


. lima®' = a®' (by the Constant Rule for Limits) 


za 


1897, Contents www.EconsPhDTutor.com 


Hence, the derivative of 7 is the function j’:IR - R defined by j’ (x) = cx}. 
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153.6. Ch. 91 Answers (Some Techniques of Differentiation) 


d 
A343. Apply the dn operator to the given equation: 
e 

















d d d d 3 
— (x? siny) =—l] => 322 siny +23 cosy— 2 () — oF = tiv: 
dx dx dx U5 
I 
From the given equation, we have y 3 sin”! a: 
x 
d 3 1 
Plug 3 into 2 to get SY 2? tan (sin =). 
dx x is 
d 
A344. Apply the dn operator to the given equation: 
us 
d d d d 
qy ysinz) = 4 (eh 1) —=> SH sina + yoosz be (14 22). 
By the given equation, at 7 = 0, we have 0 = e’ -1 or y =0. 
So, plug (x,y) = (0,0) into 2: 
d d 
— sin +0-cos0 =o (14 ) = 
dx !(x,y)=(0,0) ' dz fy) (00) 
O40 (1 gc as “a = 
dx! («,y)=(0,0) dx !(x,y)=(0,0) 
A345(a) Let y= cos! x € [0,7], so that x = cosy. 
d 
Apply the — operator to é: 
dx 
d d d d -] 
—ZX=— cosy = 1 =-siny —> ane —, for siny #0. 
dz dx dx dz siny 


From the identity sin? y + cos” y = 1, we have sin y *+V/1-22. But since y € [0,7], we know 
that siny >0. And so, we may discard the negative values and rewrite 2 as sin y 3/1 - 22. 





d d —1 
Now plug 2 into 2 to get costa = % = which is defined for x € (-1,1).°? 
ii 


dz VJ1-2?’ 


(b) Let y= tan! 2, so that x + tan y. 


d 
Apply the ae operator to é: 
x 


Qu 
1@) 
s) 
1) 
< 
Qu 


y2 1 


d 
—r=—tany = 1 =sec ae — co ry 


dx dz 


(Note that for all y € R, secy #0.) 





1 


®°°So, cos’ is not a differentiable function: Its domain is [-1,1], but that of its derivative is (-1,1). 
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Plug + into the identity sec? y = 1+ tan? y to get sec” y S14a?. 


d dy 1 
Next, plu 3 into 2 to get — tan! 2 = — = . 
noe oe da dx 1+2? 





d 
A346. Apply the a. operator to the given equation: 
x 


d d d 
da 7 *8D2) = G0 i doy +2?" + cos. =0 — 
ie a = for x + 0. 
da a 
And so, by the IFT: 
da a 


— = -—____., for 2xy + cos x # 0. 
dy 2xy+cosx 


A347. Because V1 —- x? = cos (sin! a), as we now show: 


Let y = sin"! a. Then x = siny and V1-2?2=V1- sin? y = COSY = COS (sin x). 


d d 
A352. Compute ee e! — 3t? and ae —sint + 2t. 
dt dt 


So, by the Parametric Differentiation Rule, 


dy dy dx ef —3¢? 
dx dt dt —-sint+2t 
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(for —sint + 2t #0). 
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153.7. Ch. 93 Answers (The Second and Higher Derivatives) 


d 
A353(a) First derivative: , = eS? (_ sin x) = -e* sin x. 
ua 


COS & 


Observe that the expression —e°°*” sin x is defined for all x € R. 


d 
Hence, f is a differentiable function. Its (first) derivative may be denoted f", f, or ae and 
a 
has domain R, codomain R, and this mapping rule: 


Ps) =f ta) = = (a= - == * sin r, 


The (first) derivative of f at 1 is f’(1) = -e°S! sin 1. 


_ e608 x cos x COS & ( 2 


d ; : 
Second derivative: re (-e°S* sin x) = sin x (—sinxz)-e°** cosxz =e sin’ x — cos x). 
ui 


aie (sin? XZ — cos x) is defined for all x eR. 


Observe that the expression e 
Hence, f is a twice-differentiable function. Its second derivative may be denoted f”, f, or 
2 


a2! and has domain R, codomain R, and this mapping rule: 
£ 


f" (2) =e" (sin? 2 — cosa). ® 


The second derivative of f at 1 is f” (1) =e! (sin? 1 - cos 1). 
d 2 

(b) First derivative: —Va2?-1= —— 
dx Q/xr2-1 Vx2-1 


Observe 2?-1=0 <— > x=41. So, g’ will not be defined at 1 and is defined only on (1, oo]. 








d 
Hence, g is not a differentiable function. Its (first) derivative may be denoted g’, g, or mee 





dx 
and has domain (1, co], codomain R, and this mapping rule: 
. dg dg ne 
! — = ———— = = 
a (0) = 9 (2) = $2 (2) = $8 =§ 


The first derivative of g at 1 is undefined (or does not exist). 


Second derivative: 


d x 1 v- 2x x2 -1 x —] 


Again, g” will not be defined at 1 and is defined only on (1, 00]. 
Hence, g is not a twice-differentiable function. Its second derivative may be denoted g”, g, 
2 


= and has domain (1, co], codomain R, and this mapping rule: 


of — = 
dx? 


" = =il 
")= ae © 


1901, Contents www.EconsPhDTutor.com 


The second derivative of g at 1 is undefined. 
d 2). 2 
(c) First derivative: —Vsin x? +2 = Ss apnea 
dx 2Vsin x? +2 sin x? + 2 


Observe that for all « € R, sinz? +2> 0. 





dh 
Hence, h is a differentiable function. Its (first) derivative may be denoted h’, h, or —; and 





dx 
has domain R, codomain R, and this mapping rule: 
. dh dh 2 
h! (x) = h(a) = — (2) = = 
dx dx VJ/sinx? +2 
1 cos 1? cos 1 
The (first) derivative of h at 1 is h’ (1) = = 
oe ) Vsinl?+2 vVsinl+2 

Second derivative: 
d.. “cosa? . cosa” =a(sing*) 2a. ge eosa" (cosa) 2e cosa? =2n"sinw? a cos? a" 


dx J/sinz2 +2 — Vsin x? +2 2 (sina? + 2)"? - sin x? + 2 (sin x? +2)'° 
Again, for all x € R, sina? +2>0. 
Hence, h is a twice-differentiable function. Its second derivative may be denoted h”’, h, or 


d?h 
; and has domain R, codomain R, and this mapping rule: 








dx?’ 
h" (2) = cosz*—2¢?sinz*  —_£? cos* x? © 
= 5 a weer. 
sin x? + 2 (sin x? + 2) 
cos1?-2-1?sin1? 1? cos? 1? cos 1-2sin1 


The second derivative of h at 1 is h’” (1) - = 
Vsinl2+2  (sinl?+2)'° Vsinl+2 
cos? 1 
(sinl+2)'° 
A354(a) False. Twice differentiability implies differentiability. However, the converse 
(“differentiability implies twice differentiability”) is false—in Example 1156, the function h 
is differentiable but not twice differentiable. 


(b) True by definition. 


(c) True by definition. Recall that the second derivative of f at a is defined as this limit 
(which is a real number if it exists): 
! _ fi 
an ff (a) 
ra eG 





So, if f’ (a) doesn’t exist, then clearly the above limit (or equivalently, the second derivative 
of f at a) cannot exist either. 


A355. Since f is a differentiable function, by Definition 205, it is differentiable at every 
point in its domain. Thus, the (first) derivative f’ has the same domain as f, namely D. 


We are given no information about the second derivative of f. Thus, all we can say about 
the domain of f” is that it must be a subset of D, the domain of f’. The domain of f” 
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could be as “big” as D (if f is twice differentiable) or as “small” as @ (if f is nowhere twice 
differentiable). 


A356. The function g is (at least) five-times differentiable. Its first five derivatives have 
domain R, codomain R, and these mapping rules: 
Lagrange Newton Leibniz 


FCC - 2 (2-4 hei Gs Dat. 


g" (x) = g(a) “is x) = 8 = 120? 60 +2, 
o"(a) =G(2) = 59 (x)= 54 = 240-6, 
g(x) =4(e) =<4(0)- Za 
g(x) =9(a) = 9 yo 


Clearly, for any integer n > 5, g is n-times differentiable and its nth derivative is defined by 


Lagrange Newton Leibniz 


g(x) =9(0) = 2(x)- <2 =0 





Evaluating each of g’s derivatives at 1, we have 


Lagrange Newton Leibniz 




















dg df 
'd)=g(0 =—(l)=—= =A 1 Sole eos a1 Se 
g (1) =9 (1) - d|_, + 
g" (1) =9(1) aa =12-17-6-14+2=8, 
=1 
. cg 
eg’ (1) =¢ (1) 9 (1) =< =24-1-6=18, 
xv=l 
g (1) =9(1) 9 (1) = 29 = 24 
es si 
For each integer n > 5, 
n d"g ang 
™(1)=-61 = ie = 
gs” (1) = 9 Q) qn (1) dan|_, 





A357. The first five derivatives of h have domain R, codomain R, and these mapping 
rules: 
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h' (x) = 5a* — 423 + 327-22 +1, 
h' (x) = 20x? - 1227 + 6x - 2, 
h'" (x) = 60x? — 24x + 6, 
A (x) = 1202 - 24, 
AY) (x) = 120. 
For any n > 6, the function h is n-times differentiable, with the nth derivative defined by 
A (x) =0. 


Hence, h is smooth. 


A358. The first four derivatives of cos are 


cos’ = —sin, 

cos” = —cos, 
cos” = sin, 
cos“) = cos. 


We see then that we’ll have a repeating cycle. Specifically, for each k = 0,1,2,3,..., we 
have 


cos(!+4*) = — sin, 
cos?+4*) = — cos, 
cos@+4*) = gin, 
cos(**) = cos. 
We’ve just shown that sin is n-times differentiable for every positive integer n. Hence, exp 
is smooth. 
And the 8603rd derivative of cos is 


cog(8 6038) = epg (8+4x2 150) ~ gin. 


A359. We’ve already used f” to denote the composite function fo fo---o f. So, the 
— 


n times 
parentheses help us distinguish f‘”), the nth derivative of f, from the composite function 


f". 
A360(a) The first derivative of h is the function h’: R > R defined by h’ (x) = 2a. And 
Wy = 2s, 


(b) The composite function h? : R > R is defined by h? (x) = h(h(x)) = h(a? +1) = 
(a? +1) +1=204+227+2. And h?(1) = 1442-12425. 

(c) The squared function (h)° :IR > R is defined by (h)* (x) = (a + 1) = ¢4+227+1. And 
(h)? (1) =144+2-12 4124, 
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2 2 
(d) The function (=) : R > R is defined by (=) = [h'(x)]} = (2r)° = 42”. And 





dy 

dh\° 

—_ =4-17=4. 
(i) 

r=1 
(e) The second derivative of h is the function ah IR —> R defined b oP. 1d h’ (x) 
Vi Vv os > ——_—. = — = 
da? Y dae dex 

d d?h 
— (2x) =2. And —~| =2. 
7 ”) Oe de = 





dh? 
(f) The first derivative of the composite function h? is the function aa : IR > R defined by 
c 


dh dg d 7-4 
a a —— 2 9) = 47° Ay. 
re aa {*) sy (wt + 2a? +2) xv? + 4x 
2 
eres =4.134+4.1=8. 
dx Ss 





dh 

(g) Informally, the function a gives us the rate of change of h with respect to 27. By 
£ 

the Chain Rule, 





dh dx? dh 
dxz2 dx dx’ 
KS ae ee 

22 2x 


dh dh dz? 22% 
Rearranging, a2 = ae od ce. = oe =I, 


So, the function sd :R — R is defined by oP =1. And — | =1. 
da? da? 


xr=1 
dh? 
(h) Informally, the function a2 gives us the rate of change of the composite function h? 
iz 


with respect to x”. By the Chain Rule: 


dh? dx? — dh? 
dx2 dx dx | 
—s— ——* 

2x 4e3+de 


dh? dh? da? 4x3 +4a _ 





Rearranging, Ene) = aie = ae = On 2x? + 2. 
dh? dh? dh? 

Hence, the function —~: R > R is defined by —> = 227 +2. And —<| =2-17+2=4. 
da da da 7 
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153.8. Ch. 94 Answers (The Increasing/Decreasing Test) 
A361(a) The derivative of f is the function f’: IR > R defined by f’ (x) =e” -1. We have 
<0, for x € (-00,0), 


f(z) =), lor a =, 


>0, for xe (0,00). 


Figure to be 


inserted here. 





The function f is strictly decreasing on (-0o0,0) and strictly increasing on (0,00). At the 
point 0, f is both increasing and decreasing (but neither strictly increasing nor strictly 
decreasing). 


These findings are consistent with Fact 208: 
IDT 


(a) f’(x) 20 for all x € [0,00) => f is increasing on [0, 00). 
(b) f’(2) >0 for all x € (0, «) Ed f is strictly increasing on (0, co). 
(c) f’(z) <0 for all x (-00,0] = f is decreasing on (-o0, 0]. 


(d) f’(«) <0 for all x € (-00,0) ee f is strictly decreasing on (—0oo,0). 


(b) The derivative of g is the function g’: [0,27] > R defined by g’ (x) = cosa. We have 


> 0, for x € 0,5) u(F,2n, 


2 2 
3 
@ (2) =cos(2)4=0, torg= a 
<0, for x € ie 
2° 2 


Figure to be 


inserted here. 





7 OT Mm 37 
The function g is strictly increasing on 0. | U (> 2n| and strictly increasing on (7, =]. 
T Tl 
At each of the two points ~ and pte is both increasing and decreasing (but neither strictly 
increasing nor strictly decreasing). 
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These findings are consistent with Fact 208: 


(a) g’(x) >0 for all xe€]0, | U =. 27 = g is increasing on | 0, | U | 
(b) g’(x) >0 for all x€]0, =| U (= Qn = g is strictly increasing on |0, 2) U ( 
(c) g'(x) <0 forall xe = = = g is decreasing on = > ; 
(d) g’(x) <0 forall ve (Z, = eae gis strictly decreasing on (2, = ; 


A362(a) The derivative of i is the function i’: R > R defined by i’ (x) = -327. 


Figure to be 


inserted here. 





(b) The function 7 is strictly decreasing on R because if a < b, then i(a) = —-a® > —b® = i (0). 
By Fact 46, 2 is also decreasing on R. 
(There is no interval on which 7 is increasing or strictly increasing.) 


(c) The function i shows that the converse of IDT(d) is false. 
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153.9. Ch. 95 Answers (Determining the Nature of a Stationary 
Point) 


A363(a)(i) The derivative of f is the function f’ : IR > R defined by f! (x) = -32? +3 = 
3(1-2°) =3(1-2)(1+2). 


We have f’(a) =0 <> a=+#l. So, f has two stationary points, +1. 





Figure to be 


inserted here. 





(a) (ii) At the point -1, f’ <0 to the left and f’>0 to the right. Hence, by the FDTE(d), 
—1 is a strict local minimum. 
At the point 1, f’ > 0 to the left and f’ <0 to the right. Hence, by the FDTE(b), 1 isa 
strict local maximum. 
(a) (iii) The function f has two turning points, +1 (these are the only points that are both 
stationary points and strict local extrema). 
(b)(i) The derivative of g is the function g’ : [0,27] > R defined by g’ (x) = cosa. 

mM 37 7 STU 


We have g’(a)=0 <> a= ok So, g has two stationary points, 3 and os, 





Figure to be 


inserted here. 





(b) (ii) At the point a. g' > 0 to the left and g’ < 0 to the right. Hence, by the FDTE(d), 


7 . . . 
3 is a strict local maximum. 


At the point oe g' > 0 to the left and g’ > 0 to the right. Hence, by the FDTE(d), = is 
a strict local minimum. 

(b) (iii) The function g has two turning points, . and = (these are the only points that 
are both stationary points and strict local extrema). 

(c)(i) The derivative of h is the function h’: R > R defined by h’ (x) = 2Qxe” . 


We have h'(a) =0 <=> a=0. So, h has one stationary point, 0. 
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Figure to be 


inserted here. 





(c)(ii) At the point 0, h’ <0 to the left and h’ > 0 to the right. Hence, by the FDTE(d), 
0 is a strict local minimum. 


(c) (iii) The function h has one turning point, 0 (this is the only point that is both a 
stationary point and a strict local extremum). 


A366(a)(i) The derivative of f is the function f’:.R > R defined by 
f' (@) =8a" 4 2n° = 62" =a? (82° + 22-6) = 0° (8a—6) (@ +1). 
The stationary points of f are given by 


f(a) =0 — a (8a-6)(a+1) =0 => a=0,-,-1. 


3 
So, f has three stationary points: 0, m1 and -1. 


Figure to be 


inserted here. 





(a)(ii) The second derivative of f is the function f” : R > R defined by f” (x) = 56x° + 
122° - 3027 = 2x4 (282? + 6x - 15). 


Evaluating f” at each of f’s three stationary points, we have 


f" (0) =0, 


(3) =2.5 2s- (2) +6-2-15] > 0, 
f""(-1) =2-(-1)* [28-(-1)’ +6-(-1) - 15] > 0. 


3 
Hence, by the SDTE, Fl and -1 are strict local minima of f. 


Unfortunately, the SDTE is inconclusive about 0. 


It turns out that 0 is a strict local maximum of f. We can justify this by simply pointing 
to the graph (this will suffice for your A-Level exams).°° 





6°6Tt is also possible to rigorously show that 0 is a strict local maximum of f, as we do now: 
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(a) (iii) The function f has three turning points (points that are both stationary and strict 


local extrema): -1, 0, and -. 
A 
(b)(i) The derivative of g is the function g’: (-2, | — R defined by 


g' (x) = sec? x. 


The stationary points of g are given by 
! = 2 4 
g' (a) =0 or seca =0. 


Tl 
But = is false for all ae (- *). Hence, g has no stationary points. 


Figure to be 


inserted here. 





(b) (ii) Not applicable, since g has no stationary points. 
(b) (iii) Since g has no stationary points, it has no turning points either. 


(c)(i) The derivative of h is the function h’ : [0,27] > R defined by h’ (x) = cosx - sina. 
We have 








7 DI 
i (aj)=0 — > cosa-sina =0 —> cosa = sina <=> a= Tia 
; Tl O7t 
So, h has two stationary points, ri and 7h. 
Figure to be 
inserted here. 
First, observe that f (0) =0. 
1 1 
Next, observe that if x ¢ (-— 0) U (0 a) then «° > 102" > 100°, so that 
2 2 68 68 61 
f (z) =a + pa! — a? <a + a! — 10g! = a — at < a8 - a =-— a <0= f (0). 
We’ve just shown that for all x that are “near” 0, we have f(x) < f(0). Hence, 0 is a strict local 


maximum. 
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(c)(ii) The second derivative of h is the function h” : [0,27] > R defined by h” (x) = 


—sin x —cosz. 
Evaluating h” at each of h’s two stationary points, we have 


ii v2 v2 _ /2<0 


" mu aes au == _ 
h (=) sins COs 7 5 5 
2 2 


7 
By the SDTE, — is a strict local maximum, while a is a strict local minimum. 


(c) (iii) The function h has two turning points (points that are both stationary and strict 


5 
local extrema): — and sil 
4 4 
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153.10. Ch. 96 Answers (Concavity) 
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153.11. Ch. 97 Answers (Inflexion Points) 
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153.12. Ch. 98 Answers (A Summary of Chapters 94, 95, 96, and 
97) 
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153.13. Ch. 99 Answers (More Techniques of Differentiation) 


ASG. At t=0, (z,y) = (0,0), At t=1, (gy) =@,0): 
dy dy da 4t?-1 


By implicit differentiation, — = — +— = ; 
ea dx dt dt 5t+1 











At? — 1 
Hence, /; has gradient Baa) = —1 and is described by y = —x. 
t=0 
And ly has gradient | ee and is described b O+- 5 (e- 2)=—-1 
ee b+i] 6 2 — a i 





The intersection point of J; and ly is given by this system of equations: 


L 2% 


ads 


—2 and y 5 


Y 
Solving, -x = #/2-loraz 3 2/3. Plugging 2 back into either + or 2, we have y = —2/3. 
Thus, the intersection point of J; and ly is (2/3, -2/3). 


A381. The base of a cone is a circle. You will doubtless recall the formula for the area of 
a circle: 


A=mnr’. 
Differentiate with respect to t: 
dA dr 
— = 2?7qr—_— 
dt dt 


Plug in 2 and 2 (from the last example): 


dr 3 i n\"3 
om] S| =an(2)" Sorgae=($) +100 
2 t=2 t=2 


-1 


dA 
dt 
t= 








The base area A is increasing at a rate of 1.02m?s 


A382(a) V = anh = 1. Rearranging, 


(b) By Pythagoras’ Theorem, | = Vr? + h? = \ [— a i. 


(c) S=nrl= m/f 3+ h? =m m+ “Va + 3th. 








ees 
(d) Compute 2 : ay, + 3rt). 
On 9.) 8.4. 3nh* % 
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7 1/3 
= 0 —> q= 75 +3n=0 ed n= (2) . 


- 6 1/3 
Hence, the only stationary point of S' (with respect to h) is at h = (=| # 1.24. 

















ds 

(e) Sign diagrams for p, q, and ie pq." 

+ 
>» %?P 

_ + 

eg 
- * ds 
ee 
dh 


By the First Derivative Test for Extrema (FDTE), S attains its strict local minimum at h. 
By the Increasing/Decreasing Test (IDT), S$ is strictly decreasing on (0,h) and strictly 
increasing on (h, 00). 


- 6 1/3 
Hence, S attains its strict global minimum at h = (- 





®7Explanation: p> 0 for all h € (0,0). 
6 1/3 6 1/3 
In (d), we found that g=0 <> h= (<) . Now observe also that if h < (=) , then q < 0. And if 


6 1/3 
n>(2) , then q> 0. 
Tt 


Since a = pq and p> 0 for all h, the sign diagram for is the same as that for q. 


dh 
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153.14. Ch. 100 Answers (More Fun with Your TI84) 
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153.15. Ch. 101 Answers (Power Series) 
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153.16. Ch. 102 Answers (Maclaurin Series) 


A390(a) We use the Four-Step Maclaurin Recipe: 
1. The derivatives of f are defined by 


f'(z) =n(1+a)"™, 
f" (az) =n(n-1)(1+2)"", 
f® (@) =n(n-1)(n-2)(1+2)"” 
( 


f(x) =n(n=1)(n—2)(n-3) (14a), 


f©) (2) =n(n-1)...(n-r4+1)(142)"" 

2. Evaluate each of f, f’, f”, f’”, etc. at 0: 

fO) 2040) 21, 

f'(0) =n(1+0)" =n, 

f" (0) =n(n-1)(1+0)” = n(n 1), 

f (0) =n(m-1) (n-2)(1+0)"" =n(n-1) (n-2), 

f (0) =n(n~1)(n-2) (n-3) (1+0)" =n(n-1) (n-2) (n-3), 

(OO) -nGis i)... op hs 0) Se 
3. The rth Maclaurin coefficient of f is 
FO (0) _n(m=1).. (m=r +1) 


rt 


r! r! 
4. The Maclaurin series of f is 
Lena + PAD pa Minn Teo Ao gg MN en re, % 
| ! ‘ie 


(b) We use the Four-Step Maclaurin Recipe: 
1. The first five derivatives of cos are defined by 
cos’ x = —sinz, cos” x = — cosa, cos) x = sina, cos) x = cos 2, 
cos) x = - sing. 


We observe a cycle after every four derivatives. And so,°°® 


COs x, for 7. =0,4.8... 255 

ay —sin gz, for 7=1,5.954.25 
cos\”? 7 = 

— cos 2, |e aes fam ol Reraeee 

Sin ®, fOr 33, Fe Ls aes 





68 Actually, we already did this in Exercise 358. 
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) 


2. Evaluate each of cos, cos’, cos”, cos”, etc. at 0: 
tL, for 77 =U, A, Bees 
0 f 21,5, 9 is.08 
cos(”) 0= ’ or 7 9494) ’ 
—1, fOr WS 20 10h see, 
0, for W347, 1 e wees 
3. The nth Maclaurin coefficient of cos is 
1/n!, for 7 = "0,4, 85.024 
_ cos’) (0) _ }0/n! = 0, for #= 1,5, 9)... 
nl! -1/nl, for n= 2,6,10,... 
0/n! = 0, fOr WH 8, (, LU onds 
4. The Maclaurin series of cos is 
2 of 
[ale 
2) Al 
(c) We use the Four-Step Maclaurin Recipe: 
1. The derivatives of g are defined by 
1 
/ 
g (2) ~ 1 +a 
Gg’ v = 
Oe 
g® (a) = 3: 
ee 
(4) ped 
g t , 
7) (1 +2)" 
n-1 
Oia.) ea 


2. Evaluate each of g, g’, g”, g’”, etc. at 0: 
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g(0) =In(1 +0) =0, 

0) sa 

"eas “i 

OO = Gop . 

9 O)=— 7 = -3! 

(0) UD, (-1)"7 (n= 1) 


3. The nth Maclaurin coefficient of g is 


90) _ (CY =)! ey" 


_ 
n! n! n 


4. The Maclaurin series of g is 


A392. In Example 1281, we showed that for any né R, 








n(n-1) 9 n(n-1)(n-2) 3 for all x € (-1,1) 


(l+a) =l+nax+ mI 3 


And so, in particular, for n = -1, we have 





1 2(2-1 2(2-1)(2-2 
ep ee CS = 14 2aferailde(+1,1) 
(1 +2) 2! 3! 


That is, f can be represented by the power series just given. 
oe 


(1+ v)~ = 


A394(a) The Maclaurin series of exp is M (x) =1+a+ a 4 aI +... 
So, the first four Maclaurin polynomials are 
Mo (a) = 
M,(2)=1l+z 
x 
Mo(“)=1+a+ oT 
2 a8 
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Figure to be 


inserted here. 











“i ~1)(n-2)23 

(b) The Maclaurin series of f is M(x) =1+na + mn + mnie os 
So, the first four Maclaurin polynomials are 
Mo Ci) =] 
M(x) =1+nz 

-l 
M2 (x) = Lene + MD 92 

-1 ~1)(n-2)23 
M3(r)=1+nx+ ae ) ge oD in )a 


Figure to be 
inserted here. 





2 
(c) The Maclaurin series of cos is M (x) =1- BAe hy: 


So, the first four Maclaurin polynomials are 


Mo (x) = 
M (a) 
My (x) =1-5 
My (2) =1-5 


Figure to be 


inserted here. 
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2 
The small-angle approximation for cosine is cosx » 1 - — 




















2! 
A395(a)(i) sin 2x = 2x - ch + a —... for all 2x € Ror zeR. 
(a) (ii) sin (2? + 1) =¢°41= ian) + ioe) —... for allaz?+1leRorzeR. 
(a) (iii) sin (sin) = sinx - ee + ss —... for all singe Ror zveR. 
(a) (iv) sin = Z - oie) + sy 304 Or all Zs eRorzeR. 





POD (aay? 4 BIH?) (o5)8 4... for all 22 € (-1,1) 


(b) (i) (1+ 22)" =1+2nxr + 3I 


11 
ea) 
(b) (ii) (1 +22 +1)" = (2+22)" 4 14n(2? + jad (0? +1)? 2 J 7) (92 4:1)%4 


... for all 2? +1 € (-1,1) or 2 € (-2,0) or x € g. The interval of convergence is empty. 





That is, + holds for no values of z. 


ii _ -1)(n-2 
(b) (iii) (1 + sin x) 21+nsina+ MOD snot mn DO) snes ... for all sing € 
(-1,1) or x + k7t/2 for any integer k. That is, 2 holds for all real numbers x except those 
that can be written in the form k7t/2 for some integer k. 


(b) (iv) (1 + ay 2 ene nO ) ane (ne 2) ee forall Z eel 
or x € (-5,5) or re (-¥5, v5). 




















) cos2n = 1-22), 22)" 
(c)(i) cos2x =1- a gy for all 2x ¢ Ror veR. 
rae ee We 
(c) (ii) cos(x?+1)21-E A er for all z?+1leRorzeR. 
. 2 - 4 

(c) (iii) cos (sinx) = 1- S = — ~_... for allsine ¢Ror «eR. 

3 a5 \7 3/5\" 3 
(c) (iv) se ad [5) 7@ [5) ~... forall eRorzeR. 

5 2! 5! 5 
A396(a) From the fifth “standard” Maclaurin series, 

8 
In(L+2)=e-S4+ oH... for all x € (-1,1]. 


Substituting “x” with “x -1” to get 





2 3 
= = 
In(l+a2-1)=Inr=2-1- & ; ) + pbs + 1e(-1.1] or x € (0, 2]. 
2 3 
(e-1) (@-1) 
2 B 
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We’ve just shown that f can be represented by the power series x-1- 





2 3 
That is, i (ejeheten—1- eet) + wet) for. all x € (0,2] = Domain f. 


A397. From the first “standard” Maclaurin series, 





=l-gx+27°-2+..., for all x € (-1,1). 
Lab 


Substitute “x” with “2x?” to get 


1 2 3 
12972 = 1-227 + (22) — (2x7) +e-= 1-297? + 4x4 - 87% 4+... 


2/2 
for all 2x? € (-1,1) or x? « (-; 5) or ©E (-S 2). 


2 2 2 
2 f2 
Hence, g has domain (S 2) and can be represented by the above power series. 


A398(a) We know that sin and exp can be represented by power series. So, by Theorem 
57, f =sin-exp can also be represented by a power series. 


(b) From the “standard” series, we know that sin and exp can be represented by 


20 


d 1 : ay + tivel 
or ae 0 ; 
£ 31 an £ aT , respectively 
3 9 3 
. e Sane 
Write (2- s+... \(leee Sate... |acot ce togn + oye? +... 


By observation, cy = 0, cy; = 1, c2 = 1, and cs = -1/6 + 1/2 = 1/3. 
Hence, f can be represented by x+ x27 + 2°/3+.... af 


(c) In (a), we reasoned that f can be represented by a power series. By Theorem 38 
then, f can be represented by its Maclaurin series, which we now find using, as usual, the 
Four-Step Maclaurin Recipe: 


1. The first three derivatives of f are defined by 


f' (a) = cosxexpx +sinxexpxz = expxz (sinx+cosz), 
f" (2) = exp (sin x + cos) + exp x (cosx - sinx) = 2expx cosa, 
f' (x) =2expz (cosz-sinz), 


2. f (0) =0, f'(0) =1, f" (0) =2, and f”" (0) =2. 

3. The first four Maclaurin coefficients of f are 0, 1, 2/2 =1, and 2/6 = 1/3. 
4. Hence, the Maclaurin series of f is +2? +2°/3+.... 

(d) Yes. 


A399(a) Let f : (-1,1) > R be the function defined by f (x) = In(1+2). We know that 
cos and f can be represented by power series. So, by Theorem 57, g = cos-f can also be 
represented by a power series. 
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2 


. 
(b) From the standard series, we know that sin and exp can be represented by 1 - oT 


eo 
(ee 
and x a 


x ge og? 
2 3 
Write (1-4... )(e- S45 -...)-otaetar + (30° +.... 


By observation, co = 0, c; = 1, cp = -1/2, and c3 = -1/2 + 1/3 = -1/6. 
Hence, g can be represented by x - x?/2-2°/6+.... % 


(c) In (a), we reasoned that g can be represented by a power series. By Theorem 38 then, 
g can be represented by its Maclaurin series, which we now find, as usual, the Four-Step 
Maclaurin Recipe: 


1. Step 1. The first three derivatives of g are defined by 


g (x) =-sinaln(1+2)+ _— 





sinx sinx cos © 
uf == ] 1+ =- 
g’ (x) =-cosaln(1+-2) ies lee Gea? 





g(x) =sinzln(1+2) - 

















COs x ( COs x sin x sin x 2cosx 


- - s|+ s+ 7 
l+z 1+ (1+2) (l+az) (1+z) 


2. Step 2. g(0) =0, g’ (0) =1, g” (0) = -1, and g” (0) =-1. 

3. Step 3. The first four Maclaurin coefficients of g are 0, 1, -1/2, and -1/6 = 1/3. 
4. Step 4. Hence, the Maclaurin series of g is x — x?/2-23/6+.... 

(d) Yes. 

A400. As usual, we use the Four-Step Maclaurin Recipe: 

1. The first three derivatives of h are defined by 








h! (xr) (-2)(-1) 1 1 -1-4¢ n(a)( D Ax ) 


(1—a)? 14222" (1-2)? (14222)? - l-x 14272 








h(a) =H (a)( 2 Ax 16a? 4 [-Ge) 


) +h (a) a 
— Xx ———$$— ————— ye 
l-x 1422? (l-x) (14222)? 1+ 22? 


_ 2h (x) hi (x) hl" (x) ~ [hi (x) Jb! (x) 
[h(x] 
2. Evaluate each of h, h’, h”, and h’” at 0: 


2 1622 4 
AO (x) 6x | (x 


+o) ° (1 +222)? (1+ 22? 
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1 1 
iS se itt 
(0) (20) Le2-e 











acs, OF 2 16 - 0? 4 ] 2 - 
OER [aap wen ay Tore] TOA? 

m (gy = 2h) h! (OY AY (0) ~ [h’ (OT h'(0) |, 2 wo 4 7), 
h (0) a [h(0)]? h ola (1+2-02)" 142-02 h(' 


_2+1-2-2-22.9 


; +2-[2+0-4]+1[4-0+0]=0-4+4=0. 


3. The Oth, Ist, 2nd, and 3rd Maclaurin coefficients of h are 











_Ah(O) 1 A" (0O) 2 
i ae eae ae a ae 

h'(0) 2 nl" (0) 0| 
i ale | as ay ge 


4. The Maclaurin series of h is 
M (2) =14+22+27+02' +... 


Conclude: By Theorem 38, h can be represented by the power series 1 + 2x + ge Uae 


= 14+ 2x + afox Ort. (“F, ra = Domain h. 


1 


1 
That i h SS 
aia (1-2)? 1+ 22? 


This finding is the same as the conclusion in Example 1298. 


A401. We know that sin and cos can be represented by power series. So, by Theorem 57, 
f = sin-cos can also be represented by a power series. Hence, by Theorem 38, f can be 
represented by its Maclaurin series, which we now find: 


1 
Recall that sin 2x = 2sinxcosxz. So, f (x) = 5 sin 2x. Four-Step Maclaurin Recipe: 
1. The first three derivatives of f are defined by f’(x) = cos2z, f" (x) = -2sin2z, and 
i" (2) =—4008 27, 
2. We have f (0) =0, f’(0) =1, f” (0) =0, and f’” (0) = -4. 
3. The first four Maclaurin coefficients of f are 0, 1, 0, and —4/6 = -2/3. 
4. The Maclaurin series of f is z-223/3+.... * 
That is, f can be represented by x —22°/3+.... 
Our finding agrees with the conclusion in Example 1299. 


A402. As usual, we use the Four-Step Maclaurin Recipe: 
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1. The first four derivatives of tan are defined by ® 


tan’ x = sec? x, 


tan” © = 2sec? x tana, 

tan’ ¢ = A4sec? x tan’ x + 2sec! x, 
tan” x = 8sec? x tan? x + 8sec! x tan x + 8sec! x tanzx = 8sec? rtan® xr + 16sec! x tanz. 
tan) © = 16sec? x tan* x + 24sec! r tan? x + 64sec! x tan? x + 16sec! x. 


2. Observe that secO = 1 and tan0 =0. So, 


tan0 =0, fon Oe ao: 
tan’0 = 1, tan) 0 =0, 
tan”’0 = 0, tan©) 0 = 16. 


3. The first six Maclaurin coefficients of tan are 0, 1, 0, 2/3! = 1/3, 0, and 16/5! = 2/15. 


A. The Maclaurin series of tan is 


The small-angle approximation for tan is tanz * x. 
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153.17. Ch. 103 Answers (Antidifferentiation) 


A404(a) Three antiderivatives of f are the functions g,h,i: IR > R defined by 
i is | is 
g(a) = 58 — 32, h(x) = 5% — 3x +1, i(z)= 52 — 3x42. 


(Your answers may be different.) 
(b) There exist constants C),C2,C3 € R such that 


A(x) =g9(4)+C, =h(2)+Co=i(4)+Cs, for every x ER. 


A405(a) <4 (w= = (—cos 4x) = 4sin 4x = f(z). 


d 

—h(x) = 8(2sinxcos*x-2sin® xcosx) = 16sin x cos x (cos? 
Asin4ax = f (x). 

Hence, each of g and h has derivative f. Equivalently, each of g and h is an antiderivative 
of f. 

(b) Although the functions g and h seem very different, they actually differ by only a 
constant (namely 1), as we now show: 


x - sin’ x) = 8sin2z7cos2x = 


h(x) = 8sin? x cos? x = 2(2sinx cos 2x) (2sinz cos x) = 2sin? 2x 
=1-(1-2sin? 2x) = 1-cos4r = g(x) +1. 


So no, this is consistent with and does not contradict Fact 214. 
d 

A407(a) ak? = k (Power and Constant Factor Rules). 
os 


d kt ok 


(b) oe 


a* (Power and Constant Factor Rules). 








(d) = exp x = exp (Exponential Rule). 
2 
d 
(e) qa C8 = sin (Cosine Rule). 
i 
(f) = sinx = cosx (Sine Rule). 


(g) = (F+G)= -F + =¢ = f +g (Sum and Difference Rules). 
d 


(h) <KF = kT F =kf (Constant Factor Rule). 


1 u) 1 
(i) <P (ax +b) = _ [F' (ax +b)] = =i (ax+b)-a = f(ax+b) (Constant Factor and 
Chain Rules). 
A408. Consider an antidifferentiable function defined on an interval. 


(b) If that function is defined by x + x*,°® then its antiderivatives are exactly those 





659 Also assuming k + —1 and the function’s domain does not contain 0 if k < 0. 
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functions defined by 2+ x**!/(k+1)+C (for C eR). 


(c) If that function is defined by x + 1/z, then its antiderivatives are exactly those functions 
defined by x In|x|+C (for C eR). 


(d) If that function is defined by x + expz, then its antiderivatives are exactly those 
functions defined by 7 + expx+C (for Ce R). 


(e) If that function is defined by x + sinz, then its antiderivatives are exactly those 
functions defined by 7» -cosx+C (for Ce R). 


(f) If that function is defined by 2 + cosa, then its antiderivatives are exactly those 
functions defined by 7 & sina + C' (for Ce R). 


(g) If that function is defined by x + (f +g) (x), then its antiderivatives are exactly those 
functions defined by x + (F'+G) (ax) +C (for Ce R). 


(h) If that function is defined by x & kf (x), then its antiderivatives are exactly those 
functions defined by 1+ kF' (x) +C (for C eR). 


(i) If that function is defined by «+ f (ax+b), then its antiderivatives are exactly those 
i 

functions defined by 1» —F' (ax +b)+C (for C eR). 
a 


1 
A409(a) fw +bda = 50a? + be +C. 
il i! 
(b) faa? + bx + ede = <a0* + ba? +00 +C. 
ee. Lt pee, | 2 
(c) i + bx +ex+ddz = 7ax + abs + 5cu +d+C. 


c i ctl 
(a) f (av+b) sr a +C. 





1 1 
(e) f sae == Infar +040. 


(f) f asin (be +) + dd = ~* cos (bu +) + de, 
(g) J cexp (bar + 0) + dda =F exp (ba +) + dz. 


a 


1 
(h) f acosba +0+ — de = 5 


1 
sin be + cx + = In|a| +C. 
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153.18. Ch. 104 Answers (Integration) 


A412. The heights of our eight rectangles are 


f() =124+1=2, f(9) =9?+1=82, 

f (3) =3°+1=10, #1) S11? +1122, 
f() =o +1226, f 13) 13°41 =170, 
iO) =F +l=50, f (15) = 157+ 1 = 226. 


The areas of our eight rectangles are 


Agi = 2x f (1) =4, Ags = 2x f (9) = 164, 
Ag 9% f GY =20: Asp 9% f C2044, 
Ae 20 F (5) S50: Agr = 2x f (13) = 340, 
Ae 295 (= 100, Agg = 2% f (15) = 452. 


Figure to be 


inserted here. 





The total area of our eight rectangles is 


8 
Ag = 9) Agi = 4+ 10 + 26 + 50 + 82 + 122 + 170 + 226 = 1376. 


i=1 


i! 
=5.. 
3 


Conclude: This fourth approximation is an improvement over the previous three approxi- 
mations. 


b b 
A413(a) By the Constant Rule, [ h=(b-a)d and [ i=(b-a)e. 


16 
The approximation error is €3 = | if f -— As 
0 





= h 3815 - 1376 





b b b 
(b) By Comparison Rule I, we have f[ h< [ Ts [ 1: 


b 

(c) Hence, (b-a)d< [ f<(b-a)e. 

A414. Following the hint, we define h: [a,b] > R by h(x) = 0. By the Constant Rule, 
b b b 

7 h=0. Since f >h=0 on [a,b], by Comparison Rule I, f[ fe 1, =O. 


A415. The Constant Factor Rule of Antidifferentiation says that if a function f has 
antiderivative [ f, then the function kf will have antiderivative k - fF 
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The Constant Factor Rule of Integration says instead that if the area under the graph of f 
between a and 6 is S, then the area under the graph of kf between a and 6 is kS. 


These two Rules are completely different. Once again, here we may repeat our warning 
about the difference between antidifferentiation and integration: A priori, these two Con- 
stant Factor Rules have absolutely no relationship with each other. One is concerned with 
antiderivatives, while the other is concerned with finding the area under a curve. That they 
are in fact related is established only in the next chapter by the two FTCs. 


(Similar remarks also apply to the Sum and Difference Rules of Antidifferentiation vs In- 
tegration.) 


1931, Contents www.EconsPhDTutor.com 


153.19. Ch. 105 Answers (The Fundamental Theorems of Calculus) 
A417(a) A function. Specifically, h : [-2,5] > R is the function defined by h (x) = [ ie 


3 5 0 2 = 
O)AB)= f f-6,AO)= | PaMmrO= f fa- J, f=Bend-2)= jf F- 
-f fox. 

= 
3 
(c) A real number. Specifically, 7 = i, {=i (a) =6: 


(d) This is a trick question. Since 7 is a real number, “7 evaluated at 3” (or 5 or 0 or -2) 
is meaningless. 


Figure to be 


inserted here. 





A418(a) A function. Specifically, g : [a,b] + R is the function defined by g (x) = [ f. 
(b) The FTC1 says that g’ = f. 


(c) Hence, we have shown that f has an antiderivative, namely g. 
A419(a) The FTC1 says that h is an antiderivative of f. 


(b) Since g is also an antiderivative of f, by Corollary 47, there exists C ¢ R such that 
h(x) =g(x)+C for all xe [a, d]. 


(c) 9(0)~9(a) = [9(0) +C]-[a(a) +0] =h()-n(a)= for f" r= fo r-o- fr 
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153.20. Ch. 106 Answers (More Techniques of Antidifferentiation) 




















1 1 i 4 1 
A423 {f= - { sa a C= C. 
?) | Gate (Qn-12—22n-1 4 ° 
1 1 11 1 
b (=! - { —a —— Cau C: ® 
>) | Ge sope as” (QQn+5)2 33045 Oe415 
A424(a) Observe that 52? — 2x -3 = (54 +3) (2-1). So write 
1 _ A : B A(ax-1)+B(5r+3) (At+5B)r+3B-A 
5e2?-22-3 5a+3 2-1 (5¢+3)(z-1) 5a?-27-3 


Comparing coefficients, A+ 5B +0 and 3B-A21. Fom=+ 2 we get 8B =lor B 3 1/8. 
Plug 2 into 4 to get A = —5/8. Hence, 


i 1 x= f -5/8 1/8 a4 
5a 25-3" 3 x-1 











—5/8 1/8 
l 
aa 70 (Sum Rule) 
5 
F; l 
“9 | Bea —4 ue fo (Constant Factor Rule) 


ol 
=— oe In [5x + 3] + 7 ae jx -1]+C (Reciprocal and LPC Rules) 


(—In|5a +3} +In|ja-1|)+C (Factorise) 





1 

8 

1 | 
=. In = : i (Law of Logarithm) 

1 

8 














G = 
Fact 1 
nie alt (Fact 10) 
(b) Observe that 2? - a? = (x +a) (x-a). So write 
1 A B  A(w-a)+B(ata) (A+B)r+(B- A)a 
a—a? eta £2-a  (x£+a)(x-a) | x? - a? 


Comparing coefficients, A+ B = 0 and (B-A)a=lorB-A 7 1/a. From = +2 we get 
2B =1/a or B21/(2a). Plug 2 into = to get A =-1/ (2a). Hence, 
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J 
te 2a ( Tew) * 9a (a—a) a 


=f mw faa” (Sum Rule) 


ee [ : dx + = ri dx (Constant Factor Rule) 














203 £+0 203 2d 
1 i 
= -—I|n|x+a\/+—In|x-al+C (Reciprocal Rule) 
2a 2a 
i 
o (In |x - a] -In|x+al)+C (Factorise) 
a 
Aigo (Law of Logarithm) 
lz +a 
1 = 
es i | (Fact 10) 
20 (eee 








(c) Instead of slaving through all the steps again, we can simply make use of (b): 


1 ih (b) l i 
| aaate=- f aaa 5pm 


-C=—In 
Note that in the last step, we replaced “-C” with “+C”, where C = -C. 


2a 
7x +2 Tx +2 7 22-1 we 
A425 i d - {a if d 
(a) de? —Ag+1 (2x -1)° " 2(2n-1) (2x = 24 5) Oped 


w-a r+a 


A 


+C. 




















w+ a v—-a 























ial 1 =i5 Ly 1 1 t 11 
— —_ dx a es 1 —— C= —In ee =1 C. 
(2x —1)? BROS a 2 ( ae pe thy 
& 
(b) In Example 1358, we already showed that 
7 ah d ar ee saa 
=. Sao eee LS n ‘ 
= 6° POT 2) ve 
So, 
7£4+2 7£ +2 r+3 19 1 
Lae, d [——=S oe -7 f —a 
eerie (2@+3)(2-2)”” J ‘(2+3)@-2) (2+3)(@-2) = ae 





@1TIn|x-2|- =m — 
x 





7 “Cz = (16 Info -2 +19In|2+3/)+C. 
Note that the last step is nice but not necessary (you should however be perfectly capable 


of doing this bit of algebra). 


(c) Below, @2 uses our answer from Exercise 424(a). 
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12 gf Trdg, fp o,f 
5a? -2x-3 (52 +3)(a-1) (5a+3)(a-1) (54+3)(a-1) 5a 
tee 
5x2 +3 





f 9 
2 In|5a +3] + gin 





al 
[+ C= Tin [ox+3|+ 2 In| -1]+C. 


Ditto the last sentence in (b). ® 


d d t 
A426(e) If seca > 0, then — In|sec z| = —Insec x = —et = tang. 
dx dx sec & 


—secxtanz 


d 
If seca < 0, then qm |sec x| = =m (—sec x) = = tae, 


Xv — sec zr 


(Note: The requirement that x is not an odd multiple of 7/2 ensures that secx and tan x 
are defined.) 


d d 
(f) If sinz > 0, then — In|sin z| = — Insinz = ial = Colm. 
dx da sin x 


d d — COS Z 
If sinz < 0, then — In|sin z| = — In (-sinx) = ——— =cotz. 
dx dx SI 
(Note: The requirement that x« not a multiple of 7 ensures that sinx #0 and In|sin z| and 


cot x are defined.) 





(g) If cosecax + cot x > 0, then 


2 


d d —cosecx cot x — cosec*x 
— (-In|cosecx + cot x|) = — (-In|cosecx + cot x|) = _——A—£] 
dz dx cosecz + cot % 
cosecx cot + cosec?x cosecx (cot x + cosecxr) 
= =. + = COSC. 


cosecx + cot x cosecz + cot x 
If cosecx + cot x < 0, then 


d d cosecx cot x + cosec?x 
— (-In|cosecx + cot x|) = an [—1n (-cosecx - cot x) |] = -———A——£ 
z 


dx —cosecx — cot x 


cosecx cot x + cosec?x cosecx (cot x + cosecxr) 
= $$ = = cose. 
cosecr + cot x cosecx + cot x 


(Note: The requirement that x not a multiple of 7 ensures that sin x # 0 and cosecx+cot x # 
0; and that cosecx, cot x, and In |cosecx + cot x| are defined.) 
(h) If seca +tanz > 0, then 


d d sec x tan x + sec? © 
— |n|sec x + tan x| = — In (sec x + tan x) = ————__ 
dx dx secx+tanz 





sec x (tanz + sec zx) 
= = sec @. 
seca + tan x 


If seca + tan x < 0, then 
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d d —secxtan x — sec? © 
— |n|sec x + tan z| = — In (-sec x - tan x) = ————\£{ 
dx —cosecx — cot & 


_ sec ax (tan x + sec x) 





=secy@. 
secx+tanz 


(Note: The requirement that 2 not an odd multiple of 7/2 ensures that cosx # 0 and 
seca +tanx #0; and that seca, tanz, and In|sec x + tan 2| are defined) 


A428(a) Complete the square: 


“ae? 4046- tere] 3[2-(e-2)]]- al : (--4) } 
-( 


So, => i ea re aE 
V-322 +246 ( 





= as in-t a 1/6 
3 J/73/6 





(b) Complete the square: 
2 
1 2 57 1\ V57 1\ 
-Tx* -2+2= 7|- =5 4 oa +o] 7| ae - (+3) || -(«+=) | 


i Ss" I [ £2) 1 ta 
vi (“t) - (w+ 34) 
La e+t/l4 | 
Jin Teifia 
1. _,l4e4+1 
ae Tei + 
A429(b) Use the identity cos 2x = 2cos? x - 1: 


1 2 1 in 2 
f cosade= f —“de= 50+ sc. 


So, 

















d 
(c) Use (i) the identity tan? x = sec? x — 1; and (ii) ae tan = sec? or [ sec? = tan: 
is 
[ tan?xae = [ sePa- ldz=tanz—2+C. 
(e) As stated on List MF26 (p. 3), 


cos P - cosQ@1 - 2sin 
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_P4+Q P-Q 
2 2 7 
Hence, PO2(m+t+n)x and Q@3(m-n) x. 





So, let Mx 


Plug in @1, @2, and @3: 


i! 
[ sinme sinnede = [ —[cos (m+n) ~cos (m—n)2] dy 





5 et | 


2 m-n m+n 


(f) As stated on List MF 26 (p. 3), 








ie P- 
cos P + cos Q@12 cos ; @ cos — @ 
So, let c= 77 and ne = eS 
2 2 
Hence, P@2(m+n)x and QO3(m-n) zx. 


Plug in @1, @2, and @3: 


1 
J cosma cosnade = f 5 [cos (m+n) .x + cos (m-n) «1 da 


2 


LT si 7 
-;("" De, einivel ec 
m-n m+n 


A430(a) By dETAIL, we should choose v’ = e* (and u = 2x?), so that v =e" (and u! = 32”): 


1 
xe* dx = xe" — [ 3r7e" dr. 


From Example 1371, [ ve® dr 2 Ca — 20+ 2). Plug 2 into + to get 


f wet da = ae” - 30 (0? - 20 +2) +C =e" (2? - 327 + 62-6) +C. ® 


(b) We’ll use IBP twice. 


By dETAIL, we should first choose vj = sinz (and u; = 27), so that vy = —cosax (and 
tc = 2e): 


f esinadx =a? (-cose)- f 2x-(-cos2) dx+-x?cosx+2 f xcosede. 


Now find [ xcosx by applying IBP a second time, this time choosing vs = cosa (and 


ug = £), so that v2 = sin (and us = 1): 
; ; 2, 
f vcosnde =asine- f 1-sinede 2 xsina + cosa. 


Plugging @2 into @1, we get 
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1 
f esinndx 4-2? cosx+2 f acosede = -2? cose +2(asinx + cosx) +C, ® 


i 
A431. The error is in Step 5—we cannot simply subtract [ — dz from both sides. The LHS 
e 


1 1 

“ | —da” may differ from the RHS “ [ —dz” by up to a constant (because antiderivatives 
- x 

may differ by up to a constant). Indeed, in this case, they differ by 1. 

See the more detailed discussions in Remark 156 and Ch. 103.6. 


21 
A432. This time, Step 5 is perfectly legitimate. Any definite integral, such as f — dz, is 
i ae 
al 
simply a number. It is therefore perfectly legitimate to subtract f —dz from both sides 
1 20 


of an equation. 


This time, the error is in the last equation of Step 3: 
2 


2 
{i+ fac] -14 f a # 
x 1 1 2 


The above equation is false. We should instead have 


21 21 21 
+f —da=1-1+ f —da= —dz. 
r i. 26 Ls 1 2 


2 


he fall fa 


= 
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153.21. Ch. 106 Answers (The Substitution Rule) 


A437(a) Observe that x (x*- cos) = 327 +sinz. So, 

3x? + si 

7 a de = In [a — cos 2| +C, 

x? — COS ZT 

(b) Observe that x (sin? r+ 1) = 2sinzcosz =sin2z. So, 
in2 
[ de = In|sin? x + 1| +C =In(sin?z+1)+C. 
sin* x +1 

(In the very last step, we can remove the absolute value operator because sin?7+1> 0.) 
(c) Observe that x (52? + sinz) = 10x + cos. So, 


10x + cos x : 
So = In [52° + sin 2| +C. 
5e*+sin zr 


(d) Observe that x(a? +2?+2+1) =32?+2r+1. So, 


322 +2xr4+1 
[(_— . dx =In|z?+2?+24+1/+C. 
e+aur2t+ar+1 


COS & 
(e) Observe that cot x = 
sin 





d 
and —sinz =cosz. So, 
dx 


f cotade= | >" ar - In |sina| + C. 
sin 


secx+tanz sec? x +secx tan x 
(f) J secwde = [ seca a= fe, 


secxr+tanx secx+tan x 


2 


Observe that x (seca + tan) = sec’ x+secxtanz. So, 


2 

sec* x +secxtanx 

[secrdx= [ dz = In|secx + tana|+C-. 
secxt+tanz 





A440. As in the above example, we'll use the (1) Times One Trick; (2) IBP; and the (3) 
Substitution Rule: 


(a) [so e= sft. sin"! da 2 sin! x - 2rsint r+ V1-22+C. ® 


rer 





=“ 
(b) f costade+ [ 1-costede 2 xcosta- fax. <xcos'!x-V1-22+C. ® 
V1-2? 


A441(a) Five-Step Substitution Rule Recipe: 


dx 
1. Compute — + secu tan u. 
du 
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i: 


2. [= 


dx ® f ——___ ras a 
sec? u —— -1 
= /——=>= ia 
sec? uv tan? u 
1 
- | =—— ar (Va? = al) 
sec? u|tan u| 


1 1 
- {| =—— a we (0,2 ==>: tang 20 
secs u tan u 2 








1 du du 
- f —“az= f cosu de 
du 
3. = f cosuta= f cosudu 
dé 


4. =sinut+Cy 


i = sin (sec! x) +C}. 


(b) Five-Step Substitution Rule Recipe: 


du 9 2 dv ox 








1. Compute a or ao 
1 1 da du 
2, {= « - —_.. — — dz 
v2 x2 — 1 2/72 — 1 dudx 
2 = ae = f ee 
OS gh/e2-12de~ J 2/e?-1 de 
b I des 
= | ———dz 
2.,/udz 
3 1 a ag 
) Daas a 
4. = u+C> 
pb Va“2-1 
5: = C. 
x se @ 


-] 
for x € (1,00) may differ by only a constant: 





(c) By Corollary 47, sin (sec! z) and 


sin (sec! x) = 





+ Cs, for some C3 € R. 


Following the Hint, plug in x = 2 to get 
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V 22-1 
sin (sec! 2) = 6ifi a v3 and + C3 = v3 + C3 
5 2 Z Z 
2] 
Hence, C3 = 0 and sin (sec™! a) a 
a 
A442(a) Five-Step Substitution Rule Recipe: 
d 
1. Compute ne Oe. 
dy 
x3 é ha il i du 4 u-1 du 
2: —— = = 
I (x2 +1)” a J us/? 2d uel? a 2 sof u3/? rs 
u-1 ay 1 fu-l1 il 1 1 
7 af eae af Ge sf oe 
1 
4. +—=+C 
U Ji 1 
5. = Va24+14+ os 6:0 





xt) 
(b) Five-Step Substitution Rule Recipe: 


2 


d 
1. Compute 2 sec? uy, 
du 


tan? u 


dx 





Fe 
2: ! wane da 


tan? u 


J 


7 
“| 
“| 
: 


COS U 
1 


tan? u du 


secu dx 





tan? u du 


sec U dé 


sin? u 








cos? u 


cos? u 





+ COS U 


Io 


(c) By Corollary 47, 





1 
+V224+1 and 
Varr24+1 
a constant: 
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J (tan? ut i 


(sec? uy? 


sin? U 
Uz 
COS* U 


1 -— cos? u 


cos (tan! x) 


COS (tan! a) 


tan? u 


=f 


sec? u 


tan? u 





sec U 





sin udu 


sin u : 
—sinudu 





sin udu = [ 


a Cs 


COS” U 


+ COs (tan! 2) +O. 
+ COS (tan a) may differ by only 
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i 1 
+Var2+1= a (tan! x) +C3, for some C3€R. 
e241 cos (tan™ a) 





Following Hint 1, plug in x = 0 to get 





d, 
+V¥024+1=2 and 
07 +1 


1 
+ cos(tan™!0) + C3 = —  +cos0+ C3 =2+Cs. 


: 


1 
cos (tan! 0) 


1 
+ Vx? + 1 = —_——— + cos(tan™ 2) (for all x € R). 


Hence, C3 = 0 and = 
° x? +1 cos (tan! «) 
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153.22. Ch. 109 Answers (More Definite Integrals) 


A444, The requested area is labelled A in the following figure: 


Figure to be 


inserted here. 





Method 1. 4+ B+C+D=2'8 x2 = 24/3, 
B+Ceza1x1le=l. 





42 o/s 4 
a | = . Hence, 





A= 24/3 _ ics = 3 (24/31). 
A A 





Method 2. y=2°> <> r=y'/. So, 


=. pe | fr apsa. — 3p aysz2 — 3 (54/3 _ 
Az fo ndy= [yay => [y48], = = (24-1). 


A445. The points at which the curve and line intersect are given by 


Ot 


T™ OT 
ee 0 _7 om 
sing = 5, or x € (0,71) — iG ro 


Hence, the requested area is 


57/6 1 x 57/6 
[ sina - 5 de =|-cosn- 5] 
7/6 2 2 


Figure to be 
inserted here. 
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A446. The two curves intersect at 


2=97 =97 +1 =—> 1-227 =0 — v= +— = 42°? 


So, the requested area is 


Figure to be 
inserted here. 





A447. The curve intersects the x-axis at 


xc*-16=0 — v= +2. 
2 5 2 732 my 256 
Compute [ x* — 16 dx = = - 16e = = _ 32| ~ [= + 32] =-—_., 
= 


—2 


Hence the requested area is 256/5. 


Figure to be 


inserted here. 





A448. The line y = 7 intersects C at 7=#?-1 or 8=t? or t=2. 
Similarly, the line y = 26 intersects C at 26 = #?-1 or 27 =? or t=3. 


So the requested area is 


1944, Contents www.EconsPhDTutor.com 





26 
= f[ ( aie 2t) ae y (Times One Trick) 
Y 
26 dt d 
= f (¢? + 2t) at dy (+ - 31°] 
Y 
v8 dt 
i (? 1 2t) ob Ww wy (Substitution Rule) 
y=7 
t=3 
= [ ce + 2t) Bt dé (Change of limits) 
t=2 
3 
= i 3t* + 6t° dt 
2 
3 
_ Fa 7 st] _ E : = : = ; 5 | _ 633 - 295 ~9741. 
o 2 Jo 5 2 5 2 ie 2 





A449, Use Fact 17(a): 


7 7 1 in2xr 1" 72 
f mde =f msintade=n| 50-2] = 2. 





A450(a) f Wcosudu = wsinu- f Qusinudu = wsinu-2|-ucosu- f -cosudu| = 
u’sinu + 2ucosu-2sinu+ C. 


(b) Five-Step Substitution Rule Recipe: 


d 
1. Compute oF 2 cosu. 
du 





669 Again, this is not on List MF26, so this is something you’ll have to either mug or know how to derive 
during the exam. 
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2 J (sity dy= f way 


pe, 
du dy 


2 [ u’ cos aoe dy 
dy 


du 
Oe = [ weosuS = f wcosudu 
ag! 
4, sin Oncos aC 
oO. = (sin y) sin (sin y) +2 (sin y) cos (sin y) — 2sin (sin y) +C 


= (sin! y) y+2(sin y)cos(sin"! y) -2y+C 


=y(siny) +2 1-y?sin ty -2y+C 
(c) 


2 
fo reay =f n(siny)? dy fy (sinty) +2 T= Psinty-2y], =n (2-2). 
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153.23. Ch. 110 Answers (Differential Equations) 
A451. Use Integration by Parts (IBP) twice: 
ys f expasinede=expasine- f expecosxde 
=exprsinz — (expose - 7 exp x (-sin x) de] =expxrsing + exp x cos x — [ exp xsinxdz. 


sin © — COS & 
Rearranging, the general solution is y = exp 2— +C, 





3 
Plug in the initial condition to get 1 = 1- +Cor C= 5° Hence, the particular solution 
is 
_ exp a (sina -cosx) +3 
ne cn 


A452. By the Inverse Function Theorem (Ch. 91.2), 
da 1 1 


dy expy 7 





exp (-y). 
So, the general solution is 


x2 [ exp (-y) dy = —exp(-y) +C. 


Rearranging, we find that the general solution to + is also 


1 9 1 
C-2 C-2 








exp y = =-In(C'-2). 


We are now given this initial condition: 
(x,y) = (0,0). 
Plugging 3 into 2 we get C=1. Hence, the particular solution is 
x 2 1-exp(-y) or y=-In(1-2). 
A453. Use Integration by Parts (IBP) twice (we actually already did this in Exercise 451): 


d 
st = f expesinede=expesine- f exparcoseda 
g 


= exprsine ~(expreose - f exp (-sinz) de) =exprsing +expzoose~ f expzsinxda 


sin x — cosx 


5 +C’. Next, 


d 
Rearranging, — = [ exp x sin x dx < exp x 
z 
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y= f expe" + cax= f expe aes f expe 


inz-—cosx 1 
+ expa—— - 5 [ exprcosede + Cr. © 


Zde+ [ Cau 





To find i exp xcosz dz, again use IBP (but this time only once): 


: 1 sinx-cosx = 
exp x cosx dx = exp xcos x — exp x (-sin x) dx = exp xcos x + exp x ———_ Cc 


2 
sinz+cosx = 
2 exp £«——————_ + C' 
2 
Now plug 2 back into © to get our general solution 
sin © — COS & sinz+cosz 1- exp £ COS x A 
y = expx—] — - expt] — - 50 + Ca = Ser 6. 


Nl ow 


1 7 i, Aa A 
Plug the initial condition (x,y) = (0,1) into 2 to get 1 = > 0+Cor4 


a a 
Plug * and the initial condition (a, 7)= (2, 0] into 2 to get 0=O0+ or +5 Or Ge = 


exp X COS X 3 
Hence, the particular solution is y = — P +—-—“¢+ 


2 mn 2 
GMm 





(b)(i) F =< (mv). 


(b) (ii) If m is constant, then om = 0 and 


(c) By assumption, the sole force acting on the object is the Earth’s gravity. Hence, the 
ball’s rate of change of momentum is the force of gravity. That is, 


GMm GM 
5 or a= =< yt 
r r 





ma = — 





The negative sign in front of the force of gravity ™ is there because we've adopted the 


convention®” that upwards is the positive direction—gravity acts downwards and hence in 
the negative direction. 


(d) (i) [xa - f Zav= [ vav=5v +0. 


(d) (ii) [ -“far- AG 


: : 





661 We actually also did this in Exercise 188. 
6°2\We adopted this convention when we said that the object is shot upward with initial velocity V > 0. 
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(d) (iii) We were given the initial condition (r,v) = (R,V). 


(d)(iv) Combining (i) and (ii), avis = one C. 








GM GM 
Plug in (iii) t ei cctcs Co r= 5V?7- 
ug in (iii) to ge : a R 
Hence, Se a ea a 
2 ar 2 R r R 


(e)(i) At the object’s maximum height H, v = 0. 
(e)(ii) Plug (i) and r = R+ H into what we found in (d)(iv) to get 








2GM 2GM 2GM 2GM 
_ o 2_ 9 
0 Rai + RP or Rai +V RP 0 or 
YW i] 7 2RGM 7 RV2 
7 = sou IGM - RV2 2GM - RV2 


(f)(i) If the object keeps travelling upward forever, then it must be that v > 0 for all r. 


(nai) \/7e™ s V2 26% > 0 for all r _ 200 v2 OM 5 0 for all r _ 
Tr 


Yes 56 <= V> a 


Hence, the escape velocity (or minimum initial velocity for the object to travel upwards 
forever) is 








2GM 
v=, PM, 


vey ty. [2GM __ [2(6.67 x 10-1) (5.97 x 104) 3 
(f) (iii) Ve = EE a? egg 
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153.24. Ch. 111 Answers (Revisiting Logarithms) 


A455. Differentiate both sides with respect to x to get 


1 
By Corollary 47, In— and -In«z differ by at most a constant: 
z 
1 
In—=-Inz+C, for some Ce R. 
ve 


ii 
Plug in x =1 to get In] =-In1+C or C=0. Thus, In— =-Inz. 
is 


* l l ‘9 * 
A456(e) clog,a = Gene ees log, a°. (The second step uses Fact 223.) 








Inb- Inb 
1 gin Ina 
(f) log, — = nt fa) = a = -—log,a. (The second step uses Fact 222(c).) 
a n n 
,In(ac) Ina+Inc Ina Inc, 
(g) log, (ac) = oT ee ee log, a + log,c. (The second step uses Fact 


222(b).) 
(h) is immediate from (f) and (g). But if we’d like, we can also prove this without using 
(f) and (g): 


a . In(a/c) _ina-Inc_Ina_Incy,, eee 
c  Inb mb Inb nb oS 








log, 


(The second step uses Fact 222(d).) 


log.6 » Ina/Ine_ Ina, 





i) —- = = = | ; 
(i) log.a Inb/Inc Inb ae 
« » 4d 
(j) log c= ee —log,c. (The second step uses Fact 223.) 
b blna 0b 


Ina 
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154. Part VI Answers (Probability and Statistics) 


154.1. Ch. 112 Answers (How to Count: Four Principles) 


A457. ‘Taking the green path, there are 3 ways. Taking the red path, there are 2 ways. 
Hence, there are 3+ 2=5 ways to get from the Starting Point to the River. 


A458. The tree diagram below illustrates. 
Case #1. First letter is a D. 


Case #1(i). Second letter is a D. Then the last two letters must both be E’s. (1 permuta- 
tion.) 


Case #1(ii). Second letter is an E. Then the last two letters must be either DE or ED. (2 
permutations. ) 
Case #2. First letter is a E. 


Case #2(i). Second letter is an E. Then the last two letters must both be D’s. (1 permu- 
tation. ) 


Case #2(ii). Second letter is a D. Then the last two letters must be either DE or ED. (2 
permutations. ) 


Altogether then, there are 1+ 2+1+2=6 possible permutations of the letters in DEED. 





DEDE DEED EDDE EDED 
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A459. 3x5x10=150. 


A460. We must choose three 4D numbers. Choosing the first 4D number involves four 
decisions—what to put as the first, second, third and fourth digits, with the condition that 
no digit is repeated. 


Thus, by the MP, there are 10 x 9 x 8 x 7 = 5040 ways to choose the first 4D number. 


If we ignored the fact that we already chose the first 4D number, then there’d similarly be 
5040 ways to choose the second 4D number (given the condition that this second 4D number 
does not have any repeated digits). However, there is an additional condition—namely, the 
second 4D number cannot be the same as the first. Thus, there are 5040 - 1 = 5039 ways 
to choose the second 4D number. 


By similar reasoning, we see that there are 5040 — 2 = 5038 ways to choose the third 4D 
number. 


Altogether then, by the MP, there are 5040 x 5039 x 5038 = 127, 947, 869, 280 ways to choose 
the three 4D numbers. 
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A461. Apply the IEP twice. 


1. The food court and hawker centre share 2 types of cuisine (Chinese and Western) in 
common. And so together, the food court and the hawker centre have 4+ 3-2 = 5 
different types of cuisine. 


2. Combine together the food court and the hawker centre (call this the “Low-Class Place”). 
The Low-Class Place has 5 types of cuisine and shares 2 types of cuisine (Chinese and 
Malay) with the restaurant. And so together, the Low-Class Place and restaurant have 
5+3-2 =6 different types of cuisine (namely Chinese, Indonesian, Japanese, Korean, 
Malay, and Western). 


Food Court (4 choices of cuisine) Hawker Centre 


ee choices) 


Indonesian Western 


Korean 


Japanese 


Restaurant 
Total: 6 choices (3 choices) 





A462. 10-3=7. (Can you name them?) 
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154.2. Ch. 113 Answers (How to Count: Permutations) 


A463. 6! = 720, 7! = 5040, and 8! = 40320. 
A464. 7!/(4!3!) = 35. 
A465. 91. 


A466. The problem of choosing a president and vice-president from a committee of 
11 members is equivalent to the problem of filling 2 spaces with 11 distinct objects. The 
answer is thus P (11,2) = 11!/9! = 11x 10=110. 


A467. Let B and S stand for brother and sister, respectively. 


(a) First consider the problem of permuting the seven letters in BBBBSSS, without any 
two B’s next to each other. There is only 1 possible arrangement, namely BSBSBSB. 
There are 4! ways to permute the brothers and 3! ways to permute the sisters. 


Hence, there are in total 1 x 4!3! = 144 possible ways to arrange the siblings in a line, so 
that no two brothers are next to each other. 


(b) First consider the problem of permuting the seven letters in BBBBSSS, without any 
two S’s next to each other. We'll use the AP. 


1. B in position #1. 
(a) B in position #2. Then the only way to fill the remaining five positions is SBSBS. 


Total: 1 possible arrangement. 
(b) S in position #2. Then we must have B in position #3. 
i. B in position #4. Then the only way to fill the remaining three positions is 
SBS. Total: 1 possible arrangement. 
ii. S' in position #4. Then we must have B in position #5. And there are two 
ways to fill the remaining two positions: either BS or SB. Total: 2 possible 
arrangements. 


(... A467 continued on the next page ...) 
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(... A467 continued from the previous page ...) 
2. S' in position #1. Then we must have B in position #2. 


(a) B in position #3. Then, like in 1(b), we are left with two B’s and two S’s to fill 
the remaining four positions. Hence, Total: 3 possible arrangements. 


(b) S in position #3. Then we must have B in position #4. There are three ways 
to fill the remaining three positions: SBB, BSB, and BBS. Total: 3 possible 
arrangements. 


By the AP, there are 1+1+2+3+3=10 possible arrangements. 
Again, there are 4! ways to permute the brothers and 3! ways to permute the sisters. 


Hence, there are in total 10 x 4!3! = 1440 possible ways to arrange the siblings in a line, so 
that no two sisters are next to each other. 


(c) We saw that there was only 1 possible (linear) permutation of BBBBSS'S that satisfied 
the restriction, namely BSBSBSB. 


If we now arrange the siblings in a circle, there will necessarily be two brothers next to each 
other. 


We thus conclude: There are 0 possible ways to arrange the siblings in a circle so that no 
two brothers are next to each other. 


(d) In part (b), we found 10 possible (linear) permutations of BBBBSS‘S that satisfied 
the restriction. 


Of these, 3 have sisters at the two ends: SBSBBBS, SBBSBBS, and SBBBSBS. If 
arranged in a circle, these 3 arrangements would involve two sisters next to each other. So 
we must deduct these 3 arrangements. 


We are left with 7 possible arrangements: BBSBSBS, SBBSBSB, BSBBSBS, 
SBSBBSB, BSBSBBS, SBSBSBB, and BSBSBSB. But of course, these are sim- 
ply one and the same fixed circular permutation! (This is consistent with Fact 228, which 
tells us to simply divide by 7.) 


And now again, we must now take into account the fact that the brothers are distinct and 
the sisters are distinct. We conclude that there are in total 1 x 4!3! = 144 possible ways to 
arrange the siblings in a circle, so that no two sisters are next to each other. 
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154.3. Ch. 114 Answers (How to Count: Combinations) 








A468. 
ey n! 
k } k!(n—-k)! 
_nx(n-1)x--+x(n-k+1)x(n-k)x(n-k-1)x-- xl 
7 ki(n-k)x(n-k-1)x---x1 
Oe wane aon 
k! 
A469. 
| | 
C'(4,2) = Al Al _ 4x3 -6, 


(4-2)! 212! 2x1 


6! 6! 6x5 
6,4) = ————_ = —— =15 
C(6.4) = Tea > Tal = 2xd ; 


7! 7! 7x6x5 
CO ee 


31(7-3)! 314! 3x2x1 
Aa7o. {° |[ “ |{ ° | =630. 
1)\2}\2 


A471. (a) C(1,0)+C(1,1) =14+1=2=C(2,1). 
(b) C(4,2) + O(4,3) =34+3=6=C(5,3). 


17! 17! 17x16) «17x 16x15 


C(17,2) +C(17,3) = = 
tc) C(07,2) + C07,3) = sa * sng ox * Sx axl 





18x 17x 16 
Sx2e 1 


=l7 #5217 *x6x%5=17*s*0= 
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A473. Expanding, we have 


(l+z)=(1+2)(1+2)(1+z2) 
=1-1-14+1-1-¢4+1-¢-1+@-1-14+1-¢-74+24-1-¢4+4-"-14+2-2-2. 
CUZ Ure eon eee rh ee prkk_<730 oa SS —_—SI OY” 
0 2’S le 22'S 32'S 
Consider the 6 terms on the right. There is C(3,0) = 1 way to choose 0 of the x’s. Hence, 
the coefficient on x° is C(3,0)—this corresponds to the term 1-1-1 above. 


There are C(3, 1) = 3 ways to choose 1 of the x’s. Hence, the coefficient on x! is C(3,1)— 
this corresponds to the terms 1-1-z7, 1-7-1, and x-1-1 above. 


There are C(3,2) = 3 ways to choose 2 of the x’s. Hence, the coefficient on x? is C(3,2)— 
this corresponds to the terms 1-x2-x, x-1-x, and x-x-1 above. 


There is C(3,03) = 1 way to choose 3 of the x’s. Hence, the coefficient on x? is C(3,3)—this 
corresponds to the term x-x- 2x above. 


Altogether then, 
(1+2) = 2 Oa? \ais |? leat ? | ai aon ae ee. 
0 1 2 a 


A474, 2° = 128. 


WM 
fo) 
eae 
=) 
Qo. 
oO 
oO 
e 
bo 
“I 
ll 
a 
x] 
—— 
+ 
ee 
reoN 
—— 
+ 
ae 
oN 
——— 
+ 
+ 
a 
“IN 
—— 
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AA75. 


covayta( 4 Jair ( ; }oroe| : }are| : Jar| ; Jat 


= 8144-277 +6-927 44-3294 27 = 8141082 +540? + 127° + x*. 


A476. (a) There are ; = 4 ways of choosing the two Tan sons and ( ; | = 3 ways of 


choosing the two Wong daughters. 


Having chosen these sons and daughters, there are only 2! = 2x1 possible ways of matching 
them up. This is because for the first chosen Tan Son, we have 2 possible choices of brides 
for him. And then for the second chosen Tan Son, there is only 1 possible choice of bride 
left for him. 


Altogether then, there are ( : ( ; -2 = 24 ways of forming the two couples. 


(b) There are ( : | = 6 ways of choosing the five Lee sons and ( ; = 126 ways of choosing 


the five Ho daughters. 


Having chosen these sons and daughters, there are 5! = 5 x 4x 3x 2x1 possible ways of 
matching them up. This is because for the first chosen Tan Son, we have 5 possible choices 
of brides for him. And then for the second chosen Tan Son, there are 4 possible choices of 
brides left for him. Etc. 


Altogether then, there are ( ° \( . | -5! = 6- 126-5! = 90,720 ways of forming the five 


couples. 
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154.4. Ch. 115 Answers (Probability: Introduction) 
A477(a). (i) The appropriate sample space is 


S = {AS Ka, Q4,...,28, AV KY, QY,..., 29, A9, KO, Q¢,...,26, Ade Ke Qa... 26). 


(a) (ii) Since there are 52 possible outcomes, there are 2°? possible events. Hence, the 
event space contains 2°” elements. It is too tedious to write this out explicitly. 


(a) (iii) As always, P has domain © and R. We have P({3@}) = P({54}) = 1/52 and 
P({3¢,54}) = 2/52. In general, given any event A«€ X, we have 


|A| _ [Al 
Pye el 
(A) le) pe 
In words, given any event A, its probability P(A) is simply the number of elements it 
contains, divided by 52. So for example, P ({3¢, 54, A#}) = 3/52, as we would expect. 


(a) (iv) John might argue that since packs of poker cards usually come with Jokers, there 
is the possibility that we mistakenly included one or more Jokers in our deck of cards. He 
might thus argue that to cover this possibility, we should set our sample space to be 


S = {A@ Ka, ,...,2@, AV KY,...,24, 46, KO... 26, Ade Ke... 26 Joker}. 


953 


The event space would be appropriately adjusted to contain elements. 


The mapping rule of the probability function would be appropriately adjusted, based on 
John’s belief of the probability of selecting a Joker. For example, if he reckons that the 
probability of selecting a Joker is 1/10,000, then he might assign P ({Joker}) = 1/10, 000 
and for any other card C,, P ({C}) = 9999/(10000 - 52). The probability of any other event 
Aé® is as given by the Additivity Axiom. 
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A477(b). (i) The appropriate sample space is S = {HH, HT,TH,TT}. 


(b) (ii) Since there are 4 possible outcomes, there are 24 = 16 possible events. Hence, the 
event space contains 16 elements. It is not too tedious to write these out explicitly: 





Ds {2 {HH}, {HT} {TH}, {TT}, {HH, HT} ,{HH,TH},{HH,TT}, 





{HT,TH} ,{HT,TT} ,{TH,TT} ,{HH, HT,TH}, 


{HH,HT,TT} ,{HH,TH,TT} ,{HT,TH,TT}, s} 





(b) (iii) As always, P has domain © and R. We have P({HH}) = P({HT}) = 1/4 and 
P({HT, HT,TH}) = 3/4. In general, given any event A € X, we have 


A A 
Peay = l= 


In words, given any event A, its probability P(A) is simply the number of elements it 
contains, divided by 4. So for example, P({7'H,TT}) = 2/4, as we would expect. 

(b) (iv) John might, as before, argue that there is the possibility that a coin lands on its 
edge. He might thus argue that the sample space should be 


S ={HH,HT,HX,TH,TT,TX, XH, XT, XX}. 





The event space would be appropriately adjusted to contain 2° = 512 elements. 


The mapping rule of the probability function would be appropriately adjusted. For example, 
if John believes that any given coin flip has probability 1/6000 of landing on its edge, then 
we might assign P ({X X}) = 1/6000?, P({HH}) = (5999/6000)?, P({X H}) = (1/6000) - 
(5999/6000), ete. 
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A477(c). (i) The appropriate sample space contains 36 outcomes: 


g- Ey ) 
tat ear erat at 


(c) (ii) Since there are 36 possible outcomes, there are 2°° possible events. Hence, the 


event space contains 2°© elements. 


1 
(c) (iii) As always, P has domain © and R. We have P (| \ = (| \ = — and 


2 
P (| Z \ = —. In general, given any event A«™, we have 


O' B]) 36 
|A| _ [Al 
P a 
(A) [S| 36 
In words, given any event A, its probability P(A) is simply the number of elements it 
4 
contains, divided by 52. So for example, P a ie = —, as we would expect. 


O'S’ ea 36" 


(c) (iv) John might argue that there is the possibility that a die lands on a vertex. He 
might thus argue that the sample space contains 7? = 49 outcomes and should be 


) 


Pe om o Ho 8 GOvv V 
a Bp UVa @ VV aw’ ua V * 


The event space would be appropriately adjusted to contain 2” elements. 


The mapping rule of the probability function would be appropriately adjusted. For example, 
if John believes that any given die roll has probability 1/1000000 of landing on a vertex, 


V 1 o 999999 \? 
th si ‘ap So a = t 
en we might assign (| ” } 10000002" (| a } (aa) 7m 
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A478. (a) By definition, A and B are mutually exclusive if An B = @. Since P(@) = 0, 


a Oo 


the result follows. 


P(ANB) =0 





(b) The events A, A®n B, and A®n B°nC are mutually exclusive. Moreover, their union 
is AUBUC. Hence, by the Additivity Axiom (applied twice), 


P(AU BUC) =P(A)+P(ASN B)+P(ASN BONC). 
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154.5. Ch. 116 Answers (Conditional Probability) 


A479. Let A be the event that we rolled at least one even number and B be the event 
that the sum of the two dice was 8. We have P(B) = 5/36 (see Exercise 485). 


And AnB can occur if and only if the two dice were (3 8, (3 G3, or @ ©). Hence, P(An B) = 
3/36. 


Altogether then, 





_P(AnB) _ 3/36 3 
BALE) == pay gaps 


A480. It may be true that 


1 
( match|Blood stain is not John Brown's) 10,000,000 
It does not however follow, except by the CPF, that 
il 
(Blood stain is not John Brown’s| match) 10, 000, 000 


There is reason to believe that P (Blood stain is not John Brown’s) is much greater than 
P(DNA match) and thus that P (Blood stain is not John Brown’s|DNA match) is much 
greater than P(DNA match|Blood stain is not John Brown's). 


One important factor is that if the DNA database is large, then invariably we’d expect to 
find, purely by coincidence, a DNA match to the blood stain at the murder scene. As of 
May 2016, the US National DNA Index contains over the DNA profiles of over 12.3 million 


1 
individuals. A if j hat there i ] bability ————— that t 
individuals. And so, even if it were true that there is only probability 10,000,000 at two 


random individuals have a DNA match, we’d expect to find a match, simply by combing 
through the entire US National DNA Index! 


The error here is similar to the lottery example, where we conclude (erroneously) that a 
lottery winner must have cheated, simply because it was so unlikely that she won. 
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154.6. Ch. 117 Answers (Probability: Independence) 


A481. By Fact 234, A,B are independent events < > P(A|B) = P(A). Rearranging, 
P(B) = P(An B)/P(A) = P(B|A), as desired. 


A482. First, note that P (Hi) = P(71) = P(A) = 0.5. 


(a) P(A, A) = 0.25 = 0.5 x 0.5 = P(H,) P (Ap), so that indeed H; and Hp» are indepen- 
dent. 


(b) P (Hy 7,) = 0.25 = 0.5x0.5 = P (H2) P (71), so that indeed Hy and T; are independent. 


(c) Observe that H, nT; = @ (it is impossible that “the first coin flip is heads” AND also 
“the first coin flip is tails”). 


Hence, P(H, nT) = P(@) =0 #0.25 =0.5x 0.5 = P(A,)P(7)), so that indeed H, and T, 
are not independent. 


A483. No, the journalist is incorrectly assuming that the probability of one family 
member making the NBA is independent of another family member making the NBA. But 
such an assumption is almost certainly false. 


The same excellent genes that made Rick Barry a great basketball player, probably also 
helped his three sons. Not to mention that having an NBA player as your father probably 
helps a lot too. 


The two events “family member #1 in NBA” and “family member #2 in NBA” are probably 
not independent. So we cannot simply multiply probabilities together. 


A484. First, note that P (H,) = P(7>) = P(z) =0.5. 

(a) P(A, nT) = 0.25 = 0.5 x 0.5 = P(A) P(Z), so that indeed H;, T> are independent. 
P(H,nX) = 0.25 =0.5 x 0.5 = P(A) P(X), so that indeed Hi, X are independent. 
P(1,n X) = 0.25 = 0.5 x 0.5 = P(T,) P(X), so that indeed T,, X are independent. 
Altogether then, H,, 75, and X are indeed pairwise independent. 


(b) The event H; NT) X is the same as the event H,n7>. Thus, P(H,n7T)nX) = 
P(H, nT) = 0.25 + 0.5 x 0.5 x 0.5 = P (AH,) P (72) P(x), so that indeed the three events are 
not independent. 
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154.7. Ch. 119 Answers (Random Variables: Introduction) 














A485(a) 
k s such that X (s) =k PULX 2h) 
2 = = 
o 36 
OH © y) 
3 eae te = 
0’ 8 36 
A atid 3 
OH’ oO’ oO 36 
' HB O88 8 4 
Bo’ 8’ oO’ os 36 
‘ HB O88 8 5B 
BH ooo o 36 
‘i HD O88 8 6 
Boose oo 36 
F O08 8 ‘5B 
Bo o's oa 36 
: OH 8 8 4 
HB 8 o's 36 
|) ec 3 
i} 2 
3 OG 36 
12 ae = 
AR 36 











(b) F is the event X > 10. 


3 2 1 6 1 
P(4)=P(X 210) =P(X =10)+P(X =11)+ P(X =12)=—+-—-+-—=— =>. 
(c) P(E) = P(X > 10) = P(X = 10) +P(X=11) +P(X=12)- 54S =o =5 
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{3}mel3) 


(7a)( 2) <5 ra) 8-12 


A487. If S = {1,2,3,4,5,6}, then X : S > R defined by X (s) = s is of course a random 
variable. A random variable is simply any function with domain S and codomain R; and 
X certainly meets these requirements. 

POteleP( (Xx =2)2P (4.23) 2P(x%=4)=P(x 25) =P (Xx =6) =1/band Pix =k) = 
O for any k #1,2,3,4,5,6. 


A488(a). (i) The sample space is 





Ss {HHHH, HHHT, HHTH ATHH,. THA, HTT ATHT, THAT, 





HTTH,THTH,TTHH, HTTT,THTT,TTHT,TITTH, TTTT}. 


The event space © is the set of all possible subsets of S and contains 2'© elements. 
The probability function P : © > R is defined by P(A) = |A|/16, for any event A € ™. 


(a) (ii) The random variable X : S > R is the function defined by 


HTTT.THTT. TTHT,TITH + 1, 
HHTT. HTHT THHT HITH THT TTHH 2, 
HHHT HHTH, HTHH,THHH = 3, 

TTTT+0, HHHH}4. 





(a)(iii) P(X =4)=1/16, P(X =3)=4/16, P(X =2)=6/16, 
P(X =1) =4/16, P(X =0)=1/16, P(X=k)=0, — for any k#0,1,2,3,4. 
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A488(b). (i) The sample space S consists of 216 outcomes: 


Gy OU tC) G3} Gy 
5 = GJ, G) cr) &3) () ’ GJ ee, &3 (J ’ (J peg &3) 
OH 3 eM 3 eWo 


9216 


The event space © is the set of all possible subsets of S and contains elements. 


The probability function P : © > R is defined by P(A) = |A|/216, for any event A € ™. 


(b) (ii) The range of X is {3,4,5,...,18}. We now count the number of ways there are for 

the three dice to reach a sum of 3, to reach a sum of 4, etc. This will enable us to write 

down the mapping rule of the function X:S > R. 

To get a sum of 3, the three dice must be GI) or permutations thereof. There is thus 
| 


31 = 1 possibility. 


To get a sum of 4, the three dice must be CL, or permutations thereof. There are thus 


a 
ha 3 possibilities. 


To get a sum of 5, the three dice must be CO), CLC, or permutations thereof. There are 
thus a + 2 = 6 possibilities. 

2! 2) 
To get a sum of 6, the three dice must be CLE, GL), SLL, or permutations thereof. 


3! 3! 
There are — + 3! + — = 10 such possibilities. 




















2! 3! 
To get a sum of 7, the three dice must be GUE, GE, Oe), OG), or permutations 
thereof. There are - +3!) + - + - = 15 such possibilities. 
To get a sum of 8, the three dice must be WH, HA, CE, OO, OH), or permutations 
thereof. There are * +3!4+3!4+ - + * = 21 such possibilities. 
To get a sum of 9, the three dice must be CLG, GEG), GIG, OU), OEE, HE, or 
permutations thereof. There are 3! +3! + * + = +3!+ a = 25 such possibilities. 
To get a sum of 10, the three dice must be WE, CEI), CRE, OG, CEI, GEES, or 
permutations thereof. There are 3! +3! + * +3! + * + - = 27 such possibilities. 


By symmetry, there are also 27 ways to get a sum of 11; also 25 ways to get a sum of 12, 
etc. 


(... Answer continued on the next page ...) 
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(... Answer continued from the previous page ...) 


So X : S > R is defined by 


GJ (J (-) 
XI OXI QO [=X] © 
() (J (J 
(b)(iii) P(X =3)=—— 
111 = ~ 516’ 
10 
P(X =6) ==, 
25 
P(X =9) ==, 
0 ean) ene 
216 
10 
P(X =15)= 5, 
Po ye 
7 ~ 216’ 


for any k¢ (3,4, 5,..., 18}. 
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o 
X| oO ]=xX 
Q 
o 
=-X| @|=x 
G 
3 
P(X =4) = — 
(et) 577 
15 
P(X =7) = — 
(X =7) = 576 
i 
P(X = 10) = — 
(As) Se 
21 
P(X = 13) = — 
(X = 13) = 55, 
6 
P(X = 16) = — 
(4 > 16)= 5 
P(X =k) =0, 


o 
x| @ |=3. 
G 
a a 
(1) =X (=) = 4, 
a a 
a a 
O }=X] © ]=5, 
a G 
6 
P(X =5) =. 
21 
P(X =8) ==. 
Peas. 
216 
Poe. 
216 
3 
P(X = 1) ==, 
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154.8. Ch. 120 Answers (Random Variables: Independence) 
A489. No. For example, P(X =0,Y =0) =0, but 


P(X =0)P(Y =0) =0.5 x 0.25 = 0.125. 


154.9. Ch. 121 Answers (Random Variables: Expectation) 


A490. (a) P(X +Y =2) is simply the probability of 2 heads and 0 sixes OR 1 head and 
1 six OR 0 heads and 2 sixes. So 





2 2 6 6 


tt 1 1 £ tliat 2 2 1 4 
Pease eae a ee +o+a+s-+ t= an one = a 
17/2 2\1/66 22 66 144 144 144 144 


(b) P(X +Y =3) is simply the probability of 2 heads and 1 six OR 1 head and 2 sixes. So 


11 5 1 1111 10 2 «12 
PO aa ee eee rane mee ele 
a: 1/2266 144° 144 144 


(c) P(X +Y =4) is simply the probability of 2 heads and 2 sixes. So 


ee ae oe 


BE ea 6 6 ad 





(d) E[LX+Y] 


= Yi P(X+Y=k)-k 
keRange(X+Y) 


=P(X+Y =0)-0+P(X+Y =1)-1+P(X%+Y =2)-2 
+P(X+Y =3)-3+P(X+Y =4)-4 
25 60 AG 12 1 60+92+36+4 192 4 


= -O+ -1+ -2+ -34+— -4= = =—, 
144 144 144 144 144 144 144 3 
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A491(a). The range of X consists simply of the possible prizes from the “big” game. 
Range (x) = {2000, 1000, 490, 250, 60,0}. (Don’t forget to include 0.) 


Similarly, Range (y) = {3000, 2000, 800, 0}. 
1 


P(X =2 =P =1 = P(X = 490) = —_— 
(b) P( 000) = P( 000) = P( 90) TT 


10 


P(X = 250) = P(X = 60) =, 


_ 9977 


aa ae 10000’ 


i 


P(¥ = 3000) = P(¥ = 2000) = P(Y = 800) = 
9997 

P(Y =0) =—_. 
10000 


(c) E[X]= YY. P(X =k)-k=2000P(X = 2000) + 1000P (X = 1000) +... 
keRange(z) 


-- + 490P (X = 490) + 250P (X = 250) + 60P (X = 60) + 0P (X =0) 


2000 1000 490 250-10 60-10 9977-0 
= ——— + —— + —— + + ——— + 


= = 0.659 
10000 10000 10000 10000 §=10000 ~=—- 10000 








E[Y]= > P(¥ =2000)-k 


keRange(y) 
= P(Y = 3000) -3000+ P(Y = 2000) - 2000 + P(Y = 800)-800+P(Y =0)-0 


1 | 9997 
=; — 49 ——_.. —— -0=0. .2+ 0.08 + 0 = 0.58. 
10000 3000 + 10000 000 + 10000 800 + 10000 0=0.3+0.2+ + 


(d) For every $1 staked, the “big” game is expected to lose you $0.341 and the “small” 
game is expected to lose you $0.42. Thus, the “big” game is expected to lose you less 
money. 
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154.10. Ch. 122 Answers (Random Variables: Variance) 


A492. In Exercise 488(b), we already found that P (Z = 3) = 1/216, P(Z = 4) = 3/216, ..., 
P(Z = 18) = 1/216. By symmetry, we have u = E[Z] = 10.5. So, 


1 3 1 25704 
2) _ 222 —_. 4? Soe —_ .192 = > =~ 119, 
BAI og "1g ong ane 


1 
Hence, V[Z| = E[Z?] - »? = 119 - 10.5? = a 


A493. E[Y]= = x 20 cm + = x 30 cm = 26.5 cm. 


V[Y]= = x (20 em - 26.5 em)? + = x (30 em — 26.5 em)’ = 22.75 cm?. 


SD[Y] = J/V[Y] # 4.77 cm. 


A494, (a) 2u kg, 207 ke”. 
(b) 2u kg, 40? ke. 


(c) The mean of the total weight of the two fish is 24 kg. However, we do not know the 
variance, since the weights of the two fish are not independent. 


154.11. Ch. 123 Answers (The Coin-Flips Problem) 


(This chapter had no exercises.) 


154.12. Ch. 124 Answers (Bernoulli Trial and Distribution) 


(This chapter had no exercises.) 
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154.13. Ch. 125 Answers (Binomial Distribution) 


A495. Let X ~ B(20,0.01) be the number of components in engine #1 that fail. Let 
Y ~ B(85, 0.005) be the number of components in engine #2 that fail. 


The probability that engine #1 fails is 
P(X >2)=1-P(X <1)=1-P(X =0)-P(X=1) 
=]- ( = }oovroas - ( a Jovan" 
0 1 
x 0.0169. 
The probability that engine #2 fails is 
P(¥e2)s1=P(¥ <1a=1]—P(Y =0)-PY =1) 
=1- ( , }o.0n5%0.n5 - ( i } 050.05 
x 0.0133. 
Hence, the probability that both engines fail is 


P(X >2)P(Y > 2) * 0.00022. 
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154.14. Ch. 126 Answers (Continuous Uniform Distribution) 


A496. 
0, i <3, 0.5, ifke [3,5] 
Oo. i Rel, 
(a) Fy(k)=40.5k, ifke[3,5], (0) fr(k)= 
0, otherwise. 
1, if k> 5. 


(c) P(3.1< Y < 4.6) =0.75 is in blue and P(4.8< Y < 4.9) = 0.05 is in red. 





O5ifO<k <1 
0, otherwise. 


0.5 







fr(k) = 





P(3.1<Y <4.6) =0.75 


P(4.8 < X < 4.9) = 0.05 —S———— 
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154.15. Ch. 127 Answers (Normal Distribution) 
A497. (a) From Z-tables, 
P(Z>1.8) =1-P(Z<1.8) =1- (1.8) x 1-0.9641 = 0.0359. 


Graphing calculator screenshot: 














p(a) p(a) 


P(-—0.351 <Z 























(b) From Z-tables, 


P (-0.351 < Z < 1.2) = ®(1.2) - ®(-0.351) = (1.2) - [1 - ©(0.351)] 
x 0.8849 - (1 - 0.6372) = 0.8849 - 0.3628 = 0.5221. 


Graphing calculator screenshot: 
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A498. If w~=0 and o? =1, then 





- 4,0)’ 1 -05(9)? | 4,-0.0? _ 
fx (a) am Ws Ta (a). 


We’ve just shown that the PDF of X ~ N(,07) when ys = 0 and o”, is the same as the PDF 
of the SNRV Z ~ N(0,1). Hence, the SNRV is indeed simply a normal random variable 
with mean p = 0 and variance o? = 1. 





X - X - 1 
A499. ‘First observe that poo ae Now simply use Fact 242, with a = — and 
g¢ 6 ol 
p=. 
ol 


xX - xX - —u 1 
Baa Hn (2s So?) 00,1). 
oO o Oo o 0 O 
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A500. Let X ~N(y,07) and let fx and Fy be the PDF and CDF of X. 


ils 


We know from Fact 241 that ®(0oo) = 1. And by Corollary 49, F'x(00) = P(X < 00) = 


@ (00). This, Fy(co) = L, 


. fx (a)=¢ (—*). But we already know from Fact 241 that @ (| >0. 
o o 


3. E[X] =E[oZ+p] =cE[Z] + p= up. 
4. We know from Fact 241 that the standard normal PDF ¢ attains a global maximum 


10. 


WE 


at 0. That is, (0) > ¢(a), for alla e R. Since X = 0Z +p, this is equivalent to 
fx(o-0+p) > fx(o-a+p), for all a¢ R. Equivalently, fy (uw) > fx (b), for all be R. 
That is, fy attains a global maximum at pL. 


. Var [X] = Var[oZ + ps] = 0? Var [Z] = 07. 
. We know from Fact 241 that for any a¢ R, we have P(Z <a) =P(Z <a). Equivalently, 


for alla e R, P(X <oa+p) = P(X <ca+p). Equivalently, for all be R, P(X <b) = 
P(X <b). 


. We know from Fact 241 that ¢ is symmetric about 0. Since X =a0Z+p, fx must likewise 


be symmetric about 0-0+ w= p. 
(a) By Corollary 49, P(X >p+a)=P (z > =), By Fact 241, P (z > =) =P (z < -*). 
0 oO 0 
Now again by Corollary 49, P (z < -*) =P(X <p-a). Altogether then, P(X > +a) = 
0 

P(X < y-a), as desired. And of course, by definition, P(X < w-a) = Fy (u-a). 

(b) Obvious. 

(c) Obvious. 


. First use Corollary 49: P(u-o0<X<pto) = P(-1<Z<1). Now use Fact 241: 


P(-1< Z <1) = 0.6827. 


. First use Corollary 49: P(u-20< X <u+20) = P(-2<Z<2). Now use Fact 241: 


P(-2< Z <2) =0.9545. 

First use Corollary 49: P(w-380< X <ut+3a) = P(-3<Z<3). Now use Fact 241: 
P(-3< Z <3) = 0.9973. 

By Fact 241, ¢ has two points of inflexion, namely at +1. That is, @ changes concavity at 
+1. Since by Corollary 49, X =0Z +p, fx must likewise change concavity at +l-0+p= 
uta. That is, fy has two points of inflexion, namely at w+o. 
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A501. Weare given that X ~ N(2.14,5) and Y ~ N(-0.33, 2). 


1-2.14 





(o) P(X21)=P(Z2 } = P(z 2-0.5008) 
=P (Z < 0.5098) = © (0.5098) » 0.6949. 


p(ve1)=P(z24-C*)). p(z 20.0406) 


=1-P(Z<0.9405) = 1- © (0.9405) « 0.1735. 


P(X 21) and P(Y 31) P(-2< X <-1.5) and P(-2< Y <-1.5) 





(0) P(-2sXs-18)=P( 


(-2)- 2.14 | Be (-1.5) =) 
V5 5 


x P(-1.8515 < Z < -1.6279) = P (1.6279 < Z < 1.8515) 


= (1.8515) — (1.6279) ~ 0.9679 — 0.9482 = 0.0197. 


_ of (-2) - (-0.33) (-1.5) — (-0.33) 
P(-25¥ 5-15) =P/ 53 <7 6 5; 








« P (-1.1809 < Z < -0.8273) = P (0.8273 < Z < 1.1809) 


= @ (1.1809) - © (0.8273) « 0.8812 - 0.7959 = 0.0853. 
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A502. (a) Let W ~ N (25000, 64000000) and E ~ N (200, 10000). Let B = 0.002W +0.3F 
be the total bill in a given month. Then 


B ~ N (0.002 x 25000 + 0.3 x 200, 0.002? x 64000000 + 0.3? x 10000) 
= N (50 + 60, 256 + 900) = N (110, 1156). 


Thus, P(B > 100) * 0.6157 (calculator). 


(b) Let By ~ N(110,1156), By ~ N(110, 1156), ..., Big ~ N (110, 1156) be the bills in each 
of the 12 months. 


Then the total bill in a year is T = By +B +---+Byy ~ N (12 x 110, 12 x 1156) = N (1320, 13872). 
Thus, P(T > 1000) » 0.9967 (calculator). 


(c) The total bill in a given month is B = 0.002W + «EF and 
B~N (50 +2002, 256 + 100002”) . 


Our goal is to find the value of x for which P(B > 100) =0.1. We have 








LOO'= y _9 
P(B> 100) =P(z> AAR) -p(z 50 — 200x 


/256 + 10000x2 /256 + 10000x2 
=) 
1-4 50 - 2002 J=o1 
/256 + 10000z2 


From the Z-tables, 








( 50 — 200x J=0 — 50-2002 one 
/256 + 10000x2 /256 + 10000x2 


One can rearrange, do the algebra (square both sides), and use the quadratic formula. 
Alternatively, one can simply use one’s graphing calculator to find that x » 0.084. We 
conclude that the maximum value of x is approximately 0.084, in order for the probability 
that the total utility bill in a given month exceeds $100 is 0.1 or less. 
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A503. From our earlier work, we know that each die roll has mean 3.5 and variance 
35/12. 

The CLT says that since n = 30 > 30 is large enough and the distribution is “nice enough” 
(we are assuming this), X can be approximated by the normal random variable Y ~ 
N (380 x 3.5, 30 x 35/12) = N (105, 1050/12). Thus, using also the continuity correction, we 
have P(100 < X < 110) * P(99.5 < Y < 110.5) = 0.4435 (calculator). 


A504. Let X be the random variable that is the sum of the weights of the 5,000 Coco-Pops. 


The CLT says that since n = 5000 > 30 is large enough and the distribution is “nice 
enough” (we are assuming this), X can be approximated by the normal random variable 
Y ~ N(5000 x 0.1, 5000 x 0.004) = N (500,20). Thus, P(X < 499) » P(Y < 499) x 0.4115 
(calculator). 
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154.16. Ch. 130 Answers (Sampling) 


_3+144+24+8+8+6+0 Al 


A505. & 7 = > and 
> La} -nz? 9+156+4+64+64+36-417/7 155 
rs 6 a 


A506. (a) The sample mean and sample variance are 


a | 
z= Lis _ 188 _ ges 
n 10 
rig? (Eiit) 378, 965 — 1885" 
gpa ee On 8, 10 9 2550, 





n-1 9 


(b) The sample mean and sample variance are 


Dat _ Er (e= 80+ 50) _ Fi (w= 50) +E 50 _ 1885 + 50 _ 1885 


Zi = — +50 = 238.5, 
nm 


nr nr nr nr 


2 10 
: w 2550. 
: nat 9 


A507. (a) Assume that the weights of the five Singaporeans sampled are independently and 
identically distributed. Then unbiased estimates for the population mean p and variance o7 
of the weights of Singaporeans are, respectively, the observed sample mean x and observed 
sample variance s: 


a; 32+884+67+ 75+ 56 





p= - : = 63.6, 
nm 

Qe Lae : ee ie 
n— 


(b) We don’t know! And unless we literally gather and weigh every single Singaporean, we 
will never know what exactly the average weight of a Singaporean is. 


All we’ve found in part (a) is an estimate (63.6 kg) for the average weight of a Singaporean. 
We know that on average, the estimator we uses “gets it right”. 


However, it could well be that we’re unlucky (and got 5 unusually heavy or unusually light 
persons) and the estimate of 63.6 kg is thus way off. 
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A508. 


ee ee El 
y n 
_E[XJ+E[Xo] + +E[Xn] _wtut tm _ re 


= —- > LL. 
n n n 
We have just shown that E [x | = 1. In other words, we’ve just shown that X is an unbiased 
estimator for p. 


A509. (a) The observed random sample is (#1, 2%2,..-,%19) = (1,1,1,1,1,1,1,0,0,0). The 
observed sample mean and observed sample variance are 


— tates + XO 
go 0,7, 
n 


> (t1-E) + (a9 -£)? +--+ (219-2) _ 7-0.824+3-0.77 
5? = ; = ; = 0.23. 
n a 








(b) Yes, the observed sample mean & = 0.7 is an unbiased estimate for the true population 
mean p (i.e. the true proportion of coin flips that are heads). 


And yes, the observed sample variance s? = 0.23 is an unbiased estimate for the true 
population variance o?. 


(c) No, this is merely one observed random sample, from which we generated a single 
estimate (“guess”)—namely £ = 0.7—of the true population mean wp. 


All we know is that the sample mean X is an unbiased estimator for the true population 
mean pt. That is, the average estimate generated by X will equal p. 


However, any particular estimate 7 may or may not be equal to jz. Indeed, if we’re unlucky, 
our particular estimate may be very far from the true p. 


1 
A510. “ |= “Tevet X1+Xy+- -+Xy)| = GVar[X1 + Xp4e--+ Xu] = — (Var [Xi] + Va 
nr n 
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k 
A511. (a) The population mean yp is the number defined by ps = )>a;/k. It is the 
i=l 
average across all population values. 
k 
(b) The population variance o” is the number defined by o? = )) (a; - 1) /k. It measures 
i=l 
the dispersion across the population values. 


(c) The sample mean X is a random variable defined by X = )) X;/n. It is the average 
i=l 


of all values in a random sample. 


n 


(d) The sample variance S” is a random variable defined by S? = )*(X;-X)/(n-1). 
i=l 
It measures the dispersion across the values in a random sample. 


(e) The mean of the sample mean, also called the expected value of the sample mean, is the 
number E [x |. The interpretation is that if we we have infinitely many observed samples 
of size n, calculate the observed sample mean for each, then E [x | is equal to the average 
across the observed sample means. It can be shown that E[X | = pw and hence that the 
sample mean X is an unbiased estimator for the population mean pu. 

(f) The variance of the sample mean is the number Var [x i The interpretation is that if 


we have infinitely many observed random samples of size n, calculate the observed sample 
mean for each, then Var [x | measures the dispersion across the observed sample means. 
(g) The mean of the sample variance, also called the expected value of the sample variance, 
is the number E[S?]. The interpretation is that if we have infinitely many observed 
random samples of size n, calculate the observed sample variance for each, then E [S?] is 
equal to the average across the observed sample variances. It can be shown that E [Ss °] =o" 
and hence that the sample variance S? is an unbiased estimator for the population variance 
2 
o 
(h) Given an observed random sample, e.g. (21,22,23) = (1,1,0), we can calculate the 
corresponding observed sample mean as 


mteetes; elt 2 
3 ~~ 3 ~3 





T= 


The observed sample mean is the average of all values in an observed random sample. 


(i) Given an observed random sample, e.g. (21,%2,%3) = (1,1,0), we can calculate the 
corresponding observed sample variance as 


» _ (a -%)" + (a2- 2%) +(a3-8)” _ 1/9+1/9+4/9 1 


i 9 3" 





The observed sample variance measures the dispersion across the observed sample variances. 
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154.17. Ch. 131 Answers (Null Hypothesis Significance Testing) 


A512. Let p be the probability that a coin-flip is heads. The null and alternative 
hypotheses are 


Hyte=0.0 and. dat >0.o. 


Our random sample is 20 coin-flips: (X1, Xo,..., X20), where X; takes on the value 1 if 
the zth coin-flip is heads and 0 otherwise. 


Our test statistic is the number of heads: T =X, + Xo +---+ Xo. 


In our observed random sample (21, %2,...,220), there are 17 heads. So the observed 
test statistic is t = 17. 


Assuming Ho were true, we’d have T ~ B (20,0.5). Thus, the p-value is 
P(T > 17|Ho) = P(T = 17|Ho) + P(T = 18|Ho) + P(T = 19|Ho) + P(T = 20|Ho) 


= iw 0.570.539 + 0.5'80.5? + a 0.5190.5! + ae 0.57°0.5° ~ 0.0013. 
17 18 19 20 


Since p x 0.0013 < a = 0.05, we reject Hp at the 5% significance level. 


A513. Let p be the true long-run proportion of coin-flips that are heads. The null and 
alternative hypotheses are 


Ho: ~=0.5 and Ay: p#0.5. 


Our random sample is 20 coin-flips: (X1, Xo,..., X29), where X; takes on the value 1 if 
the zth coin-flip is heads and 0 otherwise. 


Our test statistic is the number of heads: T =X, + Xo +---+ Xap. 


In our observed random sample (21, %2,...,229), there are 17 heads. So the observed 
test statistic is t = 17. 


Assuming Ho were true, we’d have T ~ B(20,0.5). Thus, the p-value is 
P(T >17,T <3|Ho) = P(T =0|AHp) +---+ P(f =3/Ho) + P(T = 17|Ho) +--- +P (T = 20|Hp) 


© a 0.590.570 + a0 0.510.519 + a 0.50.5 se a0 0.52°0.5° ~ 0.0026. 
0 1 17 20 


Since p x 0.0026 < a = 0.05, we reject Hp at the 5% significance level. 
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A514. Let yz be the probability that a coin-flip is heads. 





(a) The competing hypotheses are Ho: w= 0.5, Ha: > 0.5. 
The test statistic J’ is the number of heads (out of the 20 coin-flips). 


For t = 14, the corresponding p-value is 
P(T > 14|Ho) = P (ZL = 14|Ap true) + P(T = 15] true) +---+ P (TL = 20|Hp true) 


© 050.5" + aU G.5°05 4224 a 0.57°0.5° ~ 0.05766. 
14 15 20 


For t = 15, the corresponding p-value is 


P(T > 15|Ho) = P(L = 15|Ap true) + P (T = 15] true) +---+ P (TL = 20|Hp true) 


S bce 0.5140.5° + ay 5°05? eet Ze 0.5290.59 ~ 0.02069. 
15 15 20 


Thus, the critical value is 15 (this is the value of t at which we are just able to reject Ho at 
the a = 0.05 significance level). 


And the critical region is {15,16,...,20} (this is the set of values of t at which we’d be able 
to reject Ho at the a = 0.05 significance level). 


(b) The competing hypotheses are Hp: w= 0.5, Ha: # 0.5. 
The test statistic T’ is the number of heads (out of the 20 coin-flips). 


For t = 14, the corresponding p-value is 


P(T >14,T <6|Ho) =1-P(7<T < 13|Ap) 
=1-[P(T =7|Hp true) + P(T = 8|Ap true) +--- + P(T = 13|Hp true) | 


as ( a Jostos" ( 7 Jostos roe| : Jost ~ 0.1153. 


For t = 15, the corresponding p-value is 


P(TS 1b <5/H,)H 1 P C< 71 < 141) 
=1-[P(T =6|Hp true) +P (T =7|Hp true) +--- + P(T = 14|Hp true) | 


=]- ( . Jostos" ( . Jostostcne 7 Jasros w 0.1153. 


Thus, the critical value is 15 and the critical region is {15,16,...,20}. 
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A515. The competing hypotheses are 


Ao: = 34, 
Has peo, 


The observed sample mean is 


35 + 354+ 31+ 324+ 334344314 344 35+ 34 © 


33.4. 
10 





i= 
The corresponding p-value is 


p=P(X 233.4, X < 34.6|Ho) = P(X > 33.4|Ho) + P(X < 34.6]Hp) 


4-34 4.6 -— 34 
-P(22 BiB) op( «MAB ) -oser 


~ /9/10 \/9/10 


The large p-value does not cast doubt on or provide evidence against Hp. We fail to reject 
Ho at the a = 0.05 significance level. 


A516. The competing hypotheses are 


Ay: w= 34, 
Hy: pt 34. 


The observed sample mean is 7 = 33.4. 


The corresponding p-value is 


p=P(X < 33.4, X > 34.6|Ho) = P(X < 33.4|Ho) + P(X >34.6]Hp) 


CLT, (2 < oii +P (2 > Pe wi04550:; 


~ ,/9/100 9/100 


The large p-value casts doubt on or provides evidence against Hp. We reject Ho at the 
a = 0.05 significance level. 
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A517. The competing hypotheses are 


Ao: = 34, 
Has (ce 34. 


The observed sample mean is % = 33.4. And the observed sample variance is s? = 11.2. 


The corresponding p-value is 


p=P(X < 33.4, X > 34.6|Ho) = P(X < 33.4|Ho) + P(X > 34.6]Hp) 


CLT . (7 . 33.4-34 , 34.6 - 34 
~ y/11.2/100 ~ y/11.2/100 


The fairly small p-value casts some doubt on or provides some evidence against Ho. But 
we fail to reject Hp at the a = 0.05 significance level. 


| # 0.07300. 


A518. The observed sample mean is Z = 68 and the observed sample variance (use Fact 
244(a)) is 





2 _ ae c = (eas) = ee So 


a % 383.7. 
n= 1 49 








Let p be the true average weight of a Singaporean. The competing hypotheses are Hp: p = 
75 and Aas je< 715; 


(This is a one-tailed test, because your friend’s claim is that the average American is heavier 
than the average Singaporean. If the claim were instead that the average American’s weight 
is different from the average Singaporean’s, then we’d have a two-tailed test.) 


Since the sample size n = 50 is “large enough”, we can appeal to the CLT. The p-value is 


Hp[a 68 — 75 


p=P(X < 68/4) \/383.7/50 


The small p-value casts doubt on or provides evidence against Hp. We can reject Ho at any 
conventional significance level (a@ = 0.1, a = 0.05, or a = 0.01). 


# 0.0058. 
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154.18. Ch. 132 Answers (Correlation and Linear Regression) 
A519. 

1200 + ¢ 

1000 e 

800 aN 

600 ~. 


PO) 





A520. Compute p = (8+9+4+10+8)/5 = 7.8 and 
q = (300 + 250 + 1000 + 400 + 400) /5 = 470. Also, 


5 (pi - B) (a: -@) = (8—B) (300 - @) + (9-H) (250 -G) +--+ (8-H) (400-4) 


i=1 
= (8 — 7.8) (300 - 470) + (9 — 7.8) (250 — 470) +--+ + (8 — 7.8) (400 — 470) = -2480, 


\ 30.7" (8—H)° + 9-H)" + (4—p)* + (10 —p)* + (8-py 


(8 -— 7.8)? + (9- 7.8) + (4-7.8)° + (10 - 7.8)" + (8 - 7.8)° = 20.8 » 4.56070170, 


= \/ (300 — 470)” + (250 — 470)” +--+ (400 - 470)” = 368000 ~ 606.63003552. 





Diet (Pi-P) (G-D . —2480, 


5 5 * 156070170 x 606.63003552 © 
n — n = : x < 
Diet (Pi- D) VY Lies (Gi - @) 
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A521. (a) We already computed (in the previous exercise) that p = 7.8, ¢ = 470, 


Y* (pi — P) (Gi — @) = 2480 and Y* (p; - p)” = 20.8. So, 
i=l i=l 
p Lie (P =P) (a= 9) ew. age 
iat (pi =p) 20.8 





Thus, the regression line of g on p is q-q = b(p-3) or q- 470 = -119.2(p-7.8) or 
q = 1400 — 119.2p. 








(b) a i 2 3 4 5 
pi ($) 8 9 t 10 8 

di 300 250 1000 400 400 

di 446 327 923 208 406 

= —-G|-146 -77 77 192 -46 





(c) 0 2 4 6 8 10 


5 
(d) The SSR is > a? » (-146)° + (-77)" + 77? + 192? + (-46)” = 72308. 
1=1 
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A522. 


After Step 1. After Step 2. After Step 3. After Step 4. 


After Step 5. After Step 6. After Step 7. After Step 8. 


After Step 9. After Step 10. After Step 11. After Step 12. 


The T184 tells us that r = —.8963881445 and the regression line is y = ax+b = —119.2307692+ 
1400. This is indeed consistent with the answers from the previous exercises. 














A523. In the previous exercises, we already calculated that the OLS line of best fit is 
q = 1400 - 119.2p. Thus, 


(a) By interpolation, a barber who charged $7 per haircut sold 1400 - 119.2 x 7 » 566 
haircuts. 


(b) By extrapolation, a barber who charged $200 per haircut sold 1400-119.2x200 = -22440 
haircuts. This is plainly absurd. 


The second prediction is obviously absurd and thus obviously less reliable than the first. 
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A524. (a) rx 0.954. 







yi | 10.59 10.54 27.30 33.84 56.6 U--""@ 


- 


1 4 9 16 25 
iG 10.59 10.54 27.30 33.84 56.6°7~ 
_--"®@ 
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155. Part VII Answers (2006-19 A-Level Exams) 


155.1. Ch. 133 Answers (Functions and Graphs) 


A525 (9758 N2019/1/3)(i) p{(a+q) +r} =p(a> + 3qa? + 3Pe4+q +r). 

Comparing coefficients, p = 2, 3pq = —6 or g=-1, and p (¢3 + r) =-12 or r=-5. 

So, f (x) = 2a - 62? + 62 - 12 = 2{(2-1)* -5}. 

(ii) Right 1, down 5, stretch vertically by a factor of 2 (outward from the x-axis). % 


A526 (9758 N2019/1/4)(i) You are supposed to mindlessly use your calculator and copy. 
But as an exercise (and proof that we are not mindless monkeys), let’s not: 


1. Start with the graph of y =e’. 

2. The graph of y = 2” is similar, just flatter. 

3. Translate down 10 to get y = 2” - 10. 

4. Reflect portion below the z-axis in the z-axis to get y = |2” — 10]. % 
If x = 0, then y = |2° - 10| =9. So, the only y-intercept is (0,9). 

If y = 0, then 0 = |2"-10| <— > 10= 2” <=> 2g = log,10. So, the only x-intercept is 
(logy 10,0). 


A 






Y 









a 
> 
(log, 10,0) 

3. y= 2-10 





(ii) |2°-10|<6 <> 27-10 [-6,6] <> 2°€[4,16] <=> In2°€[In4,In16] 


In4 In16 
In2¢[In4,In1 ——, —— | = [2,4]. % 
<=> rln2€[In4,In 6] = ve[> >| [2,4] 
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A527 (9758 N2019/I/5)(i) Write y = f (x) =e?" —4¢ Range f = (—4, 0). 

Do the algebra: y + 4 = e”” or x = 0.5In(y +4). 

So, f is one-to-one and its inverse is f~!: (-4,00) > R defined by f~! (x) = 0.5In (x + 4). 
(ii) 5 = fg (x) = f (g(2)) = f (vo +2) =e) 4 > Gael) o> g=ln3-2. 


A528 (9758 N2019/1/7)(i) y' (2) =e* -xze™. 
y(1)=1-e' =e", y’(1)=e!-1e' =0. So, the tangent at x = 1 is 
y-y(1)=y' (1) («- 1) or y-el=0 or y=e?, 
y (-1) =-e CY = -e, y! (-1) =e - (-1)e" = 2e. So, the tangent at x =—-1 is 
y-y(-1l) =y'(-1)[2-(-1)] or yt+e=2e(r+1) or y = 2er +e. 
(ii) The first tangent line is horizontal. The second has slope 2e. 
Hence, the acute angle between these two lines is tan”! (2e) = 1.39. 


A529 (9758 N2019/II/2)(i) You’re supposed to just copy from your calculator: 


y & 


(2,0) x 














Since 3x? + 52 —8 = (34 +8) (x-1), the graph has vertical asymptotes x = —8/3 and z = 1. 
As x > +00, y > 0. So, the graph has horizontal asymptote y = 0. 

If y =0, then 2-2 =0 or x =2. So, the only x-intercept is (2,0). (Note: r>2 = > y<0.) 
If z =0, then y = -1/4. So, the only y-intercept is (0,-1/4). 

(ii) x € (—co, -8/3) u (1,2). (iii) x € (-8/3, 1) U (2, 00). 
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A530 (9758 N2018/I/4)(a) 277+ 32-2 =(2r-1)(x4+2)=0 — > r=-2,0.5. 
Since 2x77 + 3a -2 is a U-shaped quadratic, 277+37-2>0 <= x<-2 OR 2>0.5. 
For x < -2 OR + >0.5, 2x? + 3x -2|=2-2 <=> 277+32-2=2-2 


—2+,/2? -4(1) (-2) 
Ne eS ay 

2(1) “A 
Check whether each of -1 + V3 satisfies the condition “x < -2 OR x > 1/2”. Both do: 
-1- V3 # -2.73 < -2, while -1+ V3 » 0.732 > 1/2. So, both -14 V3 are roots. 


Next, for -2 <x < 1/2, we have 22? + 3x - 2| = 


— 0 = 2074+ 47 -4=0774+27-2 <> r= 


2 


~2n? —380+2=2-0 — > 0=2274+2r=27+n=2(2+1) — v=0,-1. 


Check whether each of 0 and -1 satisfies the condition “-2 < x < 1/2”. Both do. 
So, the given equation has four roots: —1 + V3, 0, and -1. 5 
(b) The inequality holds if and only if x € (-1 4/8. -1) U (0, —1+ v3). Sa 


y = (20? + 32-2 








-1-V3 -2 - OO 4/3 





A531 (9758 N2018/I/5). We are given that for all x eR \ {Db}, 








7 _ i futa\ Feta xtat+a(z+b) (l+a)et+a+ab_ 
FF (a) = FF) = F(S5) = “o+b x+atb(x+b) (1+b)xr+a+P? © 
— (l+a)c+a+ab=(1+b) 2’? +ar+b'x 
—> O=(1+b)2?+(b?-1)x-a(b+1) =(14+b)[2? +(b-1)2-a| 


Since the above is true for all « ¢ R\ {b}, we must have b = -1. 


Since f is a rectangular hyperbola whose asymptotes are parallel to the axes, f is one-to-one 
and its inverse f! exists. 

L+a x+a 
Since ff (x) = 2 if and only if f-' (x) = f (x), we have f7! (x) = f (x) = ae 

x x - 
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A532 (9758 N2017/I/2)(i) 








1 
(ii) First find the two graphs’ intersection point(s): For x > a, 


il i 1 1 
cog 7 le al = (2-4) —— 5 (e-ay ra r-azyft —> eee: 


(At , , we can discard the negative square root because x > a.) 
1 
We must check that a+ 1/Vb satisfies >, as indeed it does: a+ 1/Vb 2 


1 
So, for x >a, the two graphs intersect at 7 =a+ 1/Vo. 


1 1 9 
Next, f < ——_ =plr-—al=bla- 2—-=(p=a) . 
ext, for x <a, ar |x —a| = b(a-2z) —> 5 (1 -a) 
which never holds since (x - a)* > 0, while -1/b < 0 (because b > 0). So, for x <a, the two 
graphs do not intersect. 
Altogether, the two graphs intersect only at x =a+1/ Vb. 


And now, from the graph, we see the given inequality holds if and only if 


Vb 
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re (-e,a)u(a+ 00) a 


47x +9 i 

=4+ 
r+2 r+2 
(i) Compute y! (2) = - (+2). So, y! (x) <0 for all x € R\ {-2}. 
(ii) We already did this above: a = 4 and b=1. The horizontal asymptote is y = 4. The 
vertical asymptote is x = —2. 


(iii) Right 2 (to get y=4+1/x), then down 4 (to get y = 1/z). * 


A533 (9758 N2017/1/4). First, observe that y = 








(x € R\ {-2}). 


A534 (9758 N2017/1/5)(a) Let f (x) = 2°+axz7+bx+c. By the Remainder Theorem, 


f(l)=ltatb+c%8, 
f (2)= 8 +4a+2b+c 212, 
f (3) =274+9a+3b+c 225. 
Either solve this system of three equations with three unknowns using your graphing cal- 
culator or “manually”, as we now do: 
Taking 2 - 4 yields 7+ 3a+b=4 or b3 -3(a+1). 
Plug * into 2 to get 18+ c= 25 or c27. 
Next, plug = and 2 into 4 to get 1+a-3(a+1)+7=8 or 5-2a=8 or a=-1.5. 


Now from 5 we also have b= 1.5. + 


(b) From f (x) = 2°- 1.52? +1.54+7. compute f’ (x) = 32? -3r+4+ 1.5. This quadratic has 
negative discriminant (-3)° - 4 (3) (1.5) = 9-18 = -9 <0 and so does not touch the z-axis. 
It has positive coefficient on the x? term and so is U-shaped. Altogether, it is everywhere 
above the z-axis—i.e. f’(x) >0 for all ce R. 

Which means that f is everywhere strictly increasing. And so, f can only intersect the 
x-axis at most once—equivalently, f can have at most one (real) root. © 


Now observe that f(-100) < 0 and f(0) = 7 > 0. Since the polynomial function f is 
continuous, by the Intermediate Value Theorem, there exists d ¢ (-100,0) such that 


f (d) = 0. This shows that f has at least one (real) root. e 
Together, © and ® show that f has exactly one real root. Using our graphing calculator, 
we find that it is x » -1.33. xt 


(c) f'(z4)=2 <> 32?-324+1.5=2 <> 327-32-0.5=0. 


~ 1.15, -0.145. % 


——=>_—sr TT = 


3 +f (-3)"-4(3)(-0.5) 34/15 
2-3 «6 





663 Theorem 9 
64By the Increasing /Decreasing Test (Fact 208). 
665Theorem 38.8. 
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A535 (9758 N2017/II/1)(i) Plug x = 3/t and y = 2t into y = 2z to get 2t = 6/t ort = V3. 
So, the two points are A = (v3, 2/3) and B= (-v3, -23). And 





|AB| = \/ [v3 - (-v3)] +[2v3- (-2v3)] = /4-3+ 16-3 = V60 = 2V'15. 


dy dy d 3 2 2 5 
(ii) = he S22 =-50. The tangent to C at P is y~2p=~57?(a-5). 
2p 3 oe 3 1 1 6 6 
At D,0-2p=-- (2-"} => ~ ==(2-=)=s2-— == $0, D=(~.0). 
3 p p 3\ ph 3 p P p 


At FE, y-2p=- (2p?/3) (0-3/p) — > y=2p+t 2p?/p = 4p. So, E = (0, 4p). 
Thus, the midpoint of the line segment DE is F = (3/p, 2p). 
Write x 4 3/p and y 2 2p. Rearrange = to get p = 3/x—plug into 2 to get y = 6/x. 


A536 (9758 N2017/II/3)(a) To get from y = f (2) to each equation, ... 


(a)(i) Compress inwards towards the y-axis by a factor of 2. So, an x-intercept is (a/2,0) 
and a y-intercept (0,6) is the same. 


(a)(ii) Translate 1 unit right. So, an x-intercept is (a+ 1,0). We can’t say anything about 
the y-intercepts. 


(a) (iii) Translate 1 unit right, then compress inwards towards the y-axis by a factor of 2. 
So, an x-intercept is ((a+ 1) /2,0). We cannot say anything about the y-intercepts. 


(a)(iv) Reflect in the line y = x7. So, an x-intercept is (b,0) and a y-intercept is (0, a). 
(b)(i) a= 1 is excluded because g (1) would be undefined. 


(B)(H) o (2) = 1 gy = ded (l-a)=e 
Since g? (x) = 2, g(x) = g(x) =1-1/(1-2). 
(b) (iii) g?(b) = g°' (b) —= b=1-1/(1-b) — 1-b=1/(1-0b) 


<> (1-b)=1 —> 1-b=41 <— 6=0,2. 




















4x? + Ax — 14 4x? + 4a -14- (2? -2 -12 

A537 (9740 N2016/I/1). ge + 2 +(a5)< x? + 4 : @ L ) 
_307+58-2 (82-1) (242) 
7 xz-A 7 r—4 ) 


(3a - 1) (a +2) 


1 <Q. Sign diagram for LHS: 
x —_ 


So, the given inequality is equivalent to 


- + ~ a 


The given inequality holds if and only if x € (-o0, -2) u (1/3, 4). * 
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A538 (9740 N2016/I/3). y = x*has turning point (0,0) and y-intercept (0,0). 
1. Start with the graph of y = 27. 
2. Translate 1 units right to get y = («—1)' (turning point (1,0), y-intercept (0,/*)). 


3. Stretch vertically (outwards from the x-axis) by a factor of k to get (turning 
point (/,0), y-intercept (0, kl*)). 


4. Translate m units upwards to get y = f(x) = k(x—1)* + m (turning point (l,m), y- 
intercept (0, ki* + m)). 


So, (I,m) = (a,b) and (0, kl* +m) = (0, c). 
Hence, 1 = a, m=b, and k = (e-m) /I* = (c-b) Ja’. 











nN 
y 
y= x 
(0, c) 
= f(x) =k(x-l)' +m 
y= f(a) =k(e-l) y= (2-1) 
(a,b) = (l,m) 1 
(0, 1/c) a, 0 FG) 
—__—_—__ S|’ @ = 
(0,0) 





The graph of y = 1/f (x) has y-intercept (0,1/c) and turning point (a, 1/b). 
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A539 (9740 N2016/1/10)(a)(i) Write y = f (x) =1+ /v€ Range f = [1, 0). 

Do the algebra: x = (y- i 

So, the inverse of f is the function f~!:[1,00) > Rj defined by f7! (y) = (y- iy. 

(a) (ii) ff (a) =f (1+Va)=1+V1+ Vet => V1l+VJe=2-1 — 1+JVe=2°-2r4+1 
—> JSrar?-Qc => v=a'-de3 +40? <— > x(2?-42?+40-1)=0 <> (by + 1 +0) 
a —4z* +42 -1=0. 


(a) (iii) Let p(x) = 2° -427+4x-1. Since p(1) = 0, by the Factor Theorem, x-1 is a factor 
of p(x). So, write 2° — 4a? + 4x - 1 = (x -1) (ax? + ba + c). 


Comparing coefficients, a = 1, -a+b=-4, and -c=-1 (so, b=-3 and c=1). 
Now, O=a° - 4a? + de -1 = (2-1) (x? - 32 +1) <— «r=10R 


34 (-3)-4(1) (1) 34. V5 





—~ 2 
We now check whether each of these three possible values of x satisfies + We find that 
f= 1 and y= (3 - v5) /2 do not, while x = (3 + v5) /2 does. & 


(a)(iv) Suppose f (f(x)) = x. Since x € Range f = Domain f, we can apply f~! to get 


7a) 2 fo (x). But by the Cancellation Law, we also have f~! (f (f (x))) 2f (a), 


Now plug = into 2 to get f! (x) = f (x). 


(b) (i) 9 (0) = 1, g(1) =1+9(0) =14+1=2, g(2)=2+g(1)=24+2=4, g(3)=1+9(2)=5, 
g(4) = 2+9(2) = 6, g(5) = 1+ 9(4) = 7, 9(6) = 2+9(8) = 7, g(7) = 1+9(6) = 8, 
g (12) =2+ g(6) =9. 


(b) (ii) The element 7 in Codomaing is “hit” twice: g(5) = 7 and g(6) = 7. So, g is not 
one-to-one. Hence, g has no inverse. 


A540 (9740 N2015/I/1)(i) Compute y’ (x) = -2a/x° + b. 
The given information yields this system of equations: 
~2.4 £a/1.67 + 1.6b +c = a/2.56 + 1.6b +c, 
3.6 2 a/ (-0.7) -0.7b +c = a/0.49 -0.7b +c, 
y’ (1) =2 2 -2a/13 +b = -2a+ b. 


Solving by calculator or hand, a # -3.593, b » —5.187, cx 7.303. % 
(ii) af/x? +ba2+c=0 = > at+bs®+cr?=0 => 2x -0.589 (calculator). at 


(iii) As x > +00, y > ba +c. Hence, the other asymptote is y = ba +c or y ¥ 5.187 + 7.303. 





66These two extraneous solutions (see Ch. 42.1) were introduced at the two ==> ’s above. 
667 This answer is largely a reproduction of the proof of Fact 56 (== ). 
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A541 (9740 N2015/I/2)(i) You’re supposed to just copy from your calculator. But here 
we won’t. First, observe that (x+1)/(1-2) =-14+2/(1-2). 
1. Graph y = 1/2. A 
2. Translate 1 unit right to get y=1/(x-1). : 
3. Reflect in z-axis to get y=1/(1-2). 
4. Stretch vertically (outwards from z-axis) by 
a factor of 2 to get y=2/(1-72). 
5. Down | to get 


Z 
2 1 = 
yas = a l=" ZA 
-x 1-2 
(asymptotes x =1, y=-1). ge 


6. Reflect any portion of a 














the graph below z-axis 3 


in the x-axis to get 
r+1 





Y= 








l-z 
(asymptotes x =1, y=1). 


As instructed, we’ve also graphed y = x + 2. 








(ii) We look for the intersection points of the two graphs. 
First, for (xv + 1)/(1-2) >0 or x€ (-1,1), we have 














1 1 
— gO 2s Doe es ee ag oe ee Se ee 
a, ~7 
Haag 22S 4t 1) 1 





2 (1) 


We can verify that -1 + V2 [-1,1], while -1- V2 ¢ [-1,1]. 
So, here, we’ve found only one intersection point: x = -1+ V2. 
Next, for (2+ 1)/(1-2) <0 or ze R\ [-1,1], we have 


c+1 +1 9 


ee oa S990 2 aye | Se —~2+2 <> 0=27°-3 <— r=4v3. 














1-2 —- 2 


We can verify that +V3 € R \ [-1, 1]. 
So, here, we’ve found two intersection points: x = +V3. 
From the graph, the given inequality holds <> we (-v3, oe 1) U (v3, 00). Sa 


1999, Contents www.EconsPhDTutor.com 


A542 (9740 N2015/I/5)(i) Translate 3 units right to get y = (7-3), then stretch 
vertically (outwards from the x-axis) by a factor of 0.25°° to get y = 0.25 (x - ay 


(ii) Ay 











iF for 0s a< 1, 
(iii) We were given f(€)=40.25(x@-3)2, forl<@<3, 

CG, otherwise. 

iF for 0< 2 <2, 
So, f (0.52) = 40.25(0.52-3)°, for2<x<6, 

0, otherwise. 

2, for Vee <2, 
And, 1+ f (0.52) = 414+0.25(0.52-3)°, for2<2x<6, 

I otherwise. 


Note that the usual method does also work here: First stretch horizontally (outwards from 
the y-axis) by a factor of 2, then translate 1 unit upward. However, we have to do these 
operations carefully for each of the three separate “pieces” or intervals. 





y=1+f (0.52) 





| 
=i 2 t 


68 Or equivalently, compress vertically (inwards towards the x-axis) by a factor of 4. 
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A543 (9740 N2015/II/3)(a)(i) Write y = f (x) = 1/(1-2”) € Range f = (0, 0). 

Do the algebra: x = +\/1-1/y. Since x > 1, we can discard the negative value and be left 
with =V/1-1/y. 

Hence, f is one-to-one and has an inverse. 

(ii) From our above work, the inverse of f is the function f~! : (0,00) > (1,0) defined by 
f(y) = V1- Ty. 


(b) Suppose y € Rangeg. Then there exists some x € R\ {-1,1} such that y = g(a) or 


2+2 
2 





i or y—yx?=24+2 or yor +n+2-y=0. 
=o 


The quadratic equation Z (in z) holds for some x € R if and only if the determinant is 
non-negative: 


2 
—4y(2-y)20 or Ay”? — 8y+1>0. 


The quadratic polynomial 4y? - 8y+1 is U-shaped (because its coefficient on y” is positive). 
By the quadratic formula, its roots are 


8+ V(-8)-4(4)) ae, 118 
144/ - 


Thus, 5 and therefore also = hold for some x € R if and only if 


y8(-m.1- Sul + oo. 
2 2 


But now, observe that + does not hold if r=-1 or x= 1. 


Hence, there exists some x € R\ {-1,1} such that y = g(x) if and only if é holds. 





Thus, Rangeg = (-20,1-2]u[1+ oo. % 
1 1 1-2 l-x «2-1 1 

A544 (9740 N2014/1/1)(i) f? - ( )- = = = a eae 
MDOP OFF \-s2 =a = 





t 
Write y = f (x) = i e Range f = Rx {0,1}. 
—2x 
Do the algebra: x = 1 - 1/y. 


So, f is one-to-one and its inverse is the function f~!: R \ {0,1} + R x {0,1} defined by 
f(y) =1- Ly. 

We observe also that, as was to be shown, f? (x) = f~' (2) for all x ¢ R \ {0,1}. 

(ii) f° (2) = ff? (a) = ff" @) =n 

A545 (9740 N2014/1/4)(i) The graph of y* = f (x) ... 





6°The last step uses an Inverse Cancellation Law—Proposition 5(b). 
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e Is symmetric in the x-axis. 

¢ Is empty wherever f (x) <0. So here, it is empty to the left of A and between B and C. 

e« Has turning points (0, Va) and (0, -Vd) because the graph of y = f(x) has turning 
point (0,d). 

¢ Intersects the graph of y = f (~) wherever f (x) = 1. 

e Has the same x-intercepts as y = f (xz), namely A = (-a,0), B = (6,0), and C = (c,0). 





(v8) 








(ii) The tangents to the curve y? = f (a) at the points where it crosses the z-axis are vertical. 


dy dy dz 1 
A546 (9740 N2014/II/1)(i) — = — + — =6+6t=-—=0.4. So, t=2.5. 
( AI) T= at ay ; 0, 
| 
(ii) The tangent line at (3p", 6p) has equation y — 6p = — (x - 3p”). Where this line meets 
Pp 
1 
the y-axis, we have x = 0 and so y = 6p + — (0 - 3p”) = 6p — 3p = 3p. Hence, D = (0,3p) and 
Pp 
the mid-point of the line segment PD is (1.5p”, 4.5p). 
Write x + 1.5p* and y 2 4.5p. Rearrange 2 to get p 3 y/4.5, then plug 3 into + to get 
xv = 1.5(y/4.5)° = 2y?/27. 
A547 (9740 N2013/I/2). Rearrange the given equation to get 


cy-y=a?tatl or v2 + (1-y)x+y+120. 


Observe that = is a quadratic equation (in the variable x). It holds if and only if its 
discriminant is non-negative: 


(1-y)?-4(1) (y+1) =y?-6y-320. 


By the quadratic formula, y? — 6y — 3 = 0 has roots 


pe! ee 


2 





2 
Since y” — 6y —3 is a U-shaped quadratic (because it has positive coefficient on y”), > holds 
if and only if y € (-0, 3- 2/3| U [3 + 2/3, 00) —so, this is the set of values that y can take. 
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irae ee es cee 
A548 (9740 N2013/1/3)(i) y = ——— = = + -———. % 
( /W/3)Q) ¥= 9-5 * 5p md 
As usual, you can just copy from your calculator. But here as an exercise, let’s see how we 
can get this graph as a series of transformations from y = 1/z: 


1. Start with the graph of y = 1/x (which has horizontal asymptote y = 0 and vertical 
asymptote x = 0). 

2. Translate 1 unit right to get y=1/(«%-1) (horizontal asymptote y = 0 and vertical 
asymptote x = 1). 

3. Compress (inwards towards the y-axis) by a factor of 2 to get y = 1/ (2x - 1) (horizontal 
asymptote y = 0 and vertical asymptote x = 1/2). 


> i 
———— (horizontal asymp- 


a 
4. Stretch (outwards from the y-axis) by a factor of a to get y = a 
is bi — 


tote y = 0 and vertical asymptote x = 5): 





1 3 1 
5. Translate 1/2 unit up to get y= —+ = (horizontal asymptote y = 5 and vertical 


1 2 227-1 
asymptote x = a 


If x =0, then y =-1. So, the only y-intercept is (0,-1). 
If y =0, then x =-1. So, the only x-intercept is (-1,0). 











des a seal =e +2 
2e-1 ~ Ww-l 
A549 (9740 N2013/II/1)(i) Rangeg = R¢ Rv {1} = Domain f. 


(ii) Since Range f ¢ R = Domaing, the composite function gf exists. 


<> re (-00, 1/2) U(2, 00). t 
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2 =3% =3 6 
Now, (gf) (#) = 9 (f (2)) = 1-2f (a) =1-25 7% = SES 322 34 
(iii) Write (gf) (y) =3+ oe bi. Solving, y= 4, Soy. Coy) (5) =4. 





v1 
The above line probably sufficed on the A-Level exam. But strictly speaking, it is incom- 
plete and thus incorrect. See footnote for details.°” 


A550 (9740 N2012/I/1). Let x, y, and z be, respectively, the costs of the under-16, 
16-65, and over-65 tickets. Then the given information yields this system of equations: 


Or + 6y+4z = $162.03, Tr+5y+3z2$128.36, 10r+4y+5z2$158.50. 


You can use your graphing calculator to solve this system of equations. But here as an 
exercise, let’s not: 

Taking 2x 4 - 2 - 3 yields x + 3y = $37.20 or y = $12.40 - 2/3. 

Plug 4 into 2 to get 7x + 4z + $74.40 = $162.03 or z 2 ($87.63 - 7x) /4. 


Plug = and 2 into 2 to get 10x + $49.60 — 42/3 + $438.15/4 - 3527/4 = $158.50 or 
x (10 — 4/3 - 35/4) = $158.50 — $438.15/4 - $49.60 or x = $7.65. 











And now from = and 2, we also have y = $9.85 and z = $8.52. ae 
k k+1 
A551 (9740 N2012/I/7)(i) Write y = g(x) = — ~=1+ i ; ¢ Rangeg = R\ {1}, 
r- r- 


k+1 
y-1 
So, g is one-to-one and its inverse is the function g7' : R \ {1} > R\ {1} defined by 


1 
fOeie 
z-1 





Do the algebra: x = 1+ 





Since g(a) = g7' (x) for all x € Domain f = R\ {1}, g is self-inverse (by the given definition). 
(ii) As x > 1, y > +00. So, the graph has the vertical asymptote zx = 1. 
As x > +00, y > 1. So, the graph has the horizontal asymptote y = 1. 


O+k 
If x = 0, then y = — =-k. So, the only y-intercept is (0, —-k). 


If y=0, then 0=2+k or x=-k. So, the only x-intercept is (-k,0). 





6In order to even write (gf)~'(5), we must first prove the existence of the inverse function (gf)~*. So, 
strictly speaking, the above is incomplete and incorrect because we simply presume the existence of 
(gf)~' without actually proving it. A correct answer for (iii) is the following: 
6 
Write y = gf (x) =3+ 7 & Range gf = Rs 13}. 


Do the algebra: x = 1+6/(y-3). 

So, gf is one-to-one and its inverse is the function (gf)~': IR \ {3} > R\ {1} defined by (gf) 7! (y) = 
1+6/(y-3). 

Hence, (gf)~' (5) =1+6/ (5-3) =4. 
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(iii) Since g is self-inverse, a line of symmetry is y = x. 


1 
1. Start with the graph of y = —-. 
a 
1 
2. Translate 1 unit right to get y = a 
x — 
; k+1 
3. Stretch vertically (outwards from the x-axis) by a factor of k + 1 to get y = ——. 


sean 
k+1 k; 
4. Translate 1 unit up to get y=1+ aoe aits 
fa el 


A552 (9740 N2012/II/3)(i) You’re supposed to just graph on your calculator and copy: 
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(iv) y=|f (x)| 








(i) y= f (2) 





3 


(ii) f(r) =4 — > a+a?-27-4=4 — > 2734+2?-27-8=0. By observation, x = 2 is an 


integer solution to this last equation. 
Write v3 +2? -2x-8 = (x-2) (ax? +br+c). 


Comparing coefficients on the x? and the constant terms, a = 1 and c= 4. 
Comparing coefficients on the z? term, -2a+b=1 or b=3. Hence, 
w+ x? -Qx-8=(x-2) (2? +3044). 
The quadratic polynomial x? + 32 + 4 has negative determinant. Thus, there are no other 
real solutions to f (x) = 4. ae 
(iii) The given equation is equivalent to f (a +3) =4. 
In (ii), we showed that f («) =4 has only one real solution: x = 2. 
Hence, the equation here in (iii) has solution has x + 3 = 2 or x =-1. 


(iv) See above. Where f (x) < 0, reflect the graph in the z-axis. Where f (x) > 0, keep it 

unchanged. 

(v) |f(x)| =4 <=> |a>+2?- 22-4] = 4, 

4 2 1 a 3 a go 

Suppose 2? + 2° —-27-4>0. Then = becomes 2° + 2* -24-4=4 or 2? +4°- 22-8 =0. 

We already found that this cubic equation has only one real root, namely 2. We can verify 
2 

that x = 2 satisfies >. Hence, x = 2 is a solution for S. 


3 


3 
Now suppose instead that 2° + 77-22 -4 <0. Then + becomes -x? — x? +227 +4 = 4 or 


QO=a%+2?-Qr=2(2?+u-2)=2(4+2)(x-1). 
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This second cubic equation has three real roots, namely 0, —2, and 1. We can verify that 
3 

all three of these values satisfy <. Hence, x = 0,-2,1 are solutions for Z. 

Altogether, the equation |f («)| = 4 has four real solutions (or roots): -2,0,1,and2. 


A553 (9740 N2011/I/1). The numerator x? +2 +1 is a quadratic polynomial with 
positive coefficient on x? (1 >0) and negative discriminant (1? - 4 (1) (1) =-3 <0). So, the 
numerator is always positive. 


2 


The denominator x? + 2-2 is a quadratic polynomial with positive coefficient on x? (1 > 0) 


and these roots: 


-1ltV/1?-4(1)(-2)  -123 91 


a 2(1) a i a 





Hence, the denominator is negative <> «x € (-2,1). 
Altogether, the given inequality holds <> «x € (-2,1). 
A554 (9740 N2011/1/2)(i) The given information yields this system of equations: 


a(-1.5)?+b(-1.5)+c=4.5, (2.1)? +b(2.1)+c=3.2,  a(3.4)? +0(3.4)+c=41. 
Or, 2.25a-1.5b+c24.5, 441a+2.1b+c23.2, 11.56a+3.4b+c24.1, 


Taking Z_i yields 2.16a + 3.66 £13. 
Taking 2 - 2 yields 7.15a + 1.3b 2 0.9. 


36 36 36 
Taking 7% 2 _ 5 yields 7g * 7-15a - 2.16a = = x0.9+1.3 or 


36 36 6 
=(— x0.9+1.3)/{= x 7.15 - 2.16) © 0.215. 
a (x09+13)/(E «715 6] 0.215 


From 4 and x, b = (-1.3 -2.16a) /3.6 ¥ -0.490. 
From a R, and fy, c= 4.5 —2.25a + 1.50 » 3.281. af 
(ii) f is increasing <= > f'(r)=2ar+b>0 <— r> ~~ w 1.14. 
a 
A555 (9740 N2011/II/3)(i) Write y = f (x) =n (27 +1) +3€ Range f =R. 
Do the algebra: x = [exp (y - 3) - 1] /2. 
So, f is one-to-one and its inverse is the function f~!:R > (-1/2, 00) defined by f7!(z) = 
[exp (a - 3) - 1] /2. 
As usual, Range f~! = Codomain f~! = (—1/2, 00). 
(ii) For the graph of f, 
e Asx > -0.5, f(x) > -00. So, x = -0.5 is a vertical asymptote. 
¢ If x=0, then y =In(2-0+1)+3=In1+3=3. So, the only y-intercept is (0,3). 
e If y =0, then 0 = In(2z+1)+3 or -3ln(2z +1) or x = [exp(-3) - 1] /2. So, the only 
x-intercept is ([exp (-3) - 1] /2,0). 
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The graph of f~! is the reflection of the graph of f in the line y = x. So, the graph of f+ 
has horizontal asymptote y = —0.5, x-intercept (3,0), and y-intercept (0, [exp (-3) - 1] /2). 


(iii) If f intersects the line y =z at the point P, then f and f~ also intersect at P.°" 


The points where f intersects the line y = x are the points where f (x) = x or In (2x + 1)+3 - 
x. Hence, f and f! also intersect at the points where + holds. 


You’re supposed to simply solve 2 using your graphing calculator: x ~ —0.4847,5.482. 





A556 (9740 N2010/I/5)(i) Starting with y = 2°, 


1. First, “a translation of 2 units in the positive x-direction” produces y = (x-2)°: 

2. Next, “a stretch with scale factor 0.5 parallel to the y-axis” produces y = (2x - 2)° 

3. Finally, “a translation of 6 units in the negative y-direction” produces y = (2x — . —6. 
If x = 0, then y = (2-0-2)*-6 =-14. So, the only y-intercept is (0,-14). 

If y = 0, then get 0 = (2x - 2)° ~6 or ¢ = 6'/3/2+1. So, the only x-intercept is (eu )2 +1, 0). 





671See Fact 53. 
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y & 







0,679) +1 
y= f" (2) ( 





(ii) The graph of f7! is the reflection of f in the line y = x. So, it has x-intercept (-14,0) 
and y-intercept (0, 61/3/24 1), 

A557 (9740 N2010/II/4)(i) You’re supposed to just graph on your calculator and copy. 
Two vertical asymptotes are x = +1 and a horizontal asymptote is y = 0. 
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(ii) Observe that f is symmetric in the y-axis. So, by restricting Domain f to Rj, the new 
function produced would be one-to-one. Hence, the smallest k for which f~! exists is k = 0. 
(iii) Note that Rangeg = R \ {0,-1,1} ¢ R\ {+1} = Domain f, so that the composite 
function fg exists. 





7 a 1 _ ke 3)? 
fale) = £(@)= t=)" Ga ea 
; (x -3)? __ @-3) 
[1-(a-3)][1+(@-3)] (4-2) (@-2) 
: ’ ; (x-3)? 1 
(iv) Consider the inequality GG) >0 


The numerator is strictly positive for all « ¢ Domain g = R \ {2,3,4}. 

Next, the denominator is a quadratic polynomial with negative coefficient on 2? (-1 < 0) 
and roots 4 and 2. Hence, the denominator is strictly positive if and only if x € (2,4). 
Altogether then, > holds if and only if x € (2,4). 

Note though that 3 ¢ Domain fg. Thus, the given inequality fg (x) > 0 holds if and only if 
ge(2,4)* {3}, 

p= 3) 
(v) Let y= fg (2) = ees) 


(4-2) (2-2) Rearranging, we get 


(2-3)? =y (4-2) (x -2) or x? —6x+9=y(-a2? +6r-8) or 
(l+y)22-6(1+y)x+9+8y 20. 


If y = -1, then 2 becomes 9 - 8 = 0, which is false. So, y = fg (x) #-1. 


3 
Next, if y #-1, then 2 isa quadratic equation in x that holds for some x € R if and only if 
the discriminant is non-negative, i.e. 


0<[-6(14+y)} -4(1+y) (9 +8y) = (1 +y) (36 + 36y - 36 - 32y) = (1+) 4y. 


3 
This last inequality in y in turn holds if and only if y € (-o0,-1] U[0, 00). By #, we must 
exclude —1. 


Note also that Domain fg = R \ {2,3,4}. That is, 2 cannot take on the values 2, 3, or 4. 











= 8)? 
So, we must check and see what values = A. take on when we plug in x = 2,3, 4: 
(4-2) (@-2) 
(2-3)? ; (3-3)? (4-3)? 
defined = (0, and Gack: 
Coo Gace” Bae 


So, we must also exclude 0 from our answer. 
Altogether, Range fg = (-00,-1) U (0, 0). 


A558 (9740 N2009/I/1)(i) Write u, = an? +bn+c. The information given yields this 
system of equations: 
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a-174+b-1+¢210, a- 2245-24026, a-324+b-34c35. 


Taking + - 2 yields -3a -b £4. 
Taking 2 - 2 yields -ba —b 2 1. 
Taking £_3 yields 2a = 3 or a S15. 


From 2 and g we have b=-5a-14-8.5. 


From é g and i we have c= 10-a-b=17. 


Altogether, uy, = 1.5n? - 8.5n +17. 


(ii) up = 1.5n? -8.5n +17>100 <> 3n?-17n- 16650. 


The LHS is a quadratic polynomial in n with positive coefficient on n? (3 > 0). By the 
quadratic formula, its roots are 


174+ ,/(-17)2-4(8) (-166)  17+./289+1992 17+./2281 
| | | _— 2 Eso: 
2 (3) 6 6 


We can discard the negative value. The positive value that remains is (17 + V2281) /6 
10.8. 
Since n must be an integer, the set of values of n for which uy, > 100 is {11,12,13,...}. 
A559 (9740 N2009/1/6)(i) C, is a rectangular hyperbola with y-intercept (0,-1) and 
x-intercept (2,0). 
ga7 4 

Write y = —— =1- 

abi i ies EAD 
As 2 > -2, y > +00. So, x = -2 is a vertical asymptote. 





As 7 > +00, y> 1. So y=1 is a horizontal asymptote. 











C> is an ellipse centred on the origin, with y-intercepts (0, +3), x-intercepts (+ V6,0), 
and no asymptotes. 


(ii) Plug the equation for C; into that for C2 to get 
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ail or x? (x +2)? +2(x-2)° =6(2+2) or 

















6 3 
2(a-2)° =(%+2) (6-2°). 
(iii) x» -0.515, 2.45. 
: ax a a/b 
A560 (9740 N2009/II/3) (i) Write y = f (x) = =—+ e Range f = R\ {a/b}. 
br-a b bx-a 
1 bx —a ae a?/b a 
Do the algebra: _ Ope _ a 
o the algebra =a ae — a cT-a => 2 eae: 
So, f is one-to-one and its inverse is the function f~! : R \ {a/b} > R \ {a/b} defined by 
piy)= 2" 48 
a by-a b 


Observe that f (x) 2 f- (2), for all 2 ¢ Domain f. 

Hence, f2(x) = f (f (x)) =f (f-'(x)) =a (the last step uses an Inverse Cancellation Law). 
Range f? = Domain f = R \ {a/b}. 

(ii) Range g = R\ {0} ¢ Rx {a/b} = Domain f. So, fg does not exist. 





(iii) f' (ev) =2 =~ =2 <> arv=2x(br-a) — 0=2(br-2a) — «r=00R 
x = 2a/b. 

pep ey NOPE) O=(areb)e wdave 
A561 (9740 N2008/1/9)(i) f' (x) = “eae (cas a? (Quotient Rule). 


Since ad - bc +0, f’ (x) #0 for any x. So, there are no turning points. 


(ii) Suppose ad- bc = 0. Then f’ (x) = 0 for all x, f is a constant function, and its graph 
is simply a horizontal line. 
(iii) a=3, b=-7, c=2, and d=1. Since ad - bc = 3-1- (-7)-2=17 >0, by (i), the given 
graph has a positive gradient at all points. 
(iv)(a) This is a rectangular hyperbola with y-intercept (0,-7) and z-intercept (7/3, 0). 
32-7 8.5 
Write y= = 1.5- 
24+1 24+1 
As x > -0.5, y > +00. So, a vertical asymptote is x = —0.5. 





As x > +00, y > 1.5. So, a horizontal asymptote is y = 1.5. 
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(iv)(b) The graph of y? = f (2) ... 

e Is symmetric in the x-axis. 

¢ Has the same x-intercept as y = f (x), namely (7/3,0). 
¢ Intersects the graph of y = f (x) wherever f (x) = 1. 


e Is empty wherever f (x) < 0. So here, it is empty between the vertical asymptote x = —0.5 
and the x-intercept. 


e Has horizontal asymptotes y = +V 1.5. 


e Has no y-intercepts 
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7 e 
-1  (x+1)(x-1) 
As x > +1, y > +0. So, two vertical asymptotes are x = 1 and x = -1. 


A562 (9233 N2008/1/14)(i) Write y = aI 


As x > +00, y > 0. So, a horizontal asymptote is y = 0. 
If x =0, then y =0/ (0? ~ 1) = 0. So, the only y-intercept is (0,0). 
If y =0, then x =0. So, the only x-intercept is (0,0). 
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Ay 











(ii) The graph of y? = f (2) ... 
e Is symmetric in the x-axis. 


e Is empty wherever f (x) <0. So here, it is empty to the left of « = -1 and between x = 0 
and z= 1. 


¢ Intersects the graph of y = f (x) wherever f (x) = 1. 


¢ Has the same x-intercept as y = f (x), namely (0,0). Of course, this is also a y-intercept. 


e 
At the origin, the tangent to the curve y? = wo] is vertical. 
x — 


2015, Contents www.EconsPhDTutor.com 











Lv 








(iii) For x # +1, 





qa = 
(iv) Try 7; =1. Then 


vo = Ji tae" =V1+e! » 1.169564, 

vg = /1+ roe = V1 + 1.169 564e~ 1169564 w 1.167541. 

aq = V1 + 2ge-® = V14 1.167541 e71167 541 w 1.167587. 
So the positive root of x= /1+zre* is x» 1.17. af 
A563 (9740 N2008/II/4)(i) Starting with the graph of y = 2’, 
1. Translate 4 units right to get y = (#- 4)*: then 


2. Translate 1 unit up to get y =(x-4) +1. 


Note that Domain f = (4, co0)—and in particular, the graph of f does not include the point 
(4,1). 
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(ii) Write y = f (x) = (x - 4)? +1 € Range f = (1, 0). 

Do the algebra: x = 44 Vx-1. Since x > 4, we can discard x = 4- Vx-1 and be left with 
g=4+vVar-1. 

So, f is one-to-one and its inverse is the function f~! : (1, 00) - (4,00) defined by f7' (x) = 
Vae-1+4. 

(iii) See above. 

(iv) Reflect (the graph of) f in the line y = x to get (the graph of) f~'. 


(v) Any point at which f intersects the line y = x is also a point at which f intersects 
f 1.8” And so, let us find points at which f intersects y = x. To do so, write 


f@Q=2 or (x-4)?+1l=2 or x? -9r +1720, 


(-9)-4()(17)_ 94. VB 





or, 


Since Domain f = (4, co), we may discard (9 -Vv 13) /2<4 and be left with (9 + 13) /2>4 
as one solution to f (x) = f-' (x). xt 
The above answer probably sufficed for the A-Level exam. However, it is incomplete (and 
thus incorrect) because there may be additional points at which f (x) = f~' (a2) but which 


are not on the line y = x. My guess is that those who wrote this question made the mistake 
of believing that the following statement is true: 





672 Fact 53 of this textbook. 


2017, Contents www.EconsPhDTutor.com 


“If f intersects f~' at some point P, then f also intersects the line y= at P” &X 


As discussed in Ch. 24.6, the above statement is false. 


It turns out that there are no other points at which f(z) = f~' (2). However, the above 
answer has not shown this. For how to show this (and hence complete and correct the 
above answer), see footnote.°” 


2x? — 2-19 | _ ee 19 - (a? + 30 +2) _ @* 4x -21 











A564 (9740 N2007/I/1 = <0 
( Ap) x? +30 +2 x? +30 +2 pesnen 
207-2 -1 2x7-x-1 2_ 4dr -21 - 
pect ae 9 o—_ O=1-1< 27 2% 9, _ u-4e _ (+3) (x 7) 

x? + 3x +2 x? +3n+2 u2+3n+2 (4+1)(x+2) 
ane - , + (+3) (@-7) 
my | (x +1) (+2) 
Sign diagram: -3 -2 -l 7 
The inequality holds if and only if x € (-o0, -3) U (-2,-1) U (7, «). * 


A565 (9740 N2007/I/2)(i) Since Range g = Rj ¢ Rx {3} = Domain f, fg does not exist. 


Since Range f = Rx {0} ¢ Domaing = R, gf exists. Assuming Codomain g = R, we have the 
composite function gf : R\ {3} > R defined by 





673We give two methods: 


Method 1. f (x) = f-* (a) — (x-4)?+12Vc-1+4 —> x? -82+13=VJVr-1 
= x’ + 6427 + 169 - 162° + 26x? - 2082 = 2-1 


= 0 = a+ - 16x? + 90x? — 209x + 170 = (x? — 9x + 17) (ax? + bx + c) 


Comparing coefficients on the x*, constant, and x® terms, we have a = 1, c = 10, and b- 9a = -16 (so, 
b=-7). 

Hence, ax* + br +c =a? -7x +10 = (x -2)(x-5). 

So, the other two possible solutions to 2 are 2 and 5. 

Since 2 ¢ Domain f = (4,00), we may discard 2 as a possible solution. 

Next, let’s check whether 5 actually solves 2: (5-4)?+1=246=V5-1+4. It doesn’t. 


Altogether, the unique solution to f (x) = f~' (2) is (9 + v13) /2. This completes the answer. 
Method 2 (calculus). Define g : (4,00) + R by g(x) = f (x) - f-' (x) = (a@- 4)? +1-Vxr-1-4. 
Compute g’ (x) = 2 (a -4) 

















; We now show that g’ (x) >0 for all x € (4, 00): 


22x - 


1 4 
—_> 


<> 16(2-4)?> 








1 
'(2)s 0. = 2—4) > ——.. =. 4(e—4)5 
i! (2) (4) > Fame we A -4) — 


1 
vo=l 
16(x-4)?(a-1)>0 — > we (4,00). 
At <> , we use the fact that both sides of the inequality are positive (because x > 4). 


At <> , we use the fact that x-1> 0 (again because z > 4). 
We’ve just shown that g’ > 0 everywhere. So, g is strictly increasing everywhere. Hence, there exists at 


most one x such that g(x) = f (x) - f-' (x) =0 or f (x) = f-* (2). 
We already found one such x above—namely (9 + v13) /2. So, this must be the unique solution to 
f(a)= ff" (2). 
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af (0) = a(t ())=9(-5) = -a 





(ii) Write y = f (x) = 1/(a-3) € Range f = Rv {0}. 

Do the algebra: x = 1/y +3. 

So, f is one-to-one and its inverse is the function f~' : R \ {0} + R \ {3} defined by 
f(y) = Ty +3. 


Qn +7 3 
A566 (9740 N2007/1/5) y= ——~ =2+ 


* pa ~ g4D 








Starting with y = 1/z, 

1. Translate 2 units left to get y =1/(x+ 2). 

2. Stretch vertically (outwards from the x-axis) by a factor of 3 to get y = 3/(x# +2). 
3. Translate 2 units up to get y=2+3/(a# +2). 


This is a rectangular hyperbola. 











As 7 > -2, y > +00. So, x = -2 is a vertical asymptote. 

As 2 > +00, y > 2. So, y = 2 is a horizontal asymptote. 

If x =0, then y = 2+3/(0+ 2) =3.5. So, the only y-intercept is (0,3.5). 
If y =0, then 0 = 27 +7 or x = -3.5. So, the only x-intercept is (-3.5, 0). 


A567 (9740 N2007/II/1) Let p, m, and | be the prices (in dollars per kilogram) of the 
pineapples, mangoes, and lychees, respectively. Then the given table yields this system of 
equations: 
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1.15p+0.6m+0.55128.28, 1.29 +0.45m+0.3126.84, 2.15 + 0.9m + 0.651 2 13.05. 


Taking 2x 2_3 yields 0.25p — 0.05/ = 0.63 or or 25p — 5! = 63 or | £ 5p — 12.6. 
Taking 4x 23x 4 yields 1.35p — 0.45! = 2.52 or 1385p — 451 = 252 or / 2 3p — 5.6. 
Putting * and together, p=3.5. Now we can also find / = 4.9 and m = 2.6. Sf 


So, Lee Lian paid 1.3p + 0.25m + 0.5] = 4.55 + 0.65 + 2.45 = 7.65 dollars. 
4 
rt+1 “ffs 13 . 
xr-3 xr-3 
As x > 3, y > +0. So, x =3 is a vertical asymptote. 


A568 (9233 N2007/II/4)(i) Write y = 








As x > +00, y > 4. So, y =4 is a horizontal asymptote. 
(ii) If x =0, then y = -1/3. So, the only y-intercept is (0,-1/3). 
If y =0, then 0 =42+1 or x =-1/4. So, the only x-intercept is (-1/4,0). 


y & 








av 
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| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
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| 
| 
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| 
| 
| 








(iii) Write y = f (x) = 4+ 13/ (x -3) € Range f = R\ {4}. 
Do the algebra: x = 13/(y-4) +3. 


So, f is one-to-one and its inverse is the function f~! : R \ {4} > R \ {3} defined by 
fF (y) = 13/ (y-4) +3. 
A569 (9233 N2006/1/3)(i) Range g = R* ¢ R* = Domain f. So, fg exists and 


fg (w) =f (g(a) = F(=) = 5-2 43-43, 


Range g = R* ¢ R* = Domain g. So, g? exists and 
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9? () = 9(9(2))=9(=) oer 
Let i= g?. Then g(x) = 9 09% (x) = god? (x) = g (2) = 3/x. 
(ii) h(x) = 5f (x) +3. 


x-9 x-9 r-9-27+9 x(1-2) 
A570 (9233 N2006/IT/1 <1 <= 02 -1 = ———_ = — ——.. 
( / i) x? —9 x? —9 (2-3) (x+3) 








= j + ra 3. Se x(1-2x) 
| ia | (x - 3) (x +3) 
Sign diagram: —3 O 1 3 





Take care to note that x # +3. 


The inequality holds if and only if x € (-—o0, -3) U[0, 1] U (3, 0). *t 
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155.2. Ch. 134 Answers (Sequences and Series) 


1 
ADd71 N2019/I i te aa = 
571 (9758 019/1/6)(i) Write Ar2—~1 9 Or 4] ei 





A B  2(A+B)r+B-A 


Ar2—] 


Comparing coefficients, A+ B=0 and B- A=1, so B=1/2 and A=-1/2: 











1-1/2 1/2 
Ar2-1 9 2r4+19 2r-1 
m4 mr -1/2 1/2 
ce yy ra ae (53 1) 
(aril bra) Arar 
= S+-)+(~+4)4+(A+F]+ - 
3° 5 7 





1 10 1 n 10 











1 10 1 
= lim —— = eS 
Ar2?-] noo2nt+1 2-10+1 2 21 42 





A572 (9758 N2019/I/8)(i) For the arithmetic series, the 64th term is a + 126a; so, the 


sum of the first 64 terms is 64 (a +a+126a) /2 = 32 x 128a = 2'a. 


The kth term of the geometric series is a2"~!. So, k = 13. 


(ii) O= f+ fr+fre+fre=f(lt+rt+r?+r3) = f(1+r)(1+7°). Since f #0 and r eR, it 


must be that r =—-1 and also, f can be any real number. 





| 1 a 
The sum of the first n terms is Lr ; ) = il ((-1)"-1)= 
| i — 


for n even, 


for n odd. 


(iii) Let a <0 be the first term and d be the common difference. We are given that 


a+(a+td)+(a+t2d) + (a+ 3d) =4a+6d = 14 and a(a+d)(a+2d)(a+3d) 20. 


7 -3d 
From = a= os 


From Z at least one of these three equations is true: 


as —d, a= -2d, or a 2 34. 


If 3, then d=7 and a=-7. 
If < then d= —-7 and a= 14, contradicting a < 0. 
If 2 then d = -7/3 and a =7, contradicting a < 0. 


Hence, d= 7, a = -7, and The 11th term of the series is a + 10d = 63. 
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A573 (9758 N2018/1/8)(i) u.=2u,+A-1 — > 15=2-5+A —> A=5. 
ug = up + A-2=2-154+5-2=40. 


(ii) Plug n = 1,2,3 into the given equation: 


uj = 5 =a(2')+b-1l+c=2atbte, 
Ug = 15 =a(2?)+6-2+c24a+2b+¢, 


ug = 40 =a(2?)+b-34+c28a+3b+¢. 


First, 2x 4-2 yields c= -5. Next, 2-4-2 yields 2a-c4£2a+5=200ra=7.5. 
Now plug * and 2 into 2 to get 15 = 4x 7.5+2b-5 or b=-5. 
(il) Ya = 5° [7.5 (2") - br —-5)=7.5) 27-5 > r-5) 1 
r=l r=1 r=l r=1 
=7.5(2"*1 - 2) /(2-1)-5n(n+1)/2-5n=7.5-2"1 -15-2.5n(n+1)-5n 
a7 532)! 29 bn” =7 bn 15, 


A574 (9758 N2018/1/11)(i)(a) $100 - 1.0027 » $102.43. 


1.002'8 — 1.002 
(i)(b) $100 - (1.002! + 1.002" + --- + 1.002!) = $100 ~~ — « $1215.71. 


(i)(c) On the last day of the nth month, the account will contain 





1.002"*! 1.002 _ |$2988.65, for n = 29, 
1.002 — 1 $3094.82, for n = 30. 





$100 - (1.002! + 1.0021? + --- + 1.002”) = $100 


Hence, the account will first exceed $3000 after the last day of the 29th month—or equiv- 
alently, the first day of the 30there month, i.e. 01 June 2018. 


(ii) (a) $ (100 + 120). 
(ii)(b) The total at the end of 31 December 2017 is 
$ [2400+ b(1+2+---+24)] = $(2400 + 300b) = $2800. 
50, b=4/4, 
(iii) On the last day of the 60th month, the totals under plan P and plan Q are 





1.01%! - 1.01 
100201 10! 6.000 
Setting these two equal and rearranging, b = ee x 1.23. 


1830 
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A575 (9758 N2017/1/9)(a)(i) un = Sn - Sp = An? + Bn-[A(n-1)° + B(n-1)] = 
2An-A+B. 


(ii) ee el ee 
2 minus = yields 14A = 42 or A=3. Now by = , B=48-19A = 48 -57 = -9. 
(b) r(r+4ly-(r-1Pr =r 20 (r +1) —(r-1) al = r?[(r+1-(r-1))(r+1l4+r-1)] = 


r?[2(2r)] =4r®. So k=4. 
yar : 


ee a eee ee ee 


Ae (r +1) -(r- 1)*r?] 


— 


=-07- 1 4n?-(n+1) =n?-(n +1) 











(c) We have An+1 _ rt] (n+1)! Pe: 
On a” {nl n+l 
7a lim |") = lim =0<1. 











By D’Alembert’s ratio test then, the given series converges. 


If you’ve done Part V (Calculus), you should be able to easily recognise that this is the 
Maclaurin series expansion for e”. In fact, this is even printed on List MF26, p. 2: 


A576 (9758 N2017/II/2). Let the arithmetic progression be (a;) and the geometric 
progression be (g;). Let d = aj-a, be the common difference in the arithmetic progression. 


(i) a; =3 and aj3 = 3 + 12d. So, (3+ 3+ 12d) x 13/2 = 156, so d = (2 x 156/13 - 6)/12 = 1.5. 
13 

(ii) The common ratio r cannot be equal to 1, because if so, > g =130¢;¢213%3= 392 156. 
i=l 

The sum of the first 13 terms is 3(1-r!?)/(1-r) = 156 <=» 3-3r'3 = 156-156r <=> 


— 52r+51=0. 
Use your graphing calculator to find that besides 1, the other two possible roots to this last 
equation are r * —1.451, 1.210. ae 


These are thus also the two possible values of r. 
(iii) We know that g, = 3r"' and a, =3+1.5(n-1). 
We are told that r * 1.210. We are told also that g, > 100ay. 


Thus, 3-1.210"! > 100[3 + 1.5 (n-1)] = 150+ 150n. Graph 3-1.210%! - 1502-150 in your 
graphing calculator. You should find that there is a positive x-intercept. To the left of this 
x-intercept, the graph is below the z-axis and to the right, it is above. 


This x-intercept is given by x x 41.149. Thus, the smallest value of n for which the inequality 
holds is 42. % 
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k 
A578 (9740 N2016/I/6) (i) Let P (k) be the proposition: )*r (r? +1) = ak (k+1)(k? +k+2) 
r=1 


We first verify that P (1) is true: 


il 


1 
Yor (r? +1) =1(1?+1)=2=5-1-2-4=7, 1(1+1)(1?+1+2) = J 
r=l1 


ice 


Now let k be any positive integer. Suppose P (k) is true. Below we show that P (k +1) is 
also true and hence, by the principle of mathematical induction, that the given proposition 
is also true: 


r(r? +1) re Sir (r24+1)4+(k+1)[(k+1)° +1] 


= Th (b+ 1) (A? +k +2) + (b+ 1) (B+ 2k +2) 
=(h+1)[ h(i +k+2) + (K+ 2k+2)| 
ss illus 

= + (41) (43 + 5K? + 10K +8) 
=—(k+1)(k+2)(k?+3k +4) 


= —(k+1)(k+2)[(k+1)?+(k+1) +2]. Vv 


i is side (es 


(ii) wp = up +12 +1=24+14+1=4. 
Up =U, +22 +2=44+842=14. 
ug = U9 + 3°43 = 1442743 =44. 


(iii) Through telescoping, we have ye (ty — Up-1) = Un — UO 2 tig =O: 


r=1 
But for r > 1, we also have u, — up_1 = Up—1 tretr—uUpperetr. So, 
Y (thet) £8 +r) = Yr (4 +1) 2 Th(k +1) (# +k +2). 


r=1 r=l1 


1 
Plugging this last equation into a we have un = ak (k +1) Ge +k+ 2) +2. 
A577 (9740 N2016/I/4). We are given 


a+3d+ br4, 
a+8d2 br’, 
a+ 11d 2 br! 
(i) 2 divided by = yields r!° = (a+11d)/(a+3d), while 2 divided by = yields r® = 
(a+ 8d) /(a+3d). Now, 
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at+l1ld a+ 8d g _ a+ Sdd ~ Ba 64d + a+ 9d _ 


5rto — 873.43 =5 - = 
7 ot a+3d maa a+3d 








Use your graphing calculator to find that the solutions to this 10th-degree polynomial 
equation are r x 0.74, 1. % 


You can verify by the Factor Theorem that r = 1 is indeed a root. So, given that |r| < 1, 
the only possible value of r is r » 0.74. 


(ii) The limit of the infinite geometric series is b/ (1-7). And the sum of the first n terms 
is b(1-r”)/(1-r). 
Hence, the sum of the terms after the nth is 


b= =6b(1-r") b br” 0.74" 


— = 1-(1-r”)| = x . 
l=F l-r a ( r")| l=r 0.26 








A579 (9740 N2015/1/8). First, note that in seconds, the required time interval is 
[5 400, 6 300]. 


(i) The time (in seconds) taken by A to complete the 50 laps is 


First term + Last term) x une eS =(T+7'+49x2)x oy = 507 + 49 x 50= 
2 2 


507 + 2450. 





So, we need 507 + 2450 € [5 400, 6300] or 507’ € [2950,3 850] or T € [59,77]. 
(ii) The time (in seconds) taken by B to complete the 50 laps is 


1-r°9  1-1.0259 1.0250 -1 
t = t————_. = t= 50# (1.02°° - 1). 
l-r 1- 1.02 0.02 ( ) 





So, we need 50t (1.02°° - 1) € [5 400, 6300] or 3192.267 < 50t < 3724.311 or 63.845 St < 
74.486. 


(iii) T = 59 and t » 63.845. So the times taken to complete the 50th lap by A and B are 
T+49x2=157 and tx 1.02*9 ~ 168 seconds. 


And so the desired difference is 11 seconds. 
A580 (9740 N2015/II/4)(a) Let P(k) be the following proposition: 


oe (r+2)(r+5)=—h k(k +1) (3k? + 31k +74). 


We show that P (1) is true: 
: 1 
a (r+2)(r+5)=1x3x6= 18= 1 x2%108= 1 (1+1)(3- 1?+31-1+74). / 


We next show that for all j « N, if P(j) is true, then P (j + 1) is also true: 
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Yr(re2y(r45) = Sor(r+2) (45) + (541) G+) G46) 


P(j) 1 .,. ; 
©) 5 (j +1) (372 +31) +74) (J +1) (GF +3) G+ 6) 


rad 

-i— (373 + 317? + 74) + (f+ 1) (72 + 97 +18) 
ged 23 2 3 2 : 

rie + 31j? + 74j + 127? + 108j + 216) 
Eo 

" i (37° +43j? + 182j + 12; + 216) 


= T°" (+2) (8) +1)? +31) +1) +74). v 





(b) (i) Write AB _ (2r+3)A+(2r+)B 
Qr+1 2r+3 = (2r+1)(2r+3) 
_(QA+2B)r+3A+B 

7 Ar? + 8r +3 


So 2A+2B +0 and 34+ B22. 
2x 2 minus + yields 4A = 4 or A=1 and thus B =-1. Hence, 


9 ae! 1 
Ar2+8r4+3 241 2r+37 





(ii) y_2 >| 1 1 
4r2+8r+3 SS \Or+1 2r+3 


r=l1 
(5-5) *(5-z)*(-9) 
=f--zt]Jtto-=]t+ttlo-aryt 
3. OO oS 7 7 9 
1 1 
3 














Gara, 
+ — 
2nt+1 2n+3 





(iii) The sum to infinity is 1/3. Hence, the difference between S,, and the sum to infinity 





_N 
oe eee 


<10°° — > 1000<2n+3 <— > n>498.5. 





2n+3 


So the smallest such n is 499. 
A581 (9740 N2014/1/6)(i) Let P(k) be the following proposition: 


Dk = (7 -4*). 


Ww! Re 


We show that P (1) is true: 
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1 1 
Pl 3 ( ) 3 ( ) 
We next show that for all j ¢ N, if P(j) is true, then P (j + 1) is also true: 


Djs = 4p; —7 


PO) 4 (7 _ gi) _ 
= ae 4)) —7 








i] 
os _ Agri 
se we 
(ii) n 9 12 [2 n 
So Shane) = Fea-8) =3(E7-d4] 
r=1 m1? 3 pal 3 r=1 r=1 
1 1-4 1 1-4") 4 Tn 4ntl 
= 3 (7-4-7) = g(t 3 ) 9°39 


(b)(i) As n > 00, we have (n+ 1)! > oo and so ; > 0 and thus S;, = 1— 


1 
(n+ 1)! 
(ii) 1 (1 1-4 1 


n = On - Sn-1 = 1 - —— - ae 
" (aed! nl 


— nti 1 _ n 
(n+l) (nt (nt 0) 


es 1 
—> 
(n+1)! 


nl (n+1)! 


A582 (9740 N2014/II/3)(i)(a) She runs 8m in Stage 1 and an additional 8m in each 
subsequent stage. So, in Stage n, she runs 8nm. Altogether then, the distance run in the 
first 10 stages is 


Number of terms 
2 





10 
(First term + Last term) x = (8+ 10-8) x o 440 m. 


(b) The distance run in the first n stages is 


N i 
(First term + Last term) x wma ae (8 + 8n) x 5 =4n+4n? m. 





1 
Write 4n + 4n? > 5000 or n?+n-1250>0. By the quadratic formula: 


—1+,/12-4(1) (-1 250 
i= AS = -0.5+0.5V5001 » 34.859, -35.859. 
1 
Hence, > holds if and only if n < -35.859 or n > 34.859. 
Thus, the minimum number of stages to complete in order to have run at least 5km is 35. 


(ii) The distance run in the nth stage is 2""'-8m. Thus, the distance run in the first n 
stages is 
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(2:8) =8 y ght = gt =o" _ gm. 
k=1 2-1 


[wis 


k=1 


Let j be the largest integer such that 2/*° — 8 < 10000. Since 2'° = 8192 and 2 = 16384, 
we have 7+ 3= 13 or 7 = 10. 


So, the distance run after completing exactly 10 stages is 2'° — 8 = 8184m. 


So, at the instant at which he has run exactly 10km or 10000 m, he has completed 1816m 
of the 11th stage. Since Stage 11 is 2'!-!-8 = 8192m long , at this instant, he will not 
even have completed half of Stage 11. Thus, at this instant, he is 1816m away from O and 
running away from O. 


A583 (9740 N2013/1/7)(i) The nth piece is p = (2/3)""' x 128cm long. Applying In to 
1 
=, we get 


Inp = In[(2/3)"™* x 128] = In (2/3)""" +n 128 
= (n-1)1n (2/3) +1n 2’ = (n- 1) (In2-1n3) + 71n2 = (n+6)In2+ (-n +1) In3. 


Thus, A=1, B=6, C=-1, and D=1. 
(ii) Let S;, be the total length of string cut off after cutting off n pieces. Then, 


Oe aes 1=0/3)" pa 
S,= (= 128 = 128" = 384- 384/ =] . 
> (5) * 1-273 (;) 


2 n 
Thus, as n > 00, S,, = 384 -384(=} — 384. 


(iii) Let 7 be the smallest integer such that S; > 380. Then, 


2\I 2\4 j 4 
sj = 384 - 384(=] — or 1>384(3} or (5) > = 96 or 
n 96 
‘In—>1n96 057. 
ee or I> TBP) 


Thus, the minimum number of pieces one must cut off in order for the length cut off to 
exceed 380cm is 7 = 12. 


A584 (9740 N2013/1/9)(i) Let P (k) be the following proposition: 
1 
Dit (2r? +1) = Sk (k+ 1) (ke +k +1). 
r=1 
We show that P (1) is true: 
: 2 2 i 1 2 
YrQrslja1(-1 be ar ae aes) e +1l+l1). Vv 
r=1 


We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 
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r (2r? +1) = Sor (2r? +1) + (441) [2G 41241] 











PO) i+) (P4741) + G+) QP +443) 
=F (a page * (43? +89 +6) 

- "(9 +5)? +9) +6) 

= 5 (5 +1) (J+2) (7? +a) +8) 

= 5 (4 +1) (542) (7 #3) +3) 
=SG+)G+D[G++ G++ v 


CN) — f(r —1) = (2r3 + 3r24+7424) -[2(r-1)°+3(r-1)' + (r-1) +24] 
: (218 + 31241421) - [2 (7? -3r°s38-4) + 3 (1? 5294) + (x=) 424] = 


So, a=6. And hence, 


Yr? = 2S LF) -Fr-DI=S LF) -F + FQ) -F) ++) -F@-D] 
f 








r=1 r=l 
_f(n)-f (0) _ 2ni4+3n24+n4+24-24 2nF+3n?+n_ n(n+1)(2n+1) 
7 6 7 6 7 6 7 6 | 
(iii) n n n 
yf (r) = > (2r3 + 8r? +r +24) = 9° [r (2r? +1) + 8r? + 24] 
r=l r=1 r=1 
= [r(2r?+1)] +357? + ¥ 24 
r=1 
a (n+1)(n?+n+1)+ NAAN) 4. 5 
2" 2 
ote =. aoj0—1. 1 
A585 (9740 N2012/1/3)(i) us = ae and ug = Solent =a 
od_=1 me 1 
(ii) As n > 00, Uns —Un 70 <> us — tin = EE 0 ty > 5 


(iii) Let P (k) be the following proposition: 


We show that P (1) is true: 
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1 
m=2=2-5-—(5) -5. / 
3 3 3 \2 3 


We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 


Uj+1 = uj + ua ~ j= uy + (2-5 


py) #3) -3 1 ea) = l 


2 6 3 


A586 (9740 N2012/II/4)(i) On the first day of the nth month, she deposits 100 + 
(n-1)10=10n+90. Hence, through the first day of the nth month, her account has: 


Number of terms 


(First term + Last term) x 5 


= (100 + 10n +90) x 5 = 5n? + 95n. 


Let j be the smallest positive integer such that 57? + 957 > 5000 or 7? + 197 — 1000 : 0. By 
the quadratic formula, the two roots of x? + 19x — 1000 = 0 are 


_ -19 45/19? -4(1)(-1000) 19+ 627 

a 2(1) ~ 2 
Hence, j = 24. Thus, her account first exceeded $5000 on the 24th month—that is, on 
December 1 2002. 


(ii) Let S,, be the amount in his account on the last day of each month, after interest has 
been paid. 


Then $; = 1.005- 100 and Sj,41 = 1.005 (S;, + 100). 


So, in general: 








# —42.519, 23.519. 


L003? = A 


S, = 1.005”- 100 + 1.005""!- 100 +--+ 1.005-100 = 1.005 - 100 ——_—_ 
1.005 - 1 


= 20 100 (1.005” - 1). 


Let 7 be the smallest positive integer such that S; = 20 100 (1.005/ - 1) > 5000 or 201 - 
1.005! > 251 or, 


. 2ol 201 _ In(251/201) 
1.005’ > —— In 1. ln — re 
005! > or j In 1.005 > 2 ool or > mLO05 


w 44.541. 
201 


Hence, 7 = 45. Thus, his account first exceeded $5000 in the 45th month—that is, in 
September 2004. 


(iii) Let r be the interest rate. Then given r, the amount in the account on 2 December 
2003 is 


100(1+r)” re 100(1+r)™" jae 100(1 +r) + 10 
—_-—- _--_-_———"’ -_—_o =< 

Jan 2001 deposit has Feb 2001 deposit has Nov 2003 deposit has Dec 2003 
earned interest 35 times earned interest 34 times earned interest once has not earr 
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= 5000 or, 


36 

i —] 

We want the above amount to equal 5000. So, write roo tt") = 1 
r 


(l+r)e-1250r or — 8 0.01796 = 1.796%. * 


Note that + is an equation we haven’t learnt to solve in H2 Maths, so you’ll need to use 
your calculator here. 


A587 (9740 N2011/1/6) (i) 


if . I ies | ' pA, 
sin(r +5) 6—sin(r-5)0 = sinr@co838 + cosrd sin 50 — [sin-r0-cos0 - cos sin 56] = 2.cos 8: 


sin(r +4)0-sin(r-4)0 














ee . ° 2 
(ii) Rearranging (i), we have cosré = Saal . And so, 
sin 50 
: 1 ‘ 1 
n n sin(r+5)@-sin(r—-5)0 
8 op «Sen $)O=sin (r= 
=| oes 2sin 50 
1 3 2n+1 2 = I 
ae (sin $0 - sin 50 +sin 20 - sin 38-4---45in a @-sin 6) 
2sin 50 2 2 2 2 


2n+1 1.) sin(n+4)0 4 
=—— sin "9 - sin 58) = =. 
2 sin 1a 2 2 2sin $0 2 


(iii) Let P (k) be the following proposition: 


ke cos 50 - cos(k +5) 
y)sinrd = 4 
= 2sin 50 





We show that P (1) is true: 


1 9sint4sin@d —2sin—1@sind 9 gin V2=3/29 gin U2+3/29 
SS sinr@ = sing == Ee 





i 2sin 50 2 sin 50 2 sin 50 
P Q 
——s Ss 
1 3 
cos 59 — cos 50 
=a 5 v 
2sin 50 


where the last step uses the last formula printed under Trigonometry in List MF26, p. 3. 
We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 
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gel j 
y sinré = )\sinrd + sin (j + 1)0 
r=l1 r=l1 
P(j) COS 50 — Cos (j + 5)0 
2 sin 50 





+sin(j+1)0 

7 cos $0 —cos(j +5) 0+ 2sin 50sin (7 +1)0 

7 2sin $0 

cos $0 — cos(j +$)0+cos(j+5)0-cos(j+3)0 


2sin 50 


— 





cos $0 - cos (j + 3) / 
7 2 sin 50 
where again + uses the same trigonometric identity as before. 


as desired. (Again, to get from 2 to es I used the same trigonometric identity as before.) 
A588 (9740 N2011/I/9)(i) The depth drilled on the nth day is 256-7 (n - 1) = 263-7n 
metres and the depth drilled through the nth day is 


Number of terms 
P 
Thus, the depth drilled on the 10th day is 263 - 7-10 = 193 metres. 


Let 7 be the smallest integer such that 263 - 77 < 10 or j > 253/7 » 36.1. Then 7 = 37. So, 
the total depth drilled is 


519 7 
= (256 + 263 —7n) 5 = n~ sn’. 


D,, = (First term + Last term) x ; 


519 7 
D; = D37 = a 3f - - 377 = 4810 metres. 


(ii) Through the nth day, the depth drilled is 


2 sy 1-(8/9)" | (5) 
d, = 5.256(—| =256——~“—*~ =2304]1-[- tres. 
> (5) 1-8/9 9 metres 


By “theoretical maximum”, the writers of this question probably mean this: 
n 8 r-1 8 n 
lim }) 256 (5 = lim 2304]1- (=) = 2304 metres. 
nN co pel nN Co 


Q\J 
Let 7 be the smallest integer such that d; > 0.99 - 2304 or 2304 f - (=| | > 0.99 - 2304 or, 





8)\/ 8\/ ,_ 8 
1-(5) > 0.99 or 0.01> (=) or a or 
In 100 
39.1. 
** in (9/8) 
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Thus, 7 = 40. 
A589 (9740 N2010/143),(i) 5,,_; =n (2n+c) -(n-1)[2(n-1) +c] 

= 2n? + en -(2n? -4n+2+cen-c)=4n-2+e. 
(ii) Since u, = 4n-2+c and uns =4(n+1)-2+c=4n+2+C, we have 


Unt+1 = Un + 4. 


A590 (9740 N2010/II/2)(i) Let P (k) be the following proposition: 


Yor (+2) = ah(k +1) (2k +7). 


r=l1 
We show that P (1) is true: 


1 
Yor (r+2)=1-8=3=5-1-2-9= 1 (141) (2-147), J 


We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 





Yir(r+2) = Sr (r+2)+ +(j+1) (+3) 
tiene relia 
eed eG) 42 = (67 +18) 

j 


= 1 (aj? +13) +18) 
= (+2) (2) +7) 
== (441) (J+1+1) [2 +1) +7] Y 


(ii)(a) Observe that 














1 05 0.5 
r(r+2) or r+2 
Hence, 
5 = (¥- “=) 
= t a SVT 42 
_9.5 0.5 0.5 0.5 0.5 0.5 0.5 05 05. 05 
— — —— $¢ + +—- 
it 3 2 4 3 5 n=l nel © we 
_ 0.5 05 0.5 0.5 3 i 1 


— — —— - — — ——__ - J 
1 2 n+l nt+2 4 2(n+1) 2(n+2) 
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(b) In the formula found in (ii)(a), as n > oo, the second and third terms vanish. Hence, 





eek eee Se "4 


A591 (9740 N2009/1/3)(i)_ n(n +1)-2(n-1)(n+1) +(n—1)n 























n-1 n nei (n-1)n(n+1) 
ene 2 alan =7-—2(-1). 2 
7 n(n? -1) — Ban nn 2% 
So, A=2 
(ii) 
s_! ol i 2 = 
Sar3-r 244\r-1 r r+il 
5(5-5¢5¢gstgta-ats ota) 
=—(--=+-4+7---4+-4+=--—4+54++-+ —— - —— 4-4 —- = 
2\1 2 3 23 4 3 4 °5 t= Meal ne Teel a eel 


Observe that the terms with denominators 3 through n- 1 are cancelled out. Hence, 


a | 5(; 2 1 1.2 il (5 1 1 1 il dl 
= — Sel et ee eee et a 
sr3-r 2\1 2 2 n n n+if/ 2\2 n n+l} 4 2n 2(n+1) 


(iii) In the formula found in (ii), as n > oo, the second and third terms vanish. Hence, as 
nm 


noo, ) ——-—>-. 
a 
saore-r A 


A592 (9740 N2009/1/5)(i) Let P (k) be the following proposition: 


k 
Sr? = =k (k +1) (2k+1). 
r=1 
We show that P (1) is true: 
: 1 1 
De ae a DS ll) Ge led), J 
r=1 


We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 


Sr = Pets Ge PY AiG) +1) + G+) 


j+l1 





- “= (2? +j)+ 1 ejssnl*banctnen 


= PP (+2) (29+8)= 2 GF G +14 DQG +141 y 


ew 257 ye = -2n (n+ 1) (2, Qn +1) -an(n+1)(2n+1) 


n(2n +1) 
6 
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(8 gn +2) ~ PERE D fy 41) = BOD (an 51), 


ae 
A593 (9740 N2009/1/8)(i) Let r be the common ratio. Then the 25th bar has length 


20r24 = 5cm and so r= (=) 0.51/12, 





20 
In the limit, the total length of all the bars is i # 356.343 cm. 

-r 
And so indeed, no matter how many bars there are, their total length cannot exceed 357 cm. 


a 272.257cm. 





1-r? 
(ii) The total length is L = 20 i 


The length of the 13th bar is 20r!* = 20- (0.5412) = 10cm. 


(iii) The total length L is 
N f 2 
Licence easier ie = : 
125 + 300d = 272.257. 


So, dx 0.491cm and the length of the longest bar (the 1st bar) is 5+ 24d * 16.781cm. 
A594 (9740 N2008/I/2). Let P(k) be the following proposition: 


Se= Douce Zh (k+1)(4k4+5). 
We show that P (1) is true: 


1 

i 

o1e ) Up = ty = 1(2-1+1)=3=—-1-2-9= 
r=1 


1(1+1)(4-1+5). J 


one ae 


We next show that for all j « N, if P(j) is true, then P (j + 1) is also true: 


J . PG) 1. . . . ee oer oe 2! 
Syst = Do Ur + (9 +1) (27 +3) - gi FFD 4G +5) + GF 1) (2943) = (A S5)) (1 
r=l 


TS (47 +17) +18) = 17" (#2) (Aj +9) = 7 “(j+1) (+1 +1) [4 (j+1)+5]. Vv 


A595 (9740 N2008/1/10)(i) On the first day of the nth month, she saves 10+3(n-1) = 
7+ 3n dollars. 


Thus, the total saved through the first day of the nth month is 


N f 1 
eee ee 


(First term + Last term) x 5 9 9 9 


3 17 
Let 7 be the smallest positive integer such that ar + oo > 2000 or 377 + 177 — 4000 > 0. 


By the quadratic formula, the solution to 3a? + 17x — 4000 = 0 is 
-17+,/17? -4(3) (-4000) -17+,./482 
poe VEE) eee 39.458, 33.791 % 
2(3) 6 
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Thus, 7 = 34. So, she will have saved over $2000 on 1 October 2011. 


In the 1st month (Jan 2009), she has saved 10 dollars. In the 2nd (Feb 2009), she has saved 
10+ (10+1x 3) dollars. So in the nth month, she has saved 10+ (10+ 1x3)+(10+2x3)+ 
...[10 + (n= 1) x 3] = [20+ (n-1) x3] x 5 = 8.5n + 1.5n? dollars. Set 8.5n + 1.5n? = 2000 


and solve: 


8.5 + \/8.52 —4(1.5)(-2000) -8.5 + 109.874 
n= —_o oo _ >=W a —_ 
3 3 


—8.5 + 109.874 
We can ignore the negative root. So n= a x 33.791. So it is only in the 34th 


month that she has saved over $2000. That’s 1 October 2011. 


(ii)(a) At the end of 2 years, her original $10 has earned 10 x 1.0274 - 10 » 6.084 dollars in 
compound interest. 


(b) Just after interest has been paid on the last day of the nth month, the total in her 
account 1s 

1.02" = 1 
102= 1 


10+1.02” + 101.02" + -4+<.4:10+1.02' = 101.025 = 510 (1.02” - 1) dollars. 


Hence, at the end of 2 years, just after interest has been paid on 31 December 2010, the 
total in her account is 


510 (1.0274 - 1) » 310.303 dollars. 


(c) Let j be the smallest positive integer such that 510 (1.02/ - 1) > 2000 or 510-1.02/ > 2510 
or, 


- 261 251 251 
1.02/ > a2! or jin 1.02 >In at or iby a + In 1.02 + 80.476. 
51 51 51 


Thus, it is only after 7 = 81 complete months that her total savings first exceed $2000. 


A596 (9233 N2008/II/2). Let a, and g, denote the nth terms of the arithmetic and 
geometric progressions. Let d and r be the corresponding common difference and ratio. We 
are given 


1 | 1 
2+ 92-5 or 5td+5=5 or d+5*0. 
1 1 2 4 2 3 
And, a3 + 93 = 3 or 5tid+>=5 or 2d+ > 2-5. 


2x + minus 2 yields r — r2/2 = 3/8 or 4r? - 8r +3 = 0 or, 


8+ y(-8) -4(4) (3) = 


1 13 
Bae 
2 (4) 5 29 


T= 


Since the geometric progression converges, |r| < 1 and so r = 1/2. And thus, its sum to 
infinity is 


2037, Contents www.EconsPhDTutor.com 


gl 2 1/2 = 
ter TH172 





if 


A597 (9740 N2007/1/9)(i) Using your graphing calculator, a » 0.619 and @ * 1.512 
(calculator). % 


(ii) Suppose z, > DL. Then p41 -X%_ > 0. Or, 


il 1 
Buel — In = Ge" — Ln + 0 or 30 L=0. 


1 
Equivalently, L equals a solution to 3° -x=0. So, L equals a or £. 


1 
(iii) If x; = 0, then x = 3 x3 8 0.465, 24 » 0.531, x5 » 0.567, reg » 0.588, ..., 215 = 0.619. 
So the sequence converges to a * 0.619. 
If x; = 1, then x » 0.906, 73 ~ 0.825, 24 » 0.761, r5 ~ 0.713, rg ¥ 0.680, ..., x17 0.619. So 
the sequence converges to a x 0.619. 


If x1 = 2, then x2 » 2.463, 73 » 3.913, v4 » 16.690, x5 ¥ 5903 230.335. “Clearly”, the sequence 
diverges. 


1 
(iv) From the graph of y = e” — 3x, we observe that if a < x, < 6, then e” - 3x <0 or 3° <2 
or Xn+1 < Xn. 


1 
Similarly, we observe that if x <a or x > 2, then e” —- 3x >0 or 3° SP OF Lai > Da: 
(v) If x, > 6 * 1.512, then (iv) tells us that x,,; > 2, and therefore that the sequence 
diverges. We saw this with x, = 2 in (iii). 
If x, € (a, 8) » (0.619, 1.512), then (iv) tells us that 2,41 < Zy. We saw this with x; =1 in 
(iii). 
If rz, <a*0.619, then (iv) tells us that x,41 > 2p. We saw this with x, = 0 in (iii). 
A598 (9740 N2007/1I/10)(i) We are given that the first term of the geometric progression 


equals a. 


' 1 2 
We are also given: ra=a+d, rea =a+5d. 


Rearranging + and 2 so that d is on one side, we get 


_a(r-1) _a(r*-1) 


: ; or Br —-5 =3r?-3 or 3r?2 -—5r+2=0. 


d 





(ii) 3r? - 5r + 2 = (8r -2)(r-1) =0 and so r = 2/3 or r=1. But if r = 1, then by 2 d=0, 
contradicting our assumption that d #0. Hence, r = 2/3. Since |r| < 1, the geometric series 
converges to: 





= 30. 
1-r 


2038, Contents www.EconsPhDTutor.com 


“t) From £. d= _ 
(iii) From =, 5 3 
“nd =an(1-"—) el 

18 








- —a/3 
ee a =-=. And now, 








S=[a+a+(n-1)d]>=an+ 


da n(19-n)>72 <> n2-19n+72 <0. By the quadratic 





S>4a <— an 
formula, x? — 19x + 72 = 0 has solutions: 


19#4/ (=19)° =4(1) (72) 41 
v= ( 7” (1) ( ? - MEV 5.208, 18.772 


ae 


So, é holds if and only if 5.228 < n < 13.772. Of course, n must be an integer. And so, the 
set of possible values of n for which é or S'> 4a holds is {6,7,8,9, 10,11, 12, 13}. 
A599 (9740 N2007/II/2)(i) Let P (k) be the following proposition: 


1 
Uk = ee 
We show that P (1) is true: 
il 
U1 = l= 2’ J 


We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 
+1 | (f+1)°- (+1) 


- 7 2j7+1 py) 1 
GEL = hy y ~ WD TBy. 5} Bnd 2 
(+1) PP G+1) G41) 
. i 


P41 (+1) 


N 


(ii) No On+1 _ 
2 Binal? = 2, (tin Une) 
= U1 — Ung + U2 — UZ ++ FUN -— UNS 
1 
= U1 — UNG = 1- ——_,. 
(N +1) ra 


(iii) In the formula just found in (ii), as N > oo, the second term vanishes, so that the 


series converges to 1. 
2(n+1)=1 
od) Thus, 





Zn + 1 
iv) First ob that = 
(iv) First observe tha P+)? Gel Gait)? 
> Qn-1 2 Qn+1 | 1 * 
pam? (n-1)? Sj P(mt1)? WN 
www.EconsPhDTutor.com 
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A600 (9233 N2007/1I/14). Let P(k) be the following proposition: 


ko cos $a —cos(k+3)a 
)) sinrx = ———__.——. 
r=l 2sin 52 
We show that P (1) is true: 
1 jj Pe To l i 
singsin5a” , cossx-—cos(1+35)z 
) sinraz = sing = —_—— = ——~_+~——_. "A 
= 2sin 5x 2sin 5x 


(= used the last trigonometric identity printed on List MF 26, p. 3.) 
We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 


jt 7 
yi sinrz =) sinrx +sin(j+1)a 


r=l r=l1 


1 24 il 
P(j) COS 5X — COS (j + s)a 


1 


+sin(j+1)z 
2sin 5x 


1 - 1 J F : 
7 cos 5x — cos (j + $)a+2sin5zsin (j +1)a 


1 








2sin 5x 
cos 42-cost++¥) 2+ cosl¢+t x—cos(j+3)a 
2 2 2 2 J+ 9 
2sin $x 


Coase = cos (g21+5 a y 


2 sin Su 


(Again, 2 used the same trigonometric identity as before.) 





<a a peal, 26 
A601 (9233 N2007/II/1). 5° 3"? =9 5° 3" =9-3 =<} (3? -1). 
r=1 r=1 - 
2 
A602 (9233 N2006/I/1). The first term is S; = 6 - 31 = 4. 


2 4 il 
The common ratio is (Sp - $1) $1 $245) -1= (6-55) +4-1- 5-1-5. 


A603 (9233 N2006/I/11)(i) Let P (k) be the following proposition: 


is 1 
Dr = Gk (k+1)*. 


r=l1 


We show that P (1) is true: 


1 
yr ersi= 


i 1 
—@jde—(*(141)%. J 
rl rl (1 +1) 


T= 


We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 
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j+l1 


aa “yr Gj +1?) 25 2 (5 +1)24+ (5 +1) 


_ ae SA (ajay = LP 2s ajeay= AY G2) "A 


(ii) 22+ 4° +--+ (Qn)? = » (2r)° =8yn = 2n?(n+1)?. 
oa Y @r-1)8 = 1? 384.2, n= 1)" 21? 20? teeny =|2? 24? 42a On) | 


n 1 
*- LC y= 7 (2n)° (2n+1)?-2n?(n+1)? 


a 


=n? he ee =n? (2n? -1). 
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155.3. Ch. 135 Answers (Vectors) 


A604 (9758 N2019/1/12)(i) We’re given Q = P+ A (-2,-3,-6) = (2, 2,4) +A (-2, -3, -6) 
for some A € R. By my Assumption 1, Q is on the plane x+y+z = 1. So, 2-2A+2-3A+4-6,A = 
1 or 8-11A=1 or X=7/11. Hence, Q = (2,2, 4) + 7/11 (-2, -3, -6) = 1/11 (8, 1,2). 


Similarly, we’re given R = S + w(-2,-3,-6) = (-5,-6,-7) + «(-2,-3,-6) for some p € R. 
By my Assumption 2, R is on the plane r+ y+z=-9. So, -5-2u-6-3u-7- 6p = -9 or 
-18-11p = -9 or w=-9/11. Hence, R = (-5, -6, -7) — 9/11 (—2, -3, -6) = 1/11 (-37, -39, -23). 


(223,56) et) Tt OR = eae -40, -25) = = (9 8,5) 
(2,3, 611.1, DI > 7/3 ee ae 


_ 109,85) G1, DI _ 22 
cos = 193 5 yl(1,1,1)| — S10 


(ii) cos0 = 





(iii) Consider the point A = (1,0,0) on the plane x+y+z= 1. Let B be the point 
on the plane x+y +z = —9 that is closest to A. Then B = A+ A(1,1,1) = (14+4,A,A) 
and so1+A+2A+A=-9 or \ = -10/3. Hence, the distance between the two planes is 


[AB] = |Al|(1, 1, 1)| = 10v3/3. 


(iv) k= sind V1-cos?@  /1-121/147_ \/26/147__—/170 
sin6 \/1—cos?6  \/1-484/510 /26/510 49 
sin 0 


< 
sin 9 





2 1.86. 





(v) Since 0,6 €[0,7/2],6>0 => k= 


A605 (9758 N2019/II/5)(i) For some A, €R, 
OX = OA+ AC = OB + BD =at (at 4b) =b+p(5a). 
Comparing coefficients, \ = 1/4. Hence, OX =1.25a+b. 
(ii) For some a, 6 € [0,1] with a+ B + 1, OY = a0C + BOD = (2a+58)a+(4a+)b. 
For some 7 € R, OY = yOX = 1.25ya+yb. Comparing coefficients, 
20+5821.25y and 4a+f2y. 


Now, 2x 2 — 3 yields 98 = 1.5y or B = 7/6 = 4 7/24 and hence a = 5y/24. 
Now, = implies a = 5/9 and 6 = 4/9. Hence, 


OY = 2a +58)a+(4a+9)b= 5 (5a+4b) = = (Fa +b) = 2X. 


And OX : OY = 3/8. 
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A606 (9758 N2018/1/6)(a) By the distributivity of the vector product (Fact 145), 
ax 3b = 2axc —> ax 3b-2axc=0 <> a x (3b - 2c) = 0. 


By Corollary 25, 3b - 2c || a or equivalently, 3b — 2c = Aa for some X. 


(b) We’re given a-a=c-c=1, b-b=16, and b-c = |b||e|cos 60° = (4) (1) (0.5) = 2. 
Also, b t+ c, so A #0. 

Now consider the scalar product of 3b — 2c with each of c, b, and a: 

¢ (3b-2c)-c=a-c=3b-c- 2c-:c=6-2=4 = > a-c=c-a=4/X. 

» (3b -2c)-b=a-b = 3b-b-2c-b =48-4=44 => a-b=b-a=44/). 

e (3b -2c)-a=Aa-a= = 3b-a-2c-a = 132/A- 8/d = 124/). 


Rearranging, \? = 124 or \ = +V124. 


A607 (9758 N2018/II/3)(i) AB = DC = B- A= (0,8,-1). 
(6, 3C= D054) (06-1) =(-5, 4.3), 


(ii) EB x EC = (5,4,-10) x (-5,4,-8) = (8,90, 40). % 
(0,0, 10) - (8,90, 40) = 400. 
So, the cartesian equation of the plane BCE is r- (8, 90, 40) + 400. 


(iii) Plane ABCD has normal vector AB x BC = (0,8, -1) x (-10,0,2) =(16,10,80). 
The requested angle is the angle between the planes ABC'D and BCE, which is (Fact 172) 


|(8, 90, 40)||(16, 10, 80)| /64 + 8100 + 1600\/256 + 100 + 6400 
4228 
-1 
= cos)! —————___ » 1.02. 6 
/9764,/6756 


(iv) The midpoint of edge AD is (0, -4, 2). 

Let (0,-4,2) + k(8,90,40) be the point on the plane BCE that is closest to (0,-4,2). 
This point must satisfy 2: 

[(0, -4,2) + k (8,90, 40)] - (8,90, 40) = 400 <> 9764h —360+80= 400 <> k= 680/9764. 


Hence, the distance between the midpoint of edge AD and the plane BCE is 


680 680 
k (8,90, 40)| = ——/9764 = —— ~ 6.88. 
eC N= 9764 9764 
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A608 (9758 N2017/1/6)(i) Assuming t takes on all values in R, the vector equation 
r = a+tb describes the line with direction vector b and which passes through the point 
with position vector a. 


(ii) The vector equation r-n = d describes the plane that has normal vector n and is of 
distance |d| away from the origin. (See Corollary 35.) 


(iii) Since b-n ¥ 0, the line is not parallel to the plane. Hence (Fact 171), the line and 
plane intersect at exactly one point. This point is given by the solution of the following 
system of (two vector) equations: 


r=a+tb and r-n=d. 


To solve these equations, plug the first into the second: 





(a+tb)-n=d =— a-n+tb-n=d — i- 


That is, f solves the above system of equations. 
And this solution corresponds to the point at which the line and plane intersect. This point 
has position vector: 


d-a-n 


a+tb=a+ b. 





A609 (9758 N2017/1/10)(i) The existing cable corresponds to the line described by 
r = (0,0,0) + A (3, 1, -2) (\ €R) 
The new cable corresponds to the line with direction vector: 
PO = Q- P=(5,7,a) -(1,2,-1) = (4,5,a+1). 
And so, this line may be described by 
r= (1,2,-1)+y(4,5,a+1) (we R). 


If the two lines intersect, then there exist real numbers \ and fi such that 


3\ 2144 
0 1 4 eee 
o]+Al 1 |=] 2 [+2] 5 or \ 22456, 
0 —2 —] +1 A 
24 2-14 (a+1)A (AR). 


3x 2 minus 4 yields 0 = 5+11f or f = -5/11. And now from 2 \= —3/11. If these values 
+ aw ° 3 
of A and fi satisfy =, then 


—3)\ 3 —5 3 22 
=) a | (a pa ae 
(=) +(a+ (=) or a 5 
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(i) Let R = (0,0,0) +A (3, 1,-2) = (3, 1, -2). 
If 2 PRQ is right, then PR 1 QR or PR-QR-=0. But, 


i A= 1 
PR-R-P=i| 1 |-] 2 |=] j-2 and 
= =i =e al 
3 5 a5 
QR=R-Q@=\| 1 |-| 7 |=] i-7 
ay ~3 ==3 


So, 


PR-QR = (3\-1)(3\-5) + (A-2) (A-7) + (-2441) (-24 43) 
= 14)? - 35) + 22. 


This is a quadratic expression in \ with determinant (-35)° — 4(14) (22) < 0. Hence, 
PR-QR#0 or PR{ QR and thus 2 PRQ cannot be right, 
(iii) Let A =(0,0,0) A CG, 1,-2)SA(3,1,-2). 


[PR| = \/ (3A-1) + (4-2) + (-2A 41) = VIA - 141 +6. 


|PR| is minimised at “—b/2a” (Fact 34): 





—14 


A= 5a) = (eb: 





Hence, R = (3, 1,-2) = (1.5,0.5,-1) and |PR| =4/ 1402 14) 46 =4/3.5—7 5 6 29/25. 


A610 (9740 N2016/1/5)(i) Method 1. u+v = (2+a,-1,2+b), u-v = (2-a,-1,2-)), 
and so, 


2+a 2-a b-24+2+5b 2b 
(u+v)x(u-v)=| -1 |x] -1 |=] (2+06)(2-a)-(2+a)(2-b) |=] 4b-4a 
2+b 2-—b —2-a+2-a —2a 


Method 2. Recall that the vector product has the following three properties (Fact 145): 
the vector product is distributive and anti-commutative; moreover, the vector prod- 
uct of a vector with itself is the zero vector. Hence, 


(u+v)x(u-v) =ux(u-v)+vx(u-v) (Distributivity) 
=uxu-uxvt+vxu-VvVxv ) 
=uxu+VxXUtVxXU-VXV (Anti-commutativity) 
=O+vxut+vxu-0O (Self vector product is zero) 
=2v xu. 
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But, vxu=]{ 0 {x} -1 |=] 2b-2a 
b Z —a 
2b 
So, (u+v)x(u-v)=] 40-4a 
—2a 


(ii) We are given that 2b=-2a <=> b=-a. 


2b —2a 
So, (u+v)x(u-v)=] 4b-4a |=] -8a 
—2a —2a 


And, \(u +-v) x (u--v)| = \V/ (2a)? + (-8a) + (-2a)? = V7202 = V72 |al. 


If V72|a] = 1, then a = +1/V72. 


(c) Recall that the scalar product is distributive and commutative. Hence, 


(u+v)-(u-v) =u-u-U-vV+V-U-V-V=U-U=u-Vviv-T- v-v = 0. 


Or, v-v=u-uor |v = ul’ or |v| = [ul = 2? + (-1)? +22 = V9 =3. 


A611 (9740 N2016/I/11)(i)(a) Let w = (-2,1,2) be l’s direction vector. We show that 
w is perpendicular to the two non-parallel vectors u = (1, 2,0) and v = (a, 4, -2) = (0, 4, -2) 
on p: 


wu = (-2,1,2)-(1,2,0) = -24+2+0 =0, J 
W:v = (-2,1,2)-(0,4,-2) = 0+4+-4 =0. J 
Since w 1 u,v, we have w 1 p and thus also / 1 p. 
To find the point at which / and p intersect, write 
il 1 0 = » ee ees eo 
-3 |+A] 2 ]+Al 4 [=] 0 J+t or -34+2\+40 28 
2 0 =), 1 2-27 2142. 


From é 12 (1 +t). Plug * into 2 to get 
-3+2[-2(1+#)]+4a=f or p2 (5t+7) /4. 


Plug 2 into 2 to get 
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2-2(5¢+7)/4=1+2% or £2 5. 


And now plugging © back into = and 2 we have \ = —8/9 and fi = 19/18. % 


(i)(b) “Obviously”, the two planes must be parallel to p, with one “above” it and the other 
“below” it. 


Upper plane 


The plane p 


Lower plane 





The plane p has normal vector n = (1,2,0) x (0,4,-2) = (-4,2,4). We have fi = n/|n| = 
1 

(—4, 2,4) /V36 = = (-2, 1,2). 

The plane p contains the point B = (1, -3, 2). 

The upper plane contains the point C = B+ 12n = B+4(-2,1,2) = (-7,1,10). 

The lower plane contains the point D = B -12n = B - 4(-2,1,2) = (9,-7, -6). 


Both planes have normal vector (—2, 1,2). We have 

OC - (-2,1,2) =144+1+20=35 and 00249) = -18-7-12=-37. 
And so, the two planes have cartesian equations: 

—2e+y+z=35 and -2r%+y+2z=-837. 

(ii) If a line and a plane intersect at zero or more than one points, then they are parallel 
(Fact 171). And so, w 1 nor w-n=0. 
But n = (1, 2,0) x (a, 4,-2) = (-4,2,4 - 2a). 
So w-n = (-2,1,2)-(-4,2,4 - 2a) = 18 - 4a = 0 or a = 18/4 = 9/2. 
A612 (9740 N2015/1/7)(i) OC =0.6a and OD = Jub. 


(ii) BC = OC - OB =0.6a—b and so the line BC can be written as r = b+ A(0.6a — b) = 
0.6\a + (1 -A)b, for A € R, as desired. 


AD =0D=0A= 5/11b — a and so the line AD can be written as r = a+ y(5/11b-a) = 
(1-p)a+5/iipb, for A €R, as desired. 

(iii) Where the lines meet, we have 0.6Aa + (1 - A)b = (1-yp)a+/ipb. Equating the 
coefficients, we have 0.6A =i yw and 5/110 21-). From 2, we have w = 1-0.6A. Plugging 
this into 2, we have 5/11(1-0.64)=1-A <— > 1-0.6A = 15-15) <> 85 = 6/5 <> 
A = 3/4. And 4 =0.55. Altogether then, the position vector of E is 0.45a + 0.25b. 
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AE = 0.55a -0.25b and ED = -0.45a + 9/44b. We observe that AE = 11 ED and so the 
desired ratio is 9/11. 


A613 (9740 N2015/II/2)(i) The angle is 


cos | (2,3,-6) - (1,0, 0) 7 cos | 2 
|(2, 3, -6)||(1, 0, 0)| - V49V/1 


(ii) The vector from P to a generic point on L is (2,5,-6) —r = (2,5,-6) — (1, -2, -4) - 
(2X, 3A, -6X) = (1 - 24,7 - 3A, -2 + 6A). The length of this vector is 


J 1281, 


(1-2)? + (7 - 3A)? + (-2 + 6A)? = V49)2 - 70A + 54. 


49)? = 704 54 = 33 <=> 49)? =70A 421 =0 <> 717 = 1014320 = (7143) O=1) =0 
A= 7c 1. 

Hence, the two points are (1, -2, —4)+3/7(2, 3, -6) = 1/7(13, -5, -46) and (1, -2, -4)+(2, 3, -6) = 
(3110): 

49? —70\ +54 is a U-shaped quadratic with minimum point given by 98\—70 = 0 or \ =5 /7. 
Hence, the closest point is (1,-2,—-4) + 5/7(2,3, -6) = 1/7(17, 1, -58). 

(iii) The plane is parallel to the vectors (2,3,-6) and (2,5,-6) - (1,-2,-4) = (1,7,-2). 
It thus has normal vector (2,3,-6) x (1,7,-2) = (36,-2,11). Moreover, we know that 
(1,-2,-4) is on the plane. Hence, a cartesian equation is 36x —2y + 11z = 36x 1-2 (-2) + 
11 x (-4) = -4. 

A614 (9740 N2014/1/3)(i) One possibility is that one or both are zero vectors. And if 
neither is a zero vector, then they point, either in the same direction or in the exact opposite 
directions—or equivalently, either 4 = b or a = —b. 





(ii) (L2 2,-2) = ca 
(iii) It is (12-2) ie 
3c 3 


A615 (9740 N2014/I/9)(i) The plane q is parallel to the vectors (1,2,-3) and (2, -1, 4). 
It thus has normal vector (1,2,-3) x (2,-1,4) = (5,-10,-5) and hence also normal vector 
(-1,2,1). It contains the point (1,-1,3). Altogether then, it has cartesian equation —x + 
2y+2=0. 

(ii) Line m has direction vector (-1, 2,1) x (1, 2,-3) = (-8,-2, -4) and hence also direction 
vector (4, 1,2). 

To find a point that is on both planes, try plugging in x = 0. Then from the equation of 
q, we have z = —-2y. Now plug this also into the equation of p to get 2y — 3(-2y) = 12 or 
y = 1.5. Hence, an intersection point is (0, 1.5, -3). 


Altogether then, the line m has vector equation r = (0, 1.5, -3) + A(4,1,2), for Ae R. 

(iii) AB = OB - OA = (4),1.5 + ,-3 + 2A) - (1,-1,3) = (4\-1,2.54+2,-6 + 2A). So 
AB) = (44-1)? + (2.54 A)? + (-6 + 2d)? = 21)? — 27 + 43.25. This lattermost expression 
is a U-shaped quadratic, with minimum point given by 42 - 27 = 0 or A = 9/14. So 
PaO 45s), 9)- (=,2 = =). 
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A616 (9740 N2013/I/1)(i) From the equation for p, we have z = 0.52 - 2. 





Plug + into the equation for q to get 2x -2y+0.5%-2=6 <=> y 21 .25r - 4. 


Now plug = and 2 into the equation for r to get 5x2 — A(1.25a” - 4) + u(0.5a - 2) =-9 <> 
Au — 50 

Ll 
Z 


Doll p= 3, from: =; 2 and 5 we have 








O.5ur+25-2n20 <> «4 


(ii) From é, if 4. = 0, then we have 250, a contradiction. So the three planes do not intersect. 
A617 (9740 N2013/1/6)(i) Every vector in the plane can be expressed as the linear 
combination of any two vectors with distinct directions (see Fact 125). 

4a+3c 


(ii) By the Ratio Theorem, ON = 7 





(iii) The area of triangle ONC is 


0.5|ON x OC| = 0.5 |= +" 
= 1/14|(4a + 3c) x ¢| 

= I/14\4a x c+ 3c x ¢| (distributivity of vector product) 
= I/14|4a x ¢| (vxv=0) 

= I/14|4a x (Aa + ub)| 

=I/14|4a x Xa+4ax pb] (distributivity of vector product) 





x C 








2 
= '/14|4a x pb] = ae 
7 
Similarly, the area of triangle OMC is 


0.5|OM x OC| = 0.5|0.5b x c 
= fab xe 
= 1/4|b x (Aa + pb)| 
=1/4|/b x Aa+bx pb| (distributivity of vector product) 
= 1/4|b x Aal (vxv=0) 
= 1/4\|b x al. 
Altogether then, 2u/7 = A/4 or A = 8/7. 
A618 (9740 N2013/II/4)(i) The angle 6 between two planes is given by the scalar 
product of their normal vectors: 
vos = (2221) -(-6,3,2) __ 16 __16_ 16 
(2-2, D632) V9v 49 3x7 21° 
So 6 = 0.705. 
(ii) The intersection line of two planes has direction vector given by the vector product of 
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their normal vectors: (2, -2,1) x (-6,3,2) = (-7, -10, -6). 

A point that is on both planes satisfies both equations 2” -2y+z = 1 and -6x+3y+2z =-1. 
Plugging in x = 0, the first equation yields z = 1+2y, which when plugged into the second 
equation yields y = -3/7. So a point that is on both planes is (0, -3/7, 1/7). 

(iii) The distance between a point a and a plane is given by |d-a-nJ, where n is its normal 
vector and d=r-n. 


For p;, d= 1/3 and for pp = d = -1/7. Hence, the distance between A(4,3,c) and the plane 
pi is 

















Fess ee eee 
3 “B38 1 a? 
and the distance between A(4,3,c) and the plane po is 
Cee) eee: — 
7 7 ar | a fa 








Equating these two distances, we have 








1 14-2 
<= 7 ese a7 8R) Ges 6240, 
OR ee 7+ 7c = 42 -6c => c= 35/13. 


A619 (9740 N2012/1/5)(i) The area of triangle OAC is 


0.5|OA x OC] = 0.5|a x (Aa + pb) 
=0.5|a x Xa+a x pb] (distributivity of vector product) 
= 0.5|a x pb] (vxv=0) 
= 0.54|(1,-1, 1) x (1, 2,0)| 
= 0.5.|(-2, 1,3)| = 0.5V/14pu. 


So 0.514: = 126 or p= 2 x \/126/14 = 2x V9 =2x3=6. 

(ii) c = \a+pb = \a+4b = A(1,-1,1)+4(1, 2,0) = (44\,8-A, A). We are given that |c| = 5V3. 
So (44+A)?+(8—A)? +2 = 32—BA +80 = (5V3) = 75 <> 3X2-8A45 = 0 = (3A—5)(A-1), 
so \ = 5/3 or 1. And c = (52/3, 61/3, 5/3) or (5,7, 1). 

A620 (9740 N2012/1/9) (i) r = (7,8,9) + A(8, 16,8). 


(ii) The position vector of N is given by p+ (pa +) v, where v is the direction vector of the 


— ee ea 
line, p is a point on the line, and a is the given point (1,8,3). Compute (8, 16,8) = ae ae 
and now, 

(—6, 0, -6) - (1,2, 1) (1,2, 1) 12 
7850) 4 SE Need EE 2780) 1 SZ), 
(7,8,9) EAL D = (7,8,9)- 2,21) = (4,7) 


By the Ratio Theorem, the ratio AN: NB =a: 1 satisfies 
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(7,8, 9) + a(-1, -8, 1) 


eae 1) = 
7) at+l1 





1 
= 7 (77 08 - 80,9 + a). 





Solving 5 = = we have 5a +5 = 7-a or a= 1/3. So the ratio AN: NB=a:1=1/3:1= 
Lea: 

(iii) To be written. 

A621 (9740 N2011/I/7)(i) m=0.5(p+q) = Ye6a+3/i0b. The area of triangle OM P is 


0.5|m x p| = 0.5|0.5(p+q) xp 
= 0.25|pxp+qxp| (distributivity of vector product) 
= 0.25 |q x p| (vxv=0) 
= 0.25 |3/5b x 1/3a| 
= 0.05 |b x aj = 0.05 |a x bj. 


(ii) (a) Since a is a unit vector, (2p)? + (6p)? + (3p)? = 1 or 49p? = 1 or p= 1/7. 
(b) |a-b] is the length of the projection vector of b on a. 
(c) ax b = 1/7(2, -6,3) x (1,1, -2) = /7(9, 7, 8). 
A622 (9740 N2011/I/11)(i) A normal vector to the plane is 
(4 - (-2) ,-1 - (-5),-3- 2) x (4-4,-1- (-3) ,-3- (-2)) 
= (6,4,-5) x (0,2,-1) = (6,6, 12). 


Another normal vector to the plane is a scalar multiple of the above, namely (1,1,2). We 
have (4,-1,-3)-(1,1,2) =-3. Hence, a cartesian equation of p is x + y+ 2z =-3. 


= 
=2+3 —> 1 22(2+3)+1=22+7 and 





(ii) From the equations for /;, we have 7 
-2 

A =24+8 — y 2—-4z - 10. 

Plug in + and = into the equations for lg to get 


2z+7+2 3 -42-10-la4z-3 








i 5 k; 
From 3 we have 10z + 45 = -4z-11 <=> z = -56/14 = -4. Now from 2 we have 
poe oe 
—A4z-11 5 


(iii) The direction vector of l; is perpendicular to the normal vector of the plane p, as 
we can verify—(2, —4,1)-(1,1,2) = 0. Moreover, a point on J; is on p, as we can verify— 
(1,2,-3)-(1,1,2) =-3. Altogether then, J; is on p. 


From the equations for Jj, we have y = 54+11 and z = -7x-11. Plug these into the equation 
for the plane p to get x + (54 +11) +2(-7r-11) =-3 <— > -8%-11=-3 <— = r=-1. So 
y=6 and z=-4. The intersection point is (-1,6,-4). 
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(iv) The angle @ between lz and the normal vector to p is given by 


6 = cos! ( (1,5,-7)- (1,2) 
- (1,55 =7)) (1,142) 





= = =cos? a my 
OS (=) (=) 1.957. 


So the acute angle between /y and p is 2.172 - 1/2 » 0.387. 
A623 (9740 N2010/I/1)(i) |b)’ = 1+2?+2? = 9 = al? = (2? +3? + 6?) p? = 49p”. So p= 3/7. 


6 9 18 6 9 18 
li Db): (a= b)=|o4 Lj 42) 2)+( 5-122, =] = 2 
(i) (a+b)-(a-b) = (241,542, > +2)-(2-1,5-2,>-3] 
13. 115) 128 


See 
49 49 * 49 

(Optional. Actually, more generally, since (a + b)-(a-—b) = |al’ — |bI’, if [al = |b|, then 
(a+b)-(a—b) =0.) 

A624 (9740 N2010/I/10)(i) The line has direction vector (—3,6,9), which is a scalar 
multiple of the plane’s normal vector (1,-2,-3). So the line is perpendicular to the plane. 
(ii) From the equations of the line, we have y = -2x+19 and z = -3x+27. Plug these in to 
the equation of the plane to get x — 2(-2x + 19) - 3(-34 +27) =0 <=> 147-119=0 <> 
x =119/14=8.5. And so y=2 and z=1.5. So the point of intersection is (8.5, 2, 1.5). 


=2=10 
(iii) We can easily verify that the given point satisfies the equations for the line: = 


23+1  33+3 
6 9 
The point of intersection we found in (ii) (call it X) is equidistant to both A and B. 
Moreover, these three points are collinear. Thus, B = (19, -19, -30). 


(iv) The area of triangle OAB is 





A= 








. The point is therefore on the line. 


0.5|a x b| = 0.5|(—2, 23, 33) x (19, -19, -30)| 
= 0.5|(—63, 567, -399)| x 348. 


A625 (9740 N2009/I/10)(i) The angle 0 between the two planes is given by 


v= fereyenanl (7a) _ 


(ii) The line / has direction vector (2,1,3) x (-1,2,1) = (-5,-5,5) and thus also direction 
vector (1,1,-1). 

A point (x,y, z) that lies on both planes satisfies 27 + y+3z +1 and -r+2y+z 22. Plugging 
in e= 0, 2 yields y = 1 -3z and now 2 yields z = 0. So (2,y,z) = (0,1,0). 

Altogether then, the line / has vector equation r = (0,1,0) + A(1,1,-1), for AER. 

(iii) The line / is parallel to the plane p3, as we now verify: (1,1,-1)-(2-k,1+2k,3+k) = 
2-k+1+2k-3-k=0. Moreover, the point (0,1,0), which is on the line J, is also on the 


plane p3, as we now verify: 2x0+1+3x0O-1+k(-0+2x0+0-2)=0. Altogether then, 
the line / lies in p3 for any constant k. 
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We want to find k such that (2,3,4) satisfies 2x + y+ 3z-1+k(-x + 2y+2z-2) = 0. 
That is, 2x2+3+3x4-1+k(-2+2x3+4-2) =18+6k. So k = -3. So the plane is 
2 +y+3z-1-3(-x +2y+z2-2)=0 or 5a-5y+5=0 or x-yt+1=0. 


= 2( 11.13, 2 14,14, 14 
A626 (9740 N2009/II/2)(i) Let p = OP. By the Ratio Theorem, p = ee ee 





a 
(12,-4,6). So the point P is (12, -4,6). 
(ii) AB - p = (-3,-27,-12) - (12, -4,6) = 0. 
ew ee = =, 
(iii) c = (12,-4,6) = (6,-2,3) = O88) = (6, 23) la-c| is the length of the 


JV@+(2)42 7 


(iv) ax p = (140, 84, -224). |a x p| is the area of the parallelogram formed with a and p as 
its sides, where the heads of a and p are the same point. The area of the triangle OAP is 
la x p| = 140? + 84? + (-224)? » 139. 


A627 (9740 N2008/1/3)(i) OP = OA + OB = (6,3, -3). 
(ii) The angle AOB is equal to the angle between the vectors OA and OB: 


cos | (1,4, -3) - (5, -1,0) 
|, 4, -3)| (5, =I, 0)| 


projection vector of a on p. 


x 1.532. 


os! (ss ] 
V 26V 26 
(iii) It is |OA x OB| x 25.981. 


A628 (9740 N2008/I/11). You can either find the intersection point using a graphing 
calculator or painfully by hand, as I do now: 


2 5 
From pj, 2 51-—7+ -y. Plug + into the equation for po to get 3a + 2y -5z = 3x 4+ 2y- 


2 5 9 19 
5(1- 30+ oy) =-—5 or qin ays Oorn sy. And so from Z we have z21+z. Now plug 





3 
in 2 and 2 into the equation for p3 to get 5¢+ Av +17(14+2) =p or (22+A)z = p-17 or 
u-17 04 4A ee ff 4° 4°77 
cS = -—— = -—. §o the point of intersection is {|-—-,-—, — }. 
22+A Ll 11 1” TR 1h 


(i) The line has direction vector (2,-5,3) x (3,2,-5) = (19,19, 19) and thus also direction 
vector (1,1,1). From our work above, x = y at the intersection of the two planes. Plug in 
x =0 to find that the two planes intersect at (0,0,1). Altogether then, the line has vector 
equation r = (0,0,1) + a(1,1,1), foraeR. 

(ii) Two points on the line are (0,0,1) and (-1,-1,0). Plug these into the equation for 
plane p3 to get 17 = wp and -5- A= p, so that p= -22. 

(iii) The line / must be parallel to the plane p3, so that (1,1,1)-(5,A,17) =0 or A = -22. 
Moreover, the point (0,0,1) on the line is not on the plane, so that ps #17. 


(iv) Another vector that is parallel to the plane to be found is (1,-1,3)-(0,0, 1) = (1,-1, 2). 
The plane thus has normal vector (1,1,1) x (1,-1,2) = (8,-1,-2). Compute also d = 
(0,0, 1)-(3,-1,-2) = -2. Altogether then, the plane has cartesian equation 3x -y-2z = -2. 
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A629 (9233 N2008/I/11)(i) From the equations of the first line, we have y = 27-2 and 
z=5-2. Plugging these into the equations of the second line, we have 
ge-1, 24-2+325-2-4 
lo 3B 





Both = and 2 imply that x = 4 and so indeed the two lines intersect. (If they didn’t intersect, 
then = would contradict 2.) So the point of intersection is (4,6, 1). 


(ii) The angle between the lines is given by 


ieee) oe Cos ee , -8 
cos! oat 2, = ai 3 if = cos? Vii x 2.967. 
This is obtuse. So the acute angle is 7 — 2.967 ~ 0.175. 
A630 (9740 N2007/1/6)(i) (1,-1,2)-(2,4,1) =0. 
(ii) By the Ratio Theorem, OM = 1/3[2(1,-1,2) + (2,4, 1)] = /3(4, 2,5). 
(iii) The area of triangle OAC is 


0.5|OA x OC| = 0.5 |(1,-1, 2) x (-4,2,2)| 
= 0.5|(-6, -10, -2)| 
= 0.5V 140 = 5.916. 
A631 (9740 N2007/I/8)(i) The line / has vector equation r = (1,2, 4) + A(-3, 1, -3), for 
Aé€R. Plugging this into the equation for the plane, we have 3(1-3,)-(2+A)+2(4-3A) = 17 


<= 9-16\=17 <= X=-0.5. So the point of intersection is (1, 2,4) — 0.5(-3,1,-3) = 
(2.5, 1.5, 5.5). 


(ii) The angle between / and the normal vector to p is 


_, (-3,1,-3) - (3,-1, 2) 4 16 

cos = cos» ——— 

I(=3; y=3)||(3,=1,2)| V 19/14 

So the angle between the line and the plane is 2.946 — 71/2 » 1.376. 

_ |17-(1,2,4)-(8,-1,2)|  |17-9| 8 — 4V14 
V14 V14 V14 7 

A632 (9233 N2007/I/7). The foot of the perpendicular a point A to a line is Q + 


(QA *) Vv, where Q is any point on the line and v is the line’s direction vector. Hence, 





x 2.946 





(iii) |d—a- Al x 2.138. 








P = (-3,8,3) + (4, -5, — 2.) ae 
T2250) _ 


it 
= (-3, 8,3) - ——_—— = (-5, 6,0). 
( Oyo) V7 ( , 6,0) 





|AP| = |(-6,3,2)| = V49 = 7. 


A633 (9233 N2007/II/2)(i) OD = 0.75(1,-3,4). So the line AD has direction vector 
(3.25, 3.25,0) and hence also direction vector (1,1,0). So the line AD has equation r = 
(4,1,3) + \(1,1,0), for \eR. 
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(ii) OC = 0.25(4, 1,3). So the line BC has direction vector (0, 3.25, -3.25) and hence also 
direction vector (0,1,-1). So the line BC has equation r = (1,-3, 4) + u(0,1,-1), for we R. 


Setting the equations of the two lines equal to each other, we have 4+ A\=1,1+A=-3+4+ yp, 
and 3=4-4, so that \)=-3 and w=1. And the point of intersection is (1, -2,3). 


A634 (9233 N2006/I/14). By the Ratio Theorem, OP = (1-\)OA+ OB = (1 - 
d)(1,-2,5) +A(1, 3,0) = (1,-2+5A,5-5A). And O@ = (1-4)OC'+ pOD = (1-)(10, 1,2) + 
p(—2,4,5) = (10 - 124,14 34,2 + 3p). 

(i) PQ has direction vector AB x CD = (0,5,-5) x (-12,3,3) = (30,60,60) and hence also 
direction vector (1, 2,2). 

Moreover, PO = OQ — OP = (9- 12,3 + 3u -5A,-3 + 34+ 5X), which must be a scalar 
multiple of (1,2,2). And so 3+ 3u—-5\ = 2(9- 12) and -3 + 3u+5A 2 2(9 - 12). Taking 
2 minus é. we have -6 + 10\ = 0 or \ = 0.6. Taking 2 plus é we have 6 = 4(9 - 12) or 
p= 2/3. Altogether then, PQ = (1, 2,2), as desired. 

(ii) First observe that AQ = OQ - OA = (10 - 12,1 + 3u,2 + 3) - (1,-2,5) = (2,3,4) - 
(1, -2,5) = (1,5,-1). 

Now compute that the area of triangle ABQ is 


0.5 |AB x AQ = 0.5|(0,5,-5) x (1,5,-1)| 
= 0.55 |(20, -5, -5)| » 10.607. 
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155.4. Ch. 136 Answers (Complex Numbers) 


A635 (9758 N2019/I/1). (Note that a + 0.)®“ 
Since all of the coefficients of this polynomial equation are real, by the Complex Conjugate 
Root Theorem, 2 -i is also a root of this equation. Hence, 

O= f (z) = az? + b2* +cez+d= 2° + (b/a) 27+ (cla)z+d/a 


= [z- (2+i)] [z- (2-i)] [z-(-3)] = [(2-2Y +1] (¢ +3) 
= (2° - 40 +5) (24+3) = 23-2 -72415. 











Comparing coefficients, b = -a, c= —7a, and d= 15a. % 
. it -_ il 
A636 (9758 N2019/I/9)(i)(a) w+ — =cos@+isind@ + — 
w cos 6 +isin6é 
_ 1 cos @ — isin 8 _ cos@ —isin@ 
= cos@+isin 6 + — —— = cos@ +isin@ + = 
cos@ + isin cos@ —isin# cos? § + sin* 0 


=cos@+isin@ + cos@—isin#@ =2cosd_R. 


wool cos@+isin@-1 7 cos@+isin@—-1cosé-isiné+1 





(i)(b) 


_ (cos@) + (isin 6 - 1) (cos) - (isin@-1) _ cos?@ - (—sin? 0 - 2isin +1) 


wtl cos@+isinO6+1 cos@+isind+1cosé-isind+1 














~ (cos@ +1) + (isin#) (cos8+1)-(isin?) —cos?6 + 2cosA + 1+ sin? 
Pisin 6 isin @ isin @ i2sin 2 cos 2 sin 2 @ 
24+2cos@? 1+cosé 2 cos? 5 2 cos" 5 cos § 2 





(ii) Let z =a+ib. We are given that Va? + b? = 1 or a? +b? =1. 


z-3i 
14+ 3iz 


_ [2-3] VP +@-3) VER OD —VI- HTD, 
b+ 3iz| | /(7 3p)? + (3a)? V9 — 6b +1490? V9-6b+1 




















S4Tf q@ = 0, then f (z) = 0 is a quadratic equation whose coefficients are real and hence cannot have roots 
2+iand -3. So, a #0. 


2056, Contents www.EconsPhDTutor.com 


A637 (9758 N2018/II/2)(i) Since all of the coefficients of this polynomial equation are 
real, by the Complex Conjugate Root Theorem, 2 + 3i is also a root of this equation. Now, 


[x - (2- 3i)] [x - (2+ 3i)] = (2 - 2)? - (31)? = 2? -40 +449 = 07 - 4 + 13. 
Write (a* — 4x + 13) (ax? + bx + c) = de* — 2023 + sx? -— 56x + t. 
Or, az* + (b- 4a) 2? + (c- 4b + 13a) 2? + (-4c+ 13b) 2 + 13¢ = 4a* — 2023 + sx? - 56x +t 


Comparing coefficients, a = 4; b- 4a = -20. So, b= —4, -4c + 135 = -56, and c=1. 
The other roots of the equation are x = (-» + Vb? - 4ac) /:(20)2 (4 +V16- 16) (oo 


Hence the other three roots of the given equation are 2 + 3i and 1/2 (repeated). 
Also, s=c-— 46+ 13a =69 and t = 13c= 18. 


(ii) (a) Let a,b €R and (a+ ib)° = a? + 3ia2b - 3ab? - ib? = a? - 3ab? +i (sab 0 207. 
So, a? - 3ab? = 27, 3a2b- 0 20. By 2, ? 23a?. 

Plug 2 into = to get a® — 9a? = 27 or a= (-27/8)'8 = -3/2 and hence b = +3V3/2. 
Thus, the other two possible values of w are —3/2 + 3V3i/2. 

(ii) (b) |-3/2 + 3V3i/2| = \/9/4 + 27/4 = 3. 

arg (-3/2 + 3V'3i/2) = cos ! [(+3/2) /3] = cos + (¥1/2) = 27/3. 

Hence, —3/2 + 3V'3i/2 = 3 [cos (27/3) + isin (271/3)]. 


: 3., OV. 3 - Bs. 
(ii)(c) Sum = 3 + (-5 + <) + (-5 = i) =i 





27/3 Re 








3 3V3.\/ 3 3V3. 9 27 
Product =3(—$ 5 ')(-3- 5 i)-3(3+ 3 21/3 
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A638 (9758 N2017/I/8)(a) By the quadratic formula: 


ga\ (27 -A(1 i) (5 +51) : 1+ /1-(5+5i-5i+5) _lty-9 _ 143i 


=a) i f= * a 





oS 





Lai. 1+3i _ laodil+i _ 1+i14+3i173 


eee a 
i. ta eo oe 


Multiply by 








(b)G@)- ee Se iy 2 a oe, oS 0 i) 0) = 9 ee 1) 0) = 
-2-21+2i-2=-4. at 


We are given 
w*+pw+39w?+qw+58 = —4+p (-2 - 21) +39 (-2i)+q (1 — i) +58 = q-2p+54+i (-q — 2p — 78) = 0. 


Hence, g - 2p + 54 +0 and -—q-2p-78 20. Taking < plus Z we have —4p - 24 = 0 or p=-6 
and q = -66. 


(c)(i) Since the coefficients of the given quartic equation are all real, by the Complex 

Conjugate Root Theorem (Theorem 21), w* = 1+i also solves the given equation. Thus, a 

quadratic factor of the given quadratic polynomial is (w-1+i)(w-1-i) =w?+1-2w+1= 
2 

Ww" — 2 + 2. 


NoW-v6ritte+ 39? — 66w + 58 = (w —2Qw + 2) (aw? + bw + c) = aw* + (b— 2a) w3+?w?+?w + Qe, 


where ?’s are coefficients we didn’t bother to compute. Comparing coefficients, we have 
a=1,b=—4, and c= 29; 


Thus, uw — 6w? + 39w? — 66w + 58 = (w? — Qw +2) (w? - dw + 29). t 
A639 (9740 N2016/I/7)(a) Simply plug in -1 + 5i and verify that the equation holds: 
(-1+ 5i)? + (-1 - 81) (-1 + 5i) + (-17 + 7i) = 1-25-10i+ 1-5i+81+40-17+7i=0. Vv 


Note that the Complex Conjugate Roots Theorem (Theorem ???) does not apply here 
because the coefficients of the given quadratic equation are not all real. 


Let a+ib be the other root. Then, 
w+ (-1- 8i) w+ (-17+7i) = (w+1-5i) (w-a-ib) = w? + (1-a—5i- ib) w+?, 


where we didn’t bother computing ? because it isn’t necessary. 


Comparing coefficients, we have 1 - a = -1 or a = 2 and —5i-ib = -8i or b= 3. Hence, the 
other root is 2 + 3i. % 


(b) Plug in 1+ ai into the given equation: 
(1+ai)* -5(1+ai)°+16(1+ai)+k= 1+ 3ai-3a? - a3i-5(1+2ai-a?) +16+16ai+k 
= 1-3a?-5+5a?+16+k+i(3a-a*- 10a + 16a) 
= 12+2a7+k+i(9a- a3) =0. 
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Comparing coefficients, we have 12 + 2a7+k +0 and 9a-a3 20. From 2 we have a = +3, 0. 
We are given that a >0O and so a 33. Plugging 3 into 2 we have k = —30. oS 


A640 (9740 N2016/II/4)(a)(i) |z- 3 -i] = 1 traces out a unit circle centred on 3+i = 
(3, 1). 


arg z 2 tan! 0.4 traces out a ray from the origin, with gradient 0.4. 





A 
y 
|jz-3-ij=1 a 2 
\ oe 
ae SA). 3 
Large 2 = tan 0.4 PA _ 
: x 





(a)(ii) We are asked to find the two points that are on the circle and equidistant from 2 
and z2. These are the two blue points depicted in the figure above. 


The line connecting these two blue points is perpendicular to the line with gradient 0.4— 
it thus has gradient -1/0.4 = -2.5 and direction vector (2,-5), whose unit vector is 


Tx (2,-5). Moreover, it passes through the point (3,1). Thus, it may be described 


by the vector equation: 


= (8,1) + Z=(2,-5) AeR. 


The circle’s radius is 1. And so, the two blue points are given by 


1 Z i) 
G+ @.-8)=(3+ 1-4] 


1 2 5 
8) -@-#-(3- Ze Z), 


(b) (i) |w| = |2 - 2i] = \/ 2? + (-2)” = 2V/2 and arg w = arg (2 - 2i) = -cos"! 


2 @ ot 
—— =- cos —== 
7 9/2 J2 
== % 
A 


; 3, | | 

Thus, w = 2V2e/4 and the cube roots of w are (2v2) ei = \/Qeim/2 \/QQ(-n/12+2n/8)i — 
SdeT2 and Vde(-M/12-27/3)i = \/Je-Bin/4., 

1- 

(b) (ii) arg (w*w") = arg w* +narg w+2kTt = oA 2k axe ( 5 n 


= = 
Se dhed and n>0. So we must have k = 1 and thus = 





+ 4k), where k = -1,0,1. 


+=) OF d= (3 








1 
So, 
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A641 (9740 N2015/I/9)(a) 


w?  (a+ib)? a*-b?+2abi_ a? -b? +2abi_ atid 


= SS ———— 
w* a—ib a-—ib a-—ib a+ib 








a® — ab? + 2a2bi + a2ib — ib’ — 2ab? 
a? + b? 


1 


= ADA [ (2° _ 3ab”) +74 (3a7b = b)| 


is purely imaginary if and only if a?-3ab? = 0. But a?-3ab? = a(a?-3b”) = a (a — v3b) (a + v3b),. 
So either b = ta/V3 or a = 0 (but the latter is explicitly ruled out in the question). 


Altogether, the possible values of w = a+ib are given by b = +a/ V3 and a is any non-zero 
real number. 


(b) (i) 25 = 2Peitl-05) = QF ein(-0.5+2k) for ke Z. So z = Qei™-05+24)/5 for & = 041,42. So 
|z| = 2 and arg z = -0.97, -0.571, -0.171, 0.371, 0.771. 


(ii) 21-2 = Je'™(0.7) _ Jei™-0.9) 
~ Jein(-0.1) (ein(08) 5 grins) 
= 2e'™(-0.1) Dj sin 0.87 


= (4sin 0.871) ie’), 


So arg (z, - 22) =arg [(4 sin 0.87t) ie'™O-)] 
= arg (4sin 0.87) + argi + arg (e'™-"))) + Qk 
=0+7/2-0.1m+2kn=0.4n (k =0), 


and |z1 — z9| = \(4 sin 0.870) iei"-0.)| 


= |4sin 0.8m |i||e’"\-°)| = 4sin 0.87 = 4sin 0.27, 
I i 
where the last line uses the fact that sin(7—- x) = sina. 
A642 (9740 N2014/I/5) (i) 2? = (1+ 22)? = 1? + (27)? +2(1) (21) = 1-44 41 =-3 +43. 
z = (-3 + 44)(1 + 21) = -3 - 61 + 47 + (44) (27) = -3 - 21-8 = -11 - 27. So, 


Do rt 2-1 26-14 24-11 + 24 
zs -11-2¢ -11-2i -114+2i 112-(23)2 121+4 125 





—-11+22 11 2 
(ii) Since pz? + 5 = p(-3 + 47) + — = (- - ost) +1 (4p + os) is real, we have 
Ap + 2 = 0 or g=—250p. And pz? + 5 = 19p. 
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A643 (9740 N2014/II/4)(a)(i) This is simply a circle with radius 4 centred on the point 
-5 +7. It has cartesian equation (x +5)? + (y—1)? = 4?. 





{z€ C: |z+ 5- i| - 4} 














(ii) The complex equation |z- 67] = |z+10+47| is equivalent to the cartesian equation 
(x —0)2 + (y -6)? = (a + 10)? + (y +4)? or -12y + 36 = 20x + 100 + 8y + 16 or y 2-2-4. 

So to find the intersection points of the line and the circle, plug = 2 into = to eet (x +5)? + 
(-2 —4-1)? = 4 or 2(2 +5)? = 4 or (2 +5)? = 8 or +5 =4V8 or x =-5+4V38. So the 
possible values of z are -5 + V8+ (54 V8-4)i=-5 + V8+ (14 V8): ; 


(b) (i) w = V3 -4, so |w] = (v3) +(-1) =2 and argw = tan”? = 7 7 So w = 2e-7/6), 
And so w® = 28ei(-72k") = 96 gir 

(ii) arg ee argw”" -argw* +2km = nargwtargwt 2k = (n+ 1)argw+2km = (n+1)x 
(—71/6) oe A complex number z is real if and only if arg z = 0 or ee = 7. So by 
observation, the three smallest positive whole number values of n for which “— is real are 
5, 11, and 17. a 

A644 (9740 N2013/1/4)(i) (1+22)? = 1+3x2i+3 (21)? + (21)? = 14+6i-12-8% = -11-2i. 
(ii) Since w = 1+ 2i is a root for az? + 5z?+17z+b=0, we have 
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0 =a(1+2t)°+5(1+42i)? +17(1+2i) +b 
= a(-11 - 27) + 5(-3 + 42) + 17+ 341 +b 
= (-lla-154+17+6)+i(-2a+ 20 +34) 
= (2-1la+6)+i(54- 2a). 

ee ay ae 


0 0 








54-2a=0 = > a=27. And 2-1la+b=0 = > 0=11(27) -2= 295. 


(iii) By the complex conjugate roots theorem, 1 - 27 is also a root for the equation. Write 


212 452" 41724 295 =97 [2 = (1427) [2-0 = 2) Hk) 
= 27[(z-1)? - (2%)?] (z-k) 
= 27 (2? -2z2+5)(z-k) 
= 27[23 -(k +2)2? - (2k +5)z-5k]. 


So 295 = 27 x (-5k) or k = -59/27. The roots are 1 + 27 and —59/27. 


2 
A645 (9740 N2013/1/8)(i) |w| = |(1-iv3) 2] = [1 -iv3||z| = 12+ (V3) lal = 2lel. 
arg w = arg (1 - iv) z| = arg (1 — iV3)+arg z+2kn = tan” (- 3)+0+2kr = —7/3+0+2k7 € 
[-7/3 + 2k7, 1/6 + 2k7t]. So we should choose k = 0 and arg w = -71/3+ 6. 
(ii) z is the top-right quarter of the circumference of the circle of radius r, centred on the 
origin. 
Take the position vector of z, rotate it clockwise by 7/3 radians about the origin, double 
its length—this is the position vector of w. 
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Ire 1(- W/3) 









10 
(iii) arg (= = arg 2! - arg w? + 2k7 = 10argz - 2argw + 2k7 = 100 - 2(-2 +6] + 2k = 
w 


8042" +2kn =", so @=— (with k =0). 

3 24 
A646 (9740 N2012/1/6)(i) 2? = (1+ ic)? = 1° + 3(ic) + 3(ic)? + (ic)? = 1+ 3%e- 3c? — ic? = 
(1 - 3c’) + i(3c - c?). 
(ii) z° is real if and only if 3c—c’ = 0 or c= 0,+V3. The question already ruled out c = 0. 
So c=+V3 and z=1+iV3. 


(iii) z = 1-iV3 = |z|e'*8* = 2e*-7/3)_ |z"| = 2" > 1000 if and only if n> 9. (The reason is 
that 2° = 512 and 2'° = 1024.) So the smallest: positive integer n is 10. 


jz*°| = 21° and arg z'° = 10(-m/3) + 2km = 2n/3 (k = 2). 


A647 (9740 N2012/II/2)(i) |z - (7-32)| = 4 describes a circle with centre 7 — 37 and 
radius 4. 
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{zE C:|z- (7-37) - 4) 





7” Radius 4 


















(ii)(a) a is the point on the circle’s circumference that is closest to the origin a. The line 
! through the origin and the centre of the circle passes through a (see Fact ??). 


But the distance of the centre of the circle from the origin is V7? + 3? = V58. The distance 
of the centre of the circle to the point a is 4 (this is simply the length of the radius). Hence, 
the distance of the origin to the point a is 58 - 4. 


(b) Aabc is right. So ab? + bc? = ca? = 4? = 16. 


3 
But the line / has gradient “7 (because it runs through the origin and the point 7-37) and 


3 3\" 49 7 28 
so ab = =bc. Hence, (=) x bc? + bc? = 16. Or be? = 16 x —. Or bc = 4x —— = ——=. And 
i f 98 Vd8  ~V58 
12 28 12 
ab = ——. Hence, a = | 7 - —~—, -3 + —— ]. 


(iii) By observation, d is the point where |arg z| is as large as possible. arg z = arg(7- 37) + 
Zcod. 


4 —3 
But Acod is right. So 2cod = sin" —~. Moreover, arg(7 — 37) = tan”! 7 


3 


- 4 
Altogether then, arg z = tan™! — + sin>' —— = -0.9579. 
& & 7 eS 
A648 (9740 N2011/1/10)(i) Let (x + iy)? = x? - y? + i(2ry) = -8i. So 2? - y? = 0 and 
2G 28. From Z| we observe that 2 and y must have opposite signs. From es x =+y and 


by our observation of the previous sentence, we must have x = —-y. And now from 2 we 
have 2(—y) x y = -8 or -2y? = -8 or y = +2. Altogether then, z; = -2+2é and z =2- 24%. 


(ii) Using the quadratic formula and part (i), 
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—4 42 4(1)(4+ 27 
ee (1) (4+ My as 4- (44+ 21) =-24V-2i =-24+(1+7) =-3-i,-1+7. 





2 


(iii)(a) This is simply the line that is equidistant to z1 = (-2,2) and z = (2,-2). By 
observation, it has cartesian equation y = x. 





W-W|-|W- W% 





y 


IN 








Z-Z 








-|4-% 








(b) This simply the line that is equidistant to w; = (-3,-1) and w2 = (-1,1). By observa- 


tion, it has cartesian equation y = x + 2. 


(iv) The two lines are parallel and do not intersect. 


A649 (9740 N2011/II/1)(i) This is simply the circle with radius 3 and centre 2 + 5, 


including all the points within the circle. 
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Ize: 





a -(2+5f) <3} 

















(ii) The points on the circle’s circumference that are closest to and furthest from the origin 
o area and b. The line / through the origin and the centre of the circle passes through both 
a and b (see Fact ??). 


oc = V 224+ 52 = 29 and ac = 3. Hence, oc = V29 — 3. Symmetrically, ob = V29+3. The 
maximum and minimum possible values of |z| are thus V 29 + 3. 


(iii) The locus of points that satisfy both |z-2-57| < 3 and 0 < argz < 7/4 is the blue 
closed segment. 


By observation, |z —6- | is maximised either at P, or Py. These points are given by 


(p— 2)? + (p—5)? =3 => 2p? - 14p + 20=0 
<> p*?-7p+10=0 = > (p—-5)(p-2) = 0. 


So P; = (2,2) and P, = (5,5). The distances of these points to the point (6,1) are V4? + 1? = 
V17 and V1? +4? = V17. So both are, equally, the furthest point from (6,1). 


2 
A650 (9740 N2010/I/8)(i) For 1, r= 2 +(V3) =2 and @ = tan" (V3/1) = 1/3. For 


Es a 
z,r=\/(-1)+(-1 ? = VB and 6 = tan! = 
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-3 

Altogether then, z) = 2 [cos . +7sin =| and 2 = /2 cos = +7sin "| 
Y lzi| 2 
ii = = 
EI lal V3 

“ 11 it 
Hence, (=) = V3| cos + isi a 

22 


Z1 -37 _ 137 7 —llz 


4 12 i 

















= /2. arg (=) = arg 21 — arg z= 7/3 - 
a) 22 


aes 
(iii)(a) This is simply the circle with centre z, and radius 2. 





AX 
y 





Z- Z| -2 


arg (4Z- 4) - 1/4 

















(b) This is simply the ray from the point z2 (but excluding the point z2) that makes an 
angle 7/4 with the horizontal. 


(iv) We want to find x > 0 such that \(, 0) - (1, v3) = 2 or (x -1)?+3=4 or (x-1)?=1 
or x = 0,2. So (2,0) is where the locus |z - z,| = 2 meets the positive real axis. 
A651 (9740 N2010/II/1) (i) 


.- 6+ /(-6)2 —4(1) (34) _ 3, ¥-100 Hae 
D D ' 


2 


(ii) Since -2 +2 is a root of 2+ +423 +2? +axr +b =0, we have 
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(-24-7)*4+4(-2 47)? + (4247) +a( 247) +050 
: (tedious algebra) 
-12-2a+b+(16+a)i=0. 
a 


ep — 
-0 -0 





16+a=0 = >a=-16. Moreover, -12-2a+b=0 = > -12-2(-16)+b=0 = > b= -20. 


By the complex conjugate roots theorem, —2-—7 is also a root. So 


vi + 4x +x? - 16x - 20 = [x - (-2+7)][z-(-2-1)] (2? + cr +d) 
=(r+2-i)(a+2+i)(a*?+cr+d) 
=[(+ ay" ~i?| (a? + cx +d) 
= (2? +40 +5) (a? +cx +d) 
= 24+ (44+c)x° 4+ (5e+4d)x + 5d. 
Comparing coefficients, we have c= 0 and d= —4. So 2? +cx+d= 27-4 = (x-2)(x%+2). So 
the other two roots are +2. 


A652 (9740 N2009/1/9)(i) 27 = 147 = 24/7ei7/4 = 21/2 eim(1/42k) By de Moivre’s Theorem, 
ge R ert rerl) for k= 0,241,494; 
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on ° @. - 
‘ ; \ 
- ‘ 
: : Z—2Z,| = |z— Xp 
¢ \ 
\ 
k = 3 | 
7 Dis 
2n/7 
| a | ane k-0 
(> aes 
\ : : 
] 
\ 
| 
\ 
/ 
1/28 ; 
k= 33 | 
Ss 
\ 
. oa a k — — | 
> - 
~ a . . 
— @ a 
k =-—2 


(ii) 
iii) |z — z1| = |z — zo] is the line (blue) that is equidistant to the points z, = 2 Me™/?5) and 
(iii) |z — z1| = |z - 29] ( q p 1 
ay = Qi/l4pin(1/28+2/7) 
Explanation #1: 0 satisfies the equation |z - z;| = |z - z2| as we can easily verify—0 - z1| = 
0 - z9| = 2.4. So 0 is in the locus |z — z| = |z - 29]. 


Explanation #2: The perpendicular bisector of a chord runs through the centre of the 
circle. So in this case, the perpendicular bisector of the chord z,z2 runs through the origin 
(which is the centre of the circle). 


A653 (9740 N2008/1/8) (i) 


(1+ V3i) (1+ V3e) = (-2+ 2V3:) (1+ V3) = -2- 6+ (23 -2V3) i= 8, 


(ii) O=222 +a27+bz+4 
= 2(-8) + a(-2+ 2V3i) +b(1+ V3i) +4 
=-12-2a+b+iV3(2a+b) 
p< 


| —S 
EN 29 
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Adding + and 2 together, we have -12 + 2b=0 or b=6. And now from 2 a=-3d. 
(iii) By the complex conjugate roots theorem, another root is 1 - V3i. So 
E - (1 + v3i) | E — (1 - v3i) | (z-c) 
(z-1- V3i) (2-1+ v3i) (z-c) 
2 
=2|(2-1)?-(v3i) |(z-0) 
(22-2244) (z-c) 
[2° + (-c-2)z2? + (4+ 2c)z - 4c]. 


223 — 327 4+62+4=2 
=2 

2 

=2 


Comparing coefficients, we have c = —0.5, which is also the third root for the equation. 


AV 


@ 14+Vv3i 


—0.5 





@ 1-v3i 











A654 (9740 N2008/II/3) (a) |p| = Pe 
6 - (-0) = 20. 


argp’ = 100+ 2k. The argument of a positive real number is 2m7 for some integer m. 
Hence, 6 = n7t/5 for integers n. Given also the restriction that 0 € (0, 71/2), we have 0 = 71/5 
or 27/5. 


(b) |z| <6 is a circle of radius 6 centred on the origin, including the interior of the circle. 





Ww * 
= = 1 and argp = arg — = argw-argw" = 
w 


lw*| 


|z| = |z -8-6i| is a line that is equidistant to the origin and the point (8,6). 


So the locus of z is the line segment AB. 
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) 


|Z| -|2- 8- 6: 





(i) 
(ii) Observe that arg z is maximised and minimised at A and B. arg A = 4COX + 2 AOC, 
6 3 
arg B= zCOX - 2 BOC. Moreover, COX = arg(8 + 61) = tan”) — = tan! -. 


8 4 
Note that AAOC is right and the length of OC is half of |8 + 62| = V8?+6? = 10. So 
OC 5 
2 Z hp ee ele 
OC = 10. Thus, 2 AOC = 2 BOC = cos OA cos OB cos me 


Altogether then, arg A = zCOX + 2 AOC = tan ; +cost ° # 1.229 and arg B= 2COX - 
3 5 

2 BOC = tan"! a cos! a 0.058. 

A655 (9233 N2008/1/9)(i) This is the circle centred on -2i with radius 2. 
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|z +27] -2 








(ii) This is the line that is equidistant to the points 2+ 7 and i. 





A655 (9233 N2008/I/9)(iii) This is the region bounded by and including the rays 


arg(z +1 - 37) =7/6 and arg(z + 1-37) = 7/3. 
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w\6 < arg (4+ 1-31) < w/3 
xX 





A656 (9233 N2008/II/3)(i) (1-1)? = 1-1 -2i=-2i. V 


The other root is -1 +7, because (-1 +72)? = 1-1-2i = -2i. 











(ii) 34 5i+./(3 + 52)?-4(1)(-4) (1-21) 3 +504 /9- 25 + 307 + 16(1 - 22) 

ll ZL _———— 
2 2 

_—34+5t4V-21 345474 (1-2) 

a [sn ee 


= 24+ 21,1434. 


A657 (9740 N2007/I/3)(a) This is the circle with radius V13 centred on the point 
—2+ 31. 
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Z+2-3i| -V13 




















(b) (a+%b)(a—ib) +2(a+ib) = 3+4i or a? +b? + 2a4+ 2bi = 3+ 47. Two complex numbers are 
equal if and only if their real and imaginary parts are equal. So a? + b? + 2a +3 and 2b24. 
From 2 b=2. Plug this into + to find that a2 +2a+1=0 or a=-l. So w=-1+2i. 


A658 (9740 N2007/I/7)(i) By the complex conjugate roots theorem, another root is 
re, And so a quadratic factor of P (z) is 


(z + re’) (z - re?) = 274 rze? - rze? = (re’”) (re““”) 


=2-4+rz (ci? —e?) —p= 27? 4+rzcos6—-r?. 


(ii) 2° = -64 = 64e'" = 2%e'"0#2*) for ke Z. So z = Qel™+24)/6 for k = 0,41, +2, -3. 
(iii) We first use (ii), then use (i): 
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2° +64 


z?-2(2) cos(m/6)+2? z°—2(2) cos(57/6) +2? 
-eeor ss —— ——’'’™’'nan——’——T——re——~ 
om (z 7 2e'"/6) (z = 2e7™"\(2 = oe") (z = 26 Ne = 2err® ) (z a 267") 
Nee 


22-2(2) cos(37/6)+2? 


= (2° -2V3 +4) (22 +4) (2? +2V8 +4). 


A659 (9233 N2007/1/9)(i) By the complex conjugate roots theorem, another root is 
-ki. Altogether then, 


az’ +bz3 +27 4+dz+e=a(z-ki) (z+ ki) (2? + fz+g) 
=a(2? +k?) (27+ fzt+g) 
=alzi+ fois (+g) +k fz gk]. 


By comparing coefficients, we have b + af, c= a(k? +9), d 2 ak? f, and e = agk®. Now verify 
that indeed: 
ad? + b’e = a®k* f? + a f2 gk? 
= (af) x [a(k? + g)] x (ak? f) 
=bxcxd. o 





(ii)a = 1,0 = 3, c= 13,d = 27, e = 36. So indeed ad? +b’e = 1x277+3?x 36 = 1053 = 3x 13x27 = 
bcd. J 


b d [27 
From + above, f = — =3. So from 2 k =+4,/— = +4)\/—— =+V/9 = +3. So the two desired 
a af 1x3 


roots are +32. 


A660 (9233 N2007/II/5). The locus of P is the ray from (but excluding) the point 2i 
that makes an angle 7/3 with the horizontal. This is one half of the line with cartesian 


7 
equation y = x tan 3 +2=V/3x +2. 


The locus of Q is the line that is equidistant to the points 4 and -2. It has cartesian 
equation x = 1. 
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Cd, V3 + 2) 


w/ 3 


Q:|7+ 2 =|Z-A4 


(4, 0) 






xX 


The intersection of the two lines is (1, V3 4+ 2) or 1+i (V3 + 2). 


2 
[i+é(V3+2)][1-i(V3+2)]=14+(v3+2) =14+34+444V38=844v38. 0 Vv 
A661 (9233 N2006/I/5)(i) This is the circle with radius 3 centred on —4 + 47. 
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/N 
v 
lz +4- 4i] -3 
‘o 
C-O1D] , 
SS 














(ii) Given a point (C here), the line connecting it to the centre of a circle (A here) also 
passes through the point on the circumference (B here) that is closest to the given point 
(see Fact ??). 


The distance between A and C is \/(-4-0)?+ (4-1)? = V42 +32 = 5. So the distance 
between B and C' is 5-3 =2. 
A662 (9233 N2006/1/6) (i) 


0 = (ki)* — 2(ki)3 + 6(ki)? — 8(ki) + 8 
= k* + 2k°i - 6k? — 8ki+8 
= k* — 6k? +8 + 2k(k? -4)i. 
n _—+—_/_ —__——__-’ 


eS 
=0 0 


From 2k(k? — 4) = 0, we have k = 0,+2. Only k = +2 also satisfies k* - 6k? +8 = 0. So the 
equation has roots +22. 


(ii) 24-223 + 62? -824+8 = (z-2i) (z+ 21) (227 +az+b) 
= (2° +4) (2? +az+b) 


= z*+ 029+ (44 b)27 + 4az+ 4b. 
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Comparing coefficients, a = -2 and b = 2. So 27+ az +b =z? -2z +2, whose zeros are 








, = 2#VC2)-4M@) fac (29 a7 
; 


Altogether then, the equation has roots #22,1 +7. 
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155.5. Ch. 137 Answers (Calculus) 


A663 (9758 N2019/I/2)(i) Solve a? +a-1=0 using your calculator: a» 0.682. 
(ii) As usual, a quick sketch is helpful: 


y & 








0 0 A ge 0 i. “4 7 
P= f yde= f a®+a-1de-[F4+-2| -0-(5+5+1}--3. 
b 4 7D b A 70 94 92 
q- f yar =| 4 Sc] “(T+ 5-0)-( Pius - 0.682] 
a 4 2 a \4 2 A 2 
bt bP 


as —+——b)+0.395. 
4 2 








bt 6 t 
We are given that |Q| = 2|P| or ra ae b+ 0.395 = 


Solving this last equation on your calculator, you'll find four roots, of which three are 
negative and only one (1.892...) is positive. Since b > 0, conclude 6 x 1.892. SS 
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A664 (9758 N2019/I/10)(i) As usual, mindlessly use your calculator and copy: 


A 


Y 


(9) |, = (a, 2a) 











By observation, a line of symmetry is y = 0. Sa 


(ii) If y = 0, then 0 = a (2sin@ — sin 20) = 2sin 0-sin 20 = 2sin 0-2 sin 0 cos @ = sin 6 (1 — cos@) 
<=> sind=0orcosd=1 <> 6=0,7, 27. 





(iii) As mentioned in Remark 191, to properly compute this area requires knowledge of 
multivariate (or vector) calculus and, in particular, a result called Green’s Theorem. 


But in the absence of such knowledge, I can only imagine that the exam-taker was supposed 
to have blindly applied the usual Substitution Rule without having any idea of what she 
was doing. In this case, it turns out that fortuitously, this produces the “correct answer”, 
which is apparently all that matters in the Singapore education system: 


“Answer”. We observe that 
(z,y)| = (a, 0), (09) | =(a,20), and (x,y)| = (-3a,0). 


So, it “seems” that the top half of the curve C' corresponds to 6 = [0,7]. 
Proceeding blindly, we guess that the requested area is 
ydz. 
-3a 
But why this should be so is hardly clear. Surely [ ydx should correspond only to the 
-3a 


red shaded area in the above figure? 


Aiyah who cares? We have no idea what we are doing anyway. So let’s blindly apply the 
Substitution Rule as usual, cross our fingers, and hope we get the “correct answer”: 


(Answer continues below ...) 
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(... Answer continued from above.) 


a =a da dé =a dzxdé i 
dx = [ dee fi. 9 =p 
-3a ee r=-3a 7 dé dx * r=-3a V96 wart fy V6 


0 
=f a(2sindsin20) a0 = fo a(2sin@ — sin 20) a(-2sin@ + 2sin 26) dé 


™ 


a’ (-4 sin? 6 + 4sin @ sin 20 + 2sin @sin 20 — 2 sin? 20) dé 


a’ (4 sin? 0 — 6 sin @ sin 26 + 2sin? 20) dé. 


0 
ai a? (—4sin? @ + 6sin sin 20 — 2 sin? 20) dé 
I 


At é we substitute “x = a” with “@ = 7”. But since we also have x ee why couldn’t 
= 


we have substituted “x = a” with “6 = 7/2”? We don’t really know why and we have in our 
minds only some vague and ill-formed argument along these lines: “Well, I know that @ = 7 
is where the curve ‘ends’, so that’s what I should probably use.” 


At 2 just to get to the “correct” sign (as in the expression given in the question), we arbi- 
trarily reverse the limits without knowing why on earth we’re doing this. So, as requested, 
we submit the “correct answers” 0; = 0 and 6) = 7 


This has all been a bit bizarre and we don’t really know what on earth we just did. But 
who cares? Hooray! We’ve arrived at some expression they’ve told us to get and that’s all 
that matters! 


It turns out that fortuitously, the given definite integral does correspond correctly to the 
requested area. But the above “answer” arrived at this “correct answer” only by sheer luck. 
(For a proper solution to (iii), see e.g. Brilliant.org.) 


1-—cos4é 


(iv) First, observe that 2sin? 6 = 1-cos 26, sin sin 20 = 2 sin” 6 cos@, and sin? 20 = : 


Moreover, S sin? 6 = 3sin?@cos@. Now, 
TT 
[ a’ (4 sin? 0 — 6sin 6 sin 26 + 2 sin? 20) dé 
0 
TE 
= 2a? 7 2sin? 6 — 3sin@ sin 26 + sin? 20 dé 
0 

Le = 4 
= 2a? 1 — cos 20 — 6 sin? 6 cos 6 + si ao 
0 


2 Lo x9 A611. 7 2 
= 2a*|@=— —sin 26 —2sin?@+ ———sin4@| = 37a". 
2 2 8 0 


The area bounded by C is double what we just found, or 67ta?. 
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A665 (9758 N2019/I/11)(i)(a) Write d6/dt = -k (@ - 16). Now, 


dt 
t= 49 0? = -— a5 - -ZIn|6- 16+ =-7m(6- 16) +C. 


(At the last step, we can remove the absolute value operator because 6 > 16.) 


Plug in the initial conditions (t,@) = (0,80) , (30,32) to get 0 = —(In64) /k + C and 30 2 
~(In 16) /k+C. From 2 - 2, (In64-1n16) /k = 30 or k =n 4/30 = In2/15. Now from 2, we 
also have C' = 151n64/1n2 = 15 x 6 = 90. Hence, t = - (15/1n2) In (@ - 16) + 90 or 





— 90- gp Oz tn? epic 
— 5 +16=e oS ar +16=2 +16= ais * 16. 
(i) (b) 0 = oa +16=8+ 16 =24. 
(ii) Write dT'/dt = k/T. Now, t = a ar = [= ~ aT = a +C. 


Plug in the initial conditions (¢,T) = a 0) , (60,1) to get 
0=C and 60 = 1/ (2k) + C =1/ (2k) or k = 1/120. 


Hence, t = 60T7. It’s first safe to skate when T = 3 or at t = 60-3? = 540. 


A666 (9758 N2019/II/1)(i) Choose u = a: 


y! Vv Vv 
/ 
u i —<———- > 


[P0-ntar=20-n'e- [T]2 0-2) Ja =-F01- ot - 52 0- 2)i+C 


--3(1-2)!|e+2(1-«)]+C=-2 (1-2)! 0 +30) +0. ® 


d 1 du 9 
ii) Apply — to u=1-2? to get 2u— = -1. 
(ii) pply — tou x” to get 2u— 


Note that from é. we actually have u=+V1-~2. So, let us use u $J/1-2 8% Now, 


feG-2)! dv f (1-w)udw? f (1-0)u (- dus) 


2 2 ss 5 2 3 = 
f 2ut- du? du = Sub - 209 + C= (1-2)! - (1-2)? +6 


2 2 
= = (1-2)! [3(1-2)- pl eC oa x)? (2432) +0. 


(iii) Our answers in (ii) and (iii) differ by C —- C, which is a constant. 





675We'd also arrive at the same answer if we instead used u =—-V 1-2. 


2082, Contents www.EconsPhDTutor.com 


A667 (9758 N2019/II/3). Let S and V be the surface area and the volume of the 
cylinder, respectively. Then S' = 2mr (r +h) = 900 or h * 450/ (rm) -r. 


And V = mrh = mr? [450/ (rm) - r] = 450r - nr. 


dV 
Compute ae 450 - 3nr?. The stationary points of V are given by 
r 


dV 


1 
Ae =() —> 450 — 37r2 = 0 — rs at 
r 


T=? TU 





Since a > 0 for all r € [0,7) and = < 0 for all r € (7,00), 7 is a strict global maximum 
ip 


is 
and the corresponding volume is 


1 1 1 150 ff 1 
(F) = 450) /— 2 - | — 50 = 450y/— 2 -n— |= ie = 300) /— s = 1500/5 
50/ 150 


ss 7 \/150 150 150 1 


hr) i. - «/150)r ~ Bo wo ~ 450150. 2 











A668 (9758 N2019/II/4)(i) f’ (x) =2sec 22 tan 22. 
7 (hy e2 fa (x) tan 2x + 2sec 2x sec? 22 | = 2f' (x) tan 2x + 4sec? 2z. 


f (0) =1, f’ (0) =0, f" (0) =4. So, sec2x 41+207+... (for x € (-7/2, 1/2). % 


0.02 1 0.02 9 0.02 
(ii) f sec2rde + fi 1420? 4...de=[e4 5% +...| 
0 0 


0 
= 0.02 + (2/3) x 0.02? + --- = 0.0200053 + --- 0.02001 





(iii) 0.02000533546765866 --- * 0.02001 ® 


(iv) Very close. Good. 
(v) Undefined at 0. 








A669 (9758 N2018/I/1)(i) — = ——~ = 
LL LX 


l l 1 i 
(ii) f ™Zar- -{3- aE de + f Sar @ 224 f San--24-* +0 
a x x x Lo 


oa l 17 l i Int. 1 
Thus, [ Fax =[-=* - =| -|-=*-=]-|-==-]--=+1, ® 
1 @& 1 


dy dlInzx 1 1 (-5) 1 Ing 
= =—-—4¢+lnz{- : 


x x 
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3 
A670 (9758 N2018/I/2)(i) For #0, —=7-2c <— > 3=7r-22? <— > 27°-7r+3=0 
X 


pe TD EVE = 4(2) (3) _7#V25 7451 
4 


A 2 





3(3) a 


The 2z-coordinates of A and B are 1/2 and 3. 
(ii) As usual, a quick sketch can be helpful: 











The requested volume is 


fame 9] (C9) (Geet 


dy d 
A671 (9758 N2018/1/3)(i) ytuc2 => ~ 2 sa + Qua. 
x 


d d 
Plug 2 and = into the given equation 2 = 2y — 6 to get at + Qua? = Qua? -6. 
a a 


Rearranging, a = mo fora), 
dg. a 


(ii) u= [ -ar=5+0. So, y= unr? =3+ C2’. 
a i 


Plug the initial condition (2, y) = (1,2) into 3 to get 2=3+C-12. So, C=-land y = 3-27. 
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a2 -4y? 1 5 
A672 (9758 N2018/I/7)(i) [4 => a? = Ay = 
£ 


zy. 2 ( > + ary”) 


1 
2 


1 d 1 d 

ee ae) Be rea) 
dy 1 2 Yy 

a oe =7-- — = —_———_*— (for 2 1 
da (8y+ zy) =2 54 = Step eles (for 2ry + 16y # 0) 


(ii) Plug 1 into the equation for C: 





Now plug in (x,y) = (1, 1/3) and (2, y) = (1,-1/3) into = to get 


dy _ 2-1/9 Wo Wy 17 
2/3+16/3 54 





dz dz 7 
(x,y)=(1,1/3) (x,y)=(1,-1/3) 


So, the two tangent lines are y - 1/3 217(x-1) /54 and y + 1/3 2 -17 (2-1) /54. 


Taking 243 yields 2y = 0 or y= 0. And now from 2 x =-1/17. Thus, the coordinates of 
N are (x,y) = (-1/17,0). 


dy dy/d@ 4sin@cos@ 2sin@cos@  —- 2sin@cos@ 
dz dz/d@ 2-2cos20 1 -cos26 - 1-(1-2sin?¢) 
2sin@cos@ cosé 
=e = cot? (for @ + 0,71). 
2sin? é sind ona 
(ii) The point on C at which 0 = a is (2, y) = (2a — sin 2a, 2sin? a). The normal at this 


point has gradient -1/cot a = —tana and equation y - 2sin?a = - tana[z - (2a - sin 2a)]. 


A673 (9758 N2018/I/9) (i) 











0 - 2sin’ 
y= 0, then « = —=———~ +. (2a ~ sin 2a) = 2sina cosa + 2a ~ 2sin cosa = 2a (ees), 
—tana 


2 
(iii) First, note that since 6 €[0,7t], sin? >0. Now, 


(st) + (4) ao= (2 - 2cos26)° + (4sin 6 cos 6) dO 
dé dé Sg 
ae 2 2 Y a 
=f, (2-2cos 26)" + (2sin 20) a= | \/ 4+ 4cos? 20 — 8c0s 26 + 4sin 26 dé 
Y ny ay 
= 1+ 4- 80032600 = 8 (1 -—cos 20 ao= f 8 (2sin? @) dé 
jy VS eosvj ao =f” 8 (25i) 


7 7 
=4 [sin 6| a2 f sin 6 dé = 4[-cos@]} = 4 (cos 6 - cos7). 





2085, Contents www.EconsPhDTutor.com 


ddl 
A674 (9758 N2018/1/10)(i) Apply — to L— + RI+L=V to get 


dt dt C 
d.d/l ad dgq_ d d2] di fidVv 
—j,— + — HAI + — L—~ = 
ae ao an > d2 ae OC dé’ 
dV d2I dil iT 
If V is constant, then aE =0 and = + becomes Eg eras a 7 0. 


d z 
(ii) Apply ai to I 2 Ate" twice: 


dl R 4 , _Rt R 
an =A(er! aL — = te” HE) 4 Ac i (1-4), 


a2] R R ae, R78 R R 
ES) fii ane ie a4 (as | ay. alee (sae le ea ae le ee 
di l-sr° OL” | st} = Ht (- nal aL” Ht (- )s OL Ht (t- ) 


Now plug Z < and 2 into 2: 


R _w(R R\ Ate 
0 = LAS-ett (4-2) +R #(1-—#)+ A 


2 2 
-5(=!- 2) +R(1-t)+5-5(S)+R(-s)+5-- + 5-45 





2\20 21 CG 2h 2L C 4b C Alp  O- 
AL 
Rearranging, C= Re 
(iii) The stationary points of J (as a function of t) are given by 
dl 4 Rt ee = 2 
—| = Ae 2r | 1-—t] =0 t=—. 
ae _ _ ( OL ) ad R 


Note that A and e~2" are always positive. 
Since dI/dt > 0 for all t[0,#) and dI/dt <0 for all t € (¢,00), ¢ is a strict global maximum 
of J and the corresponding maximum value of I is 


21 JAI; BAses 3A 
I(t) = Ate"? ee aes ti 7 = =—, 
(Oe a z R eR e-4 2e 





(iv) i 
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A675 (9758 N2018/II/1)(i) x= aw f (4-15) "dy =5(4-15) +C. 


Plug in the initial condition (x,y) = (0,69) to get 


0= 5 (28-15)! += 5.8840 =5-4+0=18+C or C= -18. So, ot 5(qv- 15)" — 18. 


3 
2 2 1 3 
Rearranging, y = 3 (E (x + 18)| + i} mG [2 (a+ 18)]? +45, 


dy _ 1 

i) “| -4 = 3-15)3 
ae (y/3 - 15) 

Plug 2 into = to get the corresponding z = (9/2)- 4? - 18 = 72-18 = 54. 


So the requested point are (%, y) = (54, 237). 


240 <> §=3(49+15) = 237. 


A676 (9758 N2018/II/4) (i) First, for all x € R, 


(2x)? (2x)" _ (x) oat 408 
212977 ———+.... 
2! 4! 6! “oe AD5 








cos 2x =1- 





Next, for cos 2% - 1 € (-1,1] or cos2xz #0 or x € [0,7/4), we have 


2 Ax6 
In (cos 2) = In(1 - Qn? +oat- 4...) 





45 
2 
2 AS D7? £2g4 =. ~I72 +...) 
a 
45 2 a 
2, 4x6 (424-8054...)  (-826 +...) 4, 64 
= (97? 4 274 = .)- SY foe 20g a pe 6 
(207+ 5 45 2 a eB a5 
As already stated above, the expansion is not valid for x = 71/4. 
_ In(cos2x) , 1 f -2u?- $at-# 
(ii) fat / 2 dx= f —2- +... dz 
4 64 
es |, ae 5 ees 
wg" 995" 
0-5 In (cos 2x) 4 64 a7 i 2 
——_—— dx = | -2x - =? - ——2° “4 = -]-— - —— +... w -1.0644. 
I, em | eg 395" I 18 225 





(ii) [= (cos?) x» 1.0670. © % 
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A677 (9758 N2017/I/1). For az (-1, 1], x 





an : (20r)" (ax) | (ax)" 
e"In(1+az) = oar FO 4 | aaa -...| 


(ax) (ax)” 2. 2.9 3 a*\ 5 2,0 3 
= ax —- 5 + 3 + Qan* — aren? + 2an’? t+: = ant ae vt +{|2a-a ae Cia re 





If 0 = 2a-a?/2 = a(4-a) /2, then a = 0 (discard) or a = 4. 


d 
A678 (9758 N2017/1/3)(i) Apply — to y? ~ 2ey + 51? - 10 +0 to get 


dy 


dy 2 
2y— -2|y+xrx—)]+10r =0. 
Ya (y oe) a 
. , dy 3 5 4 
The stationary points are given by a 0 or —2y + 10x = 0 or Ga oL; 
x 


Plug = into + to get 
0 = 25x - 10x? + 5x? - 10 = 202? - 10 or =i? or g = +1//2 = +/2/2. 


Thus, the x-coordinates of the two stationary points of C' are +\/2/ 2 


(ii) Let the stationary point with x >0 be P = (V2/2, 5V2/2). 


d 4 wy dy (“2 dy eu) 5 
Apply — to =t t 2) | — —}]-2{,—+— — 10 = 0, 
PPY dx oe bere (32 . Ya dz : dx eg r 


d 
Plug - 30 into 2 to find that at P, 
at 


Re a By 5 1 _/2 
ay —2¢ 4% 41040 or eZ 


——$s — ——— eS SS 0. 
dx? da? da gay +/2/2=5/2/2 4 : 





Since the second derivative is negative at P,°” P is a maximum. 





®"6See Proposition 11 (Second Derivative Test) 
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A679 (9758 N2017/1/7)(i) List MF26 (p. 3) contains this Sum-to-Product formula: 








P P- 
cos P — cos@ = -2sin ey oe @ 
2 Z 
Equivalently, cos (A+ B) -cos(A- B) = -2sin Asin B. 
: ; . 1 
Rearranging, sin Asin B = 5 [cos (A - B) -cos(A+ B)]. 


i 
So, ‘i sin 2mz sin 2nz dx = . 7 cos (2max - 2nz) — cos (2mz + 2nx) dx 


1 


i oe - 
= f cos2(m-n)x- cosa men)ade = }[2en—nle_smr(n ends 


2(m-—n) 2(m+n) ee 


2 


(ii) [ (f (x)) dx = [Gin Qmx +sin2nx)* da 


T™ 
= [ sin? 2mz + sin? 2nx + 2sin2mzx sin 2nzx dx 
0 


™1-—cos4mx 1-cos4nz 


= [ DO in Oe Sin Qn dx 
0 2 2 


sin2(m—n)x_ sin2(mt+n)z 
2(m-n) 2(m+n) 





Ts. I 
= |x - — sin4dmz - — sin4nz + 
8m 8n ; 
Since that sin (k7t) = 0 for any integer k, this last expression simply equals [z]j = 7. 


d 
A680 (9758 N2017/1/11)(i)(a) = 0. (b) 4+ 2.5c = 29. So c=10 and v = 4+ 10¢. 








1 
(ii) oe =e-kv=10-ko (with k >0). If 10-kv #0, then 2 TG So, 
1 1 
Sav f ae or t = —FIn|10 - kvl + Ch, 


or, k(C,-t)=In/l0-kv| or |10-kuj=eO) or 10-kv=4e"e = Cye™. 
Plug in the initial value (t,v) = (0,0) to get 10 = Cy. Thus, 10- ku = 10e™ or 


v=— (1-6). 


(iii) This object’s terminal velocity is lim us lim 10(1-e") /k = 10/k = 40. 
So, for this object, k = 1/4. At 90% of terminal velocity, we have 


lae"S99 or (i=. or t = -1n0.1/k = 41n10 » 9.21 seconds. 
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A681 (9758 N2017/II/4)(a) As usual, a quick sketch can be helpful: 








y=27-6r+5 





Find their intersection points by plugging the line’s equation into the curve’s: 
(x-1)/2=2?-6r+5 — 2x" -132+11=0 


134 y(-18)-4(2)(1) 1349 | ap 


20) A io) 





=—_ GE 


So, the area of the plate is 





11/2 as] ; ve 13 ee! te 243 
- _ S| ee = = SI x 15.2. 
f 5 (a? -6r+5) dx | ra i: >] 6 15.1875 # 15 ® 


Oo [(24) yen fod 5 Lap ha laaz- a) ecw ” 


(ii) The volume of the second container is four times that of the first: 





TT TU 
=4 => Pata dl = —> O=40 <4) aa". 
2(B—b)  2(a2—a) ai al “a 





(4-44) (a-@)_1aVvinar@ 


By the quadratic formula, b = 7 9 








Observe that since a > 1, -a+a?>0. So, 


1-Vl-a+a@ 1-V0 1 ig Sew rere. lat 1 
= WwW. = ‘ 
2 2 Dy 2 2 2 








Since b> 1, we may discard the smaller value and conclude: 


1+Vl-a+a? 
5 : 
2090, Contents www.EconsPhDTutor.com 





b= 


A682 (9740 N2016/I/2). We'll do this without a calculator: 


First, observe that 2°°°* = Qin 20" = elcos)(In2) | So, 


y! (x) = + e(oay(n2 ~ e(cos x) (In 2) (—sin x) (In 2) = JCS gin (In vay ® 
x 
Hence, y’ (0) =0 and y’ (=) =-In2. 


(ii) The tangent line at x =0 is y—y(0) = y’ (0) (x-0) or y-2=0 or y=2. 


The tangent line at x = 71/2 is 


y-y(2)=9'(F) (2-3) or y-12-in2(z-F). 
1 
Plug = into = to get 1=-In2(2-7] or poe 


1 
So, the tangents meet at (7 —-—, 2). 
2 dn2 


A683 (9740 N2016/I/8)(i) Compute 

f' (2) = asec? (az + b) = a[1 + tan? (ax + b) | =a+atan? (ar +b) =a+ay?=a(1+y’). 
f" (x) = 2ayf" (x) = 2a*y (1 +y"). 

f" (a) = 2a? ka (x) (1 +y") +y-2yf' (x) | = 2a’ f' (x) (1 . 3y”) = 2q° (1 +y") (1 + ay”). 


(ii) f (0) = tan(a-0+ 7) =ta nZ=1  f'(O)=a{1+L[f OP} = 2a, 
f" (0) = 20? f (0) (1 +[f (0)]) = 20? -1-2 = 4a?. 
f'" (0) = 203 (1+([f ba 3 [f (0)]°) = 20° -2-4 = 160°. 


: f"(0) 2, £"O) 8 
So, f(z) =f (0)+f'()a+ 1 5 pete) =142ar4+ 2072? + 3a'a" +... 5 
(iii) f (0) =tan(2-0+0) = tan0 =0. f' (0) =2{14[f (0)]*} = 
f" (0) = 2-2 Ff (0) (1+[f (0)]) = 
f" (0) = 2-27 (1+[f (0)]’) (14+ 3[f (0)]°) =2-2?-1-1=16. So, 
F(x) =f (0)+ fO)a+ ge Pe, aes Bate... re 
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d dy. dd 

A684 (9740 N2016/1/9)(i)(a) Plug y = — and + = a into the given equation to get 

dy dy 

— + 2y = 10 — =10-2y. 

ae Or at a 
(i)(b) For 10 -2y #0, we have 2 : So 

- we have — = 
oes dy 10-2y "" 


dt 1 1 1 1 
t= | —d -{— a --; | —<a = -=Inly —5|+ Cy, 
Pd pj 9) gan 5 nly — 5] + C1 


or, 2(Cy-t)=|y-5] or y-5 = 40%) = cee = Che or y 25+ Coe. 


Plug the initial values (+, y= =) 2 (0,0,0) into 2 to get 0=5+C> or Cy =-5. 


So, y= 5(1-e). 
Next, x = Sat= f yat= f5(1-e”) dt 25(t+0.5e2) + Cy = 5t + 2.50 + C5. 


Again plugging 2 into 3 we get 0=0+2.5-1+C3 or C3 = -2.5. 


So, x = 5¢+2.5e°% - 2.5. x 
d a2 1 1 
(ii) First, — " <a = [ 10 5sin Stdt = 10 + 10cos st + C. 


Plug in the initial values 2 to get 0=04+104+ C4 or Cy =-10. 
d 1 1? ie 
Next x = [ — dt = J 108+ 10cos5t - 10d¢ = 10(= +2sin 5-1) + Cs. 


Again, plug in 2 to get 0= 10(0+0-0)+Cs or Cs =0. Hence, 


t 
x = 5t? + 20sin 10t. 





(iii) For Model (i), 7=5 = 5=5t+2.5e7%-2.5 <= tw» 1.47 (calculator). * 
For Model (ii), 7=5 = > 5 =5t?+20sint/2-10¢ <=> t* 1.05 (calculator). % 
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A685 (9740 N2016/II/1). Let V(t) and h(t) be the volume (m°) and height (m) of 
water in the cone at time t (min). 


From tana = 0.5, we have r/h = 0.5 or r =0.5h. 


io 
Plug this into V = ah to get V = ah or = (=v) ; 








TU 

dV m..,dh dh1 4 aV 

By the Chain Rul la d 
he Sea dé 12° dt ~ dt whe dt 


When V =3, h= Te . Also, — 20.1 (for all t). Plug 2 and 2 into = to get 


dh 4 
dt (36/n)7 





2/3 
0.1 =0.10-1/ (=) ~ 0.0251ms7! 


A686 (9740 N2016/II/2)(a)(i) Use Integration by Parts twice: 


sin nx sin nx sinnzx 2f -cosnx —Ccosnx 
f Peosnade =a" -2 f de = POM =| -f dc] 
n n n 


n n n 











9 sin nx cosnx .sinnx 
Se 
n n 








1 
+C = —(n’2’sinnz + 2nxcosnx -2sinnx) + C. ® 
iP a 


(ii) We'll use the facts that the value of sin at any integer multiple of 7 is 0 and the value 
of cos at any even (or odd) integer multiple of 7t is 1 (or -1). 


a 1 1 1 
1 27t 
i x cosnx dx = — ae sin nz + 2nx cosnxz — 2 sin na” =e [2nzx cosnax 7 
T nr nr 


2 
Ot ae for n even, 
2 27 n n 
= — (27-1-7cosn7) = — (2-cosn7) = 
De ne On 
72 [2-(-1)]= “2? for n odd. 


So, a= 2 if n is even and a = 6 if n is odd. 


(b) From u = 9-2”, we have “ 2 _2r. The volume of the solid is 
a 











2 3 sg. 
2a... L U2 

x f oe = ; [le LE) are (0-2 ———— a ii= =x ff 7” x dx 
oom (ou du ne ad 5 2 git pe auet [Po -Fa 
DO Je=0 U2 a “2 U2 wit 2 U2 —2Io wu w 

T 9)? 9 9\ m/f. 5 4 
= S| =| ioe. alhOe |= (ine owas, 

5 [imal + =]. > (In +e n =) (ino + =| ® 
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A687 (9740 N2016/II/3) (i) If y = 0, then cost = 1, so t= 0,27 and 2 =-1,2m-1. 
So, the z-intercepts are (-1,0) and (27- 1,0). 








A 
y 
When t= 7, 
(x,y) = (m+ 1,2) 
When ¢ = 0, 
(x,y) = (-1,0) C 
> 
When t = 271, 
(x,y) = (27 — 1,0) 

y = 1-cost is maximised at cost = -1 or t=7. So, the maximum point is (7+ 1,2). at 


d 
(ii) Compute — +1+sint. Now, 
=n ae oa (a a dt 
dx= fy adet f  (1-cost) (1 +sint) 5 
t=0 yor t=0 Vat dx ‘ t=0 tee ra 


a a 1 
=e L+sint—cost—sinteostdt = [ 1+sin¢—cost— 5 sin 2¢dé 
0 0 


a 


1 1 a 
= [i cost -sint + 5 008 2 =a-—cosa-—sina+ —cos2a+-—. 
4 0 4 4 


dx dy 





d 1+si 
(iii) The given normal has gradient | = : = is au = 
dy lt=n/2 dt = dt !t=n/2 sint lt=n/2 


So, its equation is 


y-(1- cos) =-2[2-(F-cos =) | or y=—-224+7+1. 





1 
Hence, E = (=o), F = (0,7+1), and the area of AOEF is 
l7a+1 (m+1)° 
— 1) = ———.. 
ZS a 
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A688 (9740 N2015/1/3)(i) We 1/2 
have graphed some continuous 
function f :[0,1] > R. 


The area bounded by f, the z- 
axis, and the vertical lines x = 0 


1 
anda =1is [ f (ae) de: 


Consider the two large red rect- 
angles. Their total area is 


2L/(a)+1(5)} 


which serves as a crude approx- 


1 
imation of i 7 (2) da, 
0 





Next, consider the four smaller blue rectangles. Their total area is 


Wav G Gh 


which serves as an improved approximation of A. 


It is plausible then that as n grows (and the number of rectangles grow), the following 
expression serves as an ever-improved approximation of A: 


ae dt ae) 
It is thus plausible that 


mala) fla) +7 GFL £@ & 


(Note: We haven’t proven that + is true. We have merely presented an informal argument 
for why it might be true. Which is all you need to know for A-Level Maths.) 


(ii) Suppose f : R > R by f (x) = ¥/x. Then 


2 (ee) Ee Bene FF] -2r(2) 12) 


Hence, by (i), 2 (fre de= f° Yada =" [oi] = 3 % 
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A689 (9740 N2015/I/4). The rectangle’s perimeter is 2(~+y). The semicircle’s is 
2x4 +7x. Adding these up, we have 


d 
d=2(xt+y)+2n +70 = (4+) 0+ 2y or yi 5-(2+2)e. 


Tl 
The rectangle’s area is xy. The semicircle’s is 5f So, the total area is 








d 
sys Fa? ta[S-(247)o| + —7 = Qn + =a. 
oe a ‘b d/2 d 
This is a n-shaped quadratic polynomial that is maximised at 7 = -——~ = -——_ = =. 
2‘a 2(-2) 8 
2 2 2 2 il 
So, the maximised area is -2/ + (5) = = + Q = (and k = —). 
8 2\8 52 16 .32 o2 
A690 (9740 N2015/1/6)(i) For 2x € (-1,1] or x € (-0.5, 0.5], 
2x)? (2x)° 
In(1+ 20) =20- OZ), CH) op ants Bat a 


(ii) For |ba| <1, 


c(c-1) (br)? ‘ c(c- 1) (c-2) (bx)’ 7 
2! 3! 





ax (1+bx)° =ax}1+c(br) + 
2,1 9 3,1 48 4 
= ax + abcxr + sab c(c-1)2 + gab c(c-1)(c-2)a*+... % 


1 8 
Comparing coefficients, a é 2, abc 2 -2, xabrc (c-1) 3 3 
it, 2 4 1 i Se 
Plug = into = to get b=-1/c. Then plug = and = into = to get 
1 8 3 


8 
—_=-— —>? ]--=- —> C=-x. 
a 


c 8 D 


5 
And now from = we also have b = > So, the coefficient of «* is 


bcte-nee-a)-£a(9) (3)(Q)(-9)-3()) een 
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n/2 
A691 (9740 N2015/1/10)(i) A; + Az = [ cos x dx = [sin gr? 2 = 1. 
0 


By symmetry, the area under y = cosx from O to P is 
m/4 
Ars f cosa dx = [sin]! oe 
0 


Taking 22 yields 


or or A, 22- V2. 


Now plug 3 into = to get Ap = 1- A, = V2-1. So, 


Ay _2-V2_2-V2V2+1_ 2/2-24+2-V2 


As.” st +9212 741 2-1 











an 2, 


2/2 2/2 
(ii) The volume of the solid is nf ax? dy = nf (sin y) 2 dy. 


Tl 
(iii) From y = sinu, taking u € |-5. 


mla 


d 
| we have u=sin ly. Also, 7 2 cosu. So, 
u 


V2/2 V2/2 y=V2/2 u=m/4 
mf (sin"ty)?dy +n f wady =m f ee mf u’ cosudu 


m/4 
=rfursinu- ff 2usinuda| =n{2sinu+2ucosu- f 2cosudu| 
0 


m/4 


0 


2/2 Z 2 (1 
= n[u2sinu + 2ucosu-2sinul” n|(Z + Ne - v3) -0 Pe (= ae 2), 


(So, a=0 and b= =.) ® 
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A692 (9740 N2015/I/11)(i) For 6 #0,7/2 or 0 € (0,7/2), 


dy dy dx _ 6sin@cos?6-3sin’@  2cos?@-sin?@ .cosé sind 








ee ee ee - = 2coté- tan 0. 
dx dé dé 3 sin? 6 cos 0 sin 6 cos 6 sind cos@ in 
is ; dy ae = — _ 
(ii) Stationary point <=> 0=— | ; 7 2cotd — tand <> 2=tan?@ <> tand=+vV2. 
7 \o= 


Since 6 € (0,7/2), tan@ > 0. So, we can discard tan@ = -/2 and conclude that the sole 
stationary point is at tan@ + J/2. 


At this stationary point, sin@ = \/2/3 and cos@ = \/1/3. So, 


(eu) = Vi 2 Val-(( -((5) ) a) 


dy d dé  —-2cosec?@ — sec? 6 
2cot 6 — tan@) = (—2cosec?@ — sec? 6 = 
dr ar ) ( le 3 sin? @cos 0 

For all @ ¢ (0,7/2), this second derivative is negative (because it has negative numerator 
and positive denominator). Hence, @ in particular is a strict local maximum?” and hence 
also a turning point (k = 2). 








Next, compute 


(iii) Since 6 € [0,7t/2], we have y = 3sin?@cos@ > 0. So, C is entirely above the z-axis. 
Moreover, x = sin? @ is strictly increasing in 9. Hence, the requested area is simply 


d=7/2 d=71/2 O=1/2 
f ydx =f 3sin®Ocosddx = [ 3 sin? Bcos EE at 


d=71/2 m/2 
= f 3sin’ 6 cos 6 (3 sin” cos @) dé = [ 9sin* @cos?6d0 = 0.884. ® 
=0 0 
(iv) Plug C’s equations into the line’s equation to get their intersection points: 
year => 3 sin? 6 cos @ = asin® 0 —> sin? 0 (3.cos6 — asin@) = 0. 


So, they intersect at sin@ = 0 (the origin) and 3cos@- asin@ = 0 or 3/a + tané (the point 
P), 


If the line passes through the maximum point of C’, then P must be the maximum point 
of C. 


From (ii), the maximum point of C’ is at tan @ = /2—plug this into + to get 


3 3 
~=/2 or a=—. 
a 


®7Proposition 11 (Second Derivative Test). 
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A693 (9740 N2015/II/1)(i) The maximum height is attained when 


dh 1 1 
= ae A) eee =32. 
a; 0 or 0 6 5f 0 or i 
dt 1 10 


(ii) For h < 32, So, 


dh /i6-th 
dt 10 i 

fe —dh= | ———dh=-40 (6-2 hee = 90y A ha 
dh /16= 2h 2 


Plug in the initial condition (h,t) = (0,0) to get 0 = -20V64+C or C = 160. 
Hence, t = -20V 64 - 2h + 160 and 


~20/64 — 2-16 + 160 = -20\/32 + 160 = 160 - 80V2. 


Oly ag = 


A694 (9740 N2014/I/2). Apply < to xy + ry? +54 +0 to get 
a 


Spee ae sO, 
dx dx 


d 
Plug in of 2746 get 
dx 


0 =2Qry- a2? +y? - Qty = y’? - 2? or lee! or y 2 £2. 


Plug 2 into + to get +72 +2° +5420. 
Since 54 #0, we can discard the negative value in 2 and conclude that ye 


Now from Z we have 27° + 54=0 or « =-3. 


Hence, the unique point on C at which the gradient is 1 is (-3,-3). 
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A695 (9740 N2014/1/7)(i) a % 1.885 (calculator). x 
f(«®)=-7 => a2 aT =<7 —> xv‘ (a*-3)=0 —=> x = 0,+V3. 


Since 3 > 0, we must have § = V3. 
a 3 
(ii) fe (7) dz= i 2° — 32-7 da = E - 5 - 72 | B = —0.597. ® 


B / = Zt af 
(iii) [ f (x) dz = |= - G05 - 72] 99 - — V3 -—V3-7V3. 
0 7 9 t 5 
af Zt 54 
So, the requested area is — (=v3 = v3 - 7V3) ~ 7/3 = a5 V3. 


(iv) f (-z) = (-2)°-3(-2)'-7=2%-3c!-7=f(z). Vv 
If y solves f (x) = 0, then so too does —y. 


So for example, since a * 1.885 solves f (x) = 0, so too does -a * -1.885. 


Remark 223. The last question was strangely open-ended. I think the above answer 
should have sufficed for the full four marks. But of course, who can divine what was on 
the mind of those who wrote this question? 


Here are two other things that could also have been “said”: 


1. Compute 7 (2) =67 = 12 = 67° Ge —2). So, for x >0, f’(z)>0 — «> V2. 
Hence, the only positive root is a and the only other real root is -a, while the other 
four roots are complex. 


. These we can even find using only what we’ve learnt in H2 Maths.°” Write 


x® - 321-7 =(x-a) (ata) (ax' + bar? +c) = (2? - a”) (a* + br? +0). 


Comparing coefficients on the x* and constant terms, we have —3 = —a? +b and —7 = 
-a’c. And thus, b= a? -3 and c= 7/a?. 


Now let z= 2”, so that x4 + br? + c= 2? +bz+c=0. By the quadratic formula, 


_-bt V8? -4c 3-07 + Jat—6a?+9 - 28/a? 
ee 


2 
So, the other four (complex) roots of f (a) =0 are 


3-a? + /a‘t - 6a? +9 - 28/a? 


Lot /z=+ 
4 








678In particular, without knowing how to solve cubic equations. 
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A696 (9740 N2014/I/8) (i) 7 f (x) dz = dz = sin! 5 Te al 


IT 











- 1 41 1 ay? 
Gi) 1) = Reg=5= = =3]1-(3) | 
1-(§) 
3 
ste fe ry) (-4)(-2[-@] (4) ( 2) (-3)[-(4) ] 
“3 «(-5) -(5) aT 7 3] 
t. 4 5 
=3*5a° * gag? * 34090" * 
(iii) sin! pdt (x) dz - ee: kg 24 coats Toate... de C 
LG ae = + : —_ 73 + 2s z ag! 





3 ©6162 3240 244944" 
Plug in x = 0 to find that C; = 0. So, 


yh 1 1 1 5 
sin7! c+ x3 + xv? + eon 








3. 3° 162" "3240 244944" 








679See List MF26, p. 4. 
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d d 11 
A697 (9740 N2014/I1/10) — Pk(1+2-2), (+2, 2 (0. 7 ;): 


1 Lf") -5 
Pluie 24a e Sah -( = 
(i) Plug = into = to get ri ‘| tals ri 5 


(ii) 1+e-a®=2-(r—2) So, 
i] 1 il 
Sima ape! ea 


J 
nV) (x - 1/2) Ce 3 8 Ve N2) ey VOR AL LG 


























+ 
A "Bi (x - 1/2) 2,/3/A /5/4+ (x - 1/2) J5+2x2-1 
Plug in 2 to get 0 = /5In1+C or C =0. So, t2J/5In Voce =, for c€ [0,5]. 
V5+2r-1 2 
(iii)(a) ¢|, = V5In we os (iii)(b) ¢| _, -V5in 8 2.152. 
(iv) Take 2 and rearrange: e!/V® = ao —— 
V5 _ V54+2e-1_ V5-1 V5 L=V5 6, V5+1 
x+ => 7s +——— 0,V5In 
ayia i 2 et/V5 4.1 2 J5 211 
op & 


i 























> 
1 
(vim 24.0) w (2.152, 0) 
J5-1 
68°We can further simplify: V51n _ 1 V5In wee : = V5In (S= *) = V5In = =e 
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A698 (9740 N2014/I/11)(i) By Pythagoras’ Theorem, h = V4? - r? = V16-1?. 


The hemisphere’s volume is 27tr?/3. The cone’s is mtr?h/3 = mr?V16 —r2/3. So, the toy’s 
total volume is V (r) = 27r?/3 + mr?V 16 — r2/3, for r ¢ [0,4]. Next, compute 


3 1 2 
V(r) = 2m? +2 (ae T0=78 - Es) = mr] ar 3 (2vI0=7F- 
-T 


16-r? 
= | r+ OO | = mr ( “—*)- Tr ( 
3 V16-r? 38V16-r2/ 3V16-r? 








6rv 16-724 32- 3r?). 





Any stationary points 7 are given by V'(r) =0 <> 7, =0 OR 674\/16 - 77 + 32 - are 0, 


Since V (7,) = V (0) =0, 7 does not maximise V. Now, from 4, 
Square 
GF5y/ 16 — 72 = 37? 32 ==> 367? (16 —F2) = OF4 -— 1927? + 1024 or 4574 -— 7687? + 1024 20. 


Making the dubious assumption that r; must be a stationary point of f, 7, satisfies 2. 


(ii) Equation 2 is a quadratic in r?. By the quadratic formula, 


768 + \/768? — 4(45) (1024) 3844 101376 


<2 
"> 2 (45) 45 


~ 15.609, 1.458. 


So, in z the two positive solutions for r, " 
are /15.609 ® 3.951 and V/1.458 » 1.207. V (71, V (71) * (3.951, 139) 


(iii) We can verify that 3.951 satisfies 2 
while 1.207 does not." (iv) 


Hence, 7; ¥ 3.951 and 


h(r1) = \/16 - 1? » 0.625. 





Vs 











Square 
6811t was at ==> that the extraneous solution 1.207 was introduced—see Ch. 42.1. 
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A699 (9740 N2014/II/2). (See List MF26, p. 2.) 








922 +a-13 A Be+C  (A+2B)a?+(2C-5B)x+9A-5C 


(Qn —5) (2249) Qe-5 a2 +9 (2x — 5) (x? +9) 


Comparing coefficients, A + 2B 2 9,2C'-5B 2 1, and 9A-5C 3 _13. 


2 9x? +x2-13 


Solving, A=3, B=3,C=8. H 
olving, ee Jy Qa —5) (22 +9) 





dx equals 





2 2 
[ rn dx = [>In [2x— 5] + >In (0?+9) ‘ene =| 
0 Q2a-5 224+9 £249 2 2 2 3 Jo 
3 3 8 2 0O\ 3, 13 8 2 
= = (In -In5) +5 (In13~In9) +5 (tan! ~ tan" =) = 5 Inge tg tan’ 3. ® 


A700 (9740 N2013/1/5)(i) On [-a,a], we have [f (x) ]° + (z/a)’ = 1, with f (x) > 0. So, 
this portion of the graph is a semi-ellipse with y-intercept 1 and x-intercepts +a. 














We have an identical semi-ellipse on[-4a,-2a] (which is 3a to the left of [-a,a]). 
And again on [2a,4a] (which is 3a to the right of [-a,a]). 


And again on [5a, 7a] (which is 6a to the right of [-a,a]). But we’re asked to graph only 
up to x = 6a. So, we just have a quarter-ellipse on [5a, 6a]. 


For all other x, we have y = 0. 


(ii) From x + asin@, we have da/dé 2 acos@. We specify also that 6 € [-7/2, 7/2], so that 


Vcos? 6 2 cos6. Note that both a/2 and V3a/2 (the lower and upper limits of integration) 
are in [-a,a]. Now, 


V3a/2 V3a/2 2 i r=V/3a/2 —> dx dé 
I, f(z)d= f yi-ge fl, V1-sin OT aid 
b=71/3 n/3 m3 
. Vi-sin? dacosoda= f Veos?facosida a J cos’ 6 dé 
TU TT 


d=n/6 
7/3 cos 20-1 | 18 «x 
= a = —si 2 = = us 
a I, 5 dé=a F sin 20 |... 19° ® 
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d 1 

A701 (9740 N2013/I/10)(i) Since z < 3/2, 3-220. So, — = 55; and 

z 3-22 

1 l 
ue dz= f v dz=-jm3- 22|+C, = —FIn n(3-22z)+Ci, 

3.C 3 

or In(3—2z) =2(C, -2), or 3-22 =e "Ge = Che, or z= in ae = 9 C30", 
d 5 

(i 7 =Z= 3 “-F Ce So, 


eee dy d dy 22 _ 3 —22% = = = 
(iii) age aor a = —2Ce =-2(5 + Cre )+3=-20% +3 (a = -2, b= 3): 


3 
(iv) In 2, set Cs = 0, Cy = 0 to get one line y = at: 
3 
Set Cs =0, Cy, =1 to get another line y = 52 +1. 


3 
Set Cs = 1, Ch =0 to get y=e 7 + ah a non-linear member of the family that has the line 


3 
y= 3 as its asymptote. 
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: dy dy dz 9 
A702 40 N2013/1/11 — = — + — = 6" + (6t) =F. 
702 (9740 N2013/I/11)(i) Compute ieee aaa? (6t) 


So the tangent has equation y — 2t? = t (x - 3t”) or y=tr-t?. 


(ii) The two tangent lines are y + nr-p® and y2qr-q. Let R= (cp, YR). Plug + into =: 


3 3 


3 p-4q 
pir-p=qtr-q => (p-q)trr=p-q => tR= ae =p t+pqt¢. 





3 
(At <>, we assume p #q so that it’s OK to divide by p-q.) 
Next, yr = ptr-p’=p(p?+pq+@)-p=p(pqt+@). 
Suppose pg = —1. Then R is on the curve x = y’ +1, as we now verify: 
2 
yet =p’ (pq+@’) +1l=p*(p?¢? +2pq?+q')+1=p? (1-29? +¢q') +1 


=p -Iwerrpr¢t+l=ap—-2+Pt+l=ap-l+g=p+pqig? ==zrp. 


(iii) Plug x = 3¢? and y = 2¢° into x = y? + 1 to get 3t? = 4#°+1 or 4¢® - 3t? +1 20 at any 
point at which the curves C’ and L intersect and, in particular, at MW. 


Observe that #2 = -1 solves 2. So, write 4t° - 3t7+1 = (? + 1) (at’ + bt? + é), 


Comparing coefficients, 4 = a, -3 = a+ (so, b=—4), and 1=c. Hence, 
4t6 — 30? +1 = (t? +1) (4¢* -40? +1) = (+1) (2-1) (2-1). 


The real solutions to 2 are 2t?-1=0ort=+,/1 /2. Since the y-coordinate of M is positive 
(and real) and y = 2t®, we must have t > 0 at M. Thus, 


vb) Wve) 


(iv) In the given region, C and L may be described by y = 2 (x/3)'° snd g=a/o 1; 


The area under C’,, above the x-axis, and from 0 to M is 


3/2 1.5 3/2 y) on 5} 
(20) elacael at) ee 
0 3 315.95" Jo 315.25 \2 5/2 10 


L intersects the z-axis at x = 0. The area under L, above the z-axis, and from 1 to M is 








3/2 j__ 2 ier. 2 4 1 V2 
3V 2 Zz 2 
Hence, the requested area is ste — - = inv? 
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A703 (9740 N2013/II/2). Figures reproduced, but with the points D, F, and F’ added: 





(i) Observe that 2 AFD = 7/3. And tan 2 AFD =|AD|/z. So, |AD| = V3. 
Hence, |DE| =|AB|- (|AD|+|BE]) = a-2V3c. 
The equilateral triangle in Fig. 2 has sides of length |DE]| each and hence area 


5|DEPsin™ = 5 (a -2Van) “ = YB (q- 2V30) 


The prism has height x and thus volume V (2) = ue (a- 2/30) 2 


ii) Note that |DE| 4 a—-2\/3x > 0, 0% < Hence, x € fo =al 
(ii) |DE| ae WE 
V3 


Define the function V : fo ma > R by V (2) = — (a ~ 2V3x) x. Compute 
2/3 4 


V" (2) =F [(a-2VSe)' +20 (a-2V5e) (-2V3)] = F (a -2V5e) (a 632). 








a a / 
So, V'(%)=0 <— > 2%, = —= OR #2 = —~. [| | V 
2V/3 6V3 0 ty 
Since V’ (x) > 0 for all x €[0,%2) and V’ (zx) <0 for all x € («: al we conclude that V 
attains its strict global maximum at 72, and a 
J3 2 f3 a \ a a a 2a a® 
i) (aan, ) Bye Gyn) ee ae a 
V (2) | (a V3») a= \a Se: oA a (a =) = & (72)'- Ti 


2107, Contents www.EconsPhDTutor.com 


A704 (9740 N2013/II/3)(i) f (0) = In (1+ 2sin0) =In(1+0) =In1=0. 























Poa 2 cos x f"(x) =2 -sinzt ———2cos*x 
~—-142sinz’ 11 42sine (1+2sinz) 
fl" (x) =2 = COST 2 sin COs x n 4A cos x sin x ‘ A cos? x | © 
1+2sina (14+2sinz)? (14+2sinz)’ (1+2sinz)° 
So, f’(0) =2, f” (0) =2(0-2) =-4, and f” (0) = (-1+0+0+8) = 14. Altogether, 
0 / 0 i 0 1 0 7 
p(x) = 2) £0) pO) 2 Oo, os = O04 2a Qa’ + oa" +... % 
2 3 2 3 
(ii) et sine =[1+ a+ toa) +e. | [na wr) +e. | one tana?+ (SP) a4... 
Comparing coefficients, n = 2 and an = -2 (so, a=~-1). Hence, 
an Te (7 8 a 
—— -—- —— = —_- — = La = Deis 5 9 
& 5 )e-| g[=(1-g)?=-3 
: 1 
A705 (9740 N2012/I/2) (i) {oe- zin(1+a*) +c. 
co 
d 1d 
(ii) From u= 2”, — =2r or v2 a Now, 
1 1 * 1 -1 i 1 a1 9 
iw! v2 f 5 Sac? s la de 5 ten u+C => tan rt+C. @® 
(Note that = is given on List MF26, p. 4.) 
ne i. ae 4 
(iii) f () dx x 0.186 (calculator). a 














682For 2 € R, 1+.2*>0, so we don’t need the absolute value operator. 
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A706 (9740 N2012/I/4)(i) Observe that 2 ACB = 1/4-6. So, 


sin 2 AC'B = sin(7 -6| = sin (=) cose — sin @ cos (=) + (cos6 - sin@) ue 





A AB 
By the Law of Sines, — as — a Rearranging, |AC| equals 
JAB] 1 3m, 1 J2 1 
ea ed ee 02) ee ee ey 
sin 2 AC'B cae sin (71/4 - 0) or 4 a (cos@-—sin6) \/2/2. 2. cos@-sin@ 
(ii) By the small-angle approximations,°°? 
62 
cos 6 iS and sin 6 x 6. 


So, by the Maclaurin series expansion for (1+ x)", 
2 


: —— =1+(-1)(-5 -6)+ VE (_F 9) bere l tO + SO 


cos @ — sin@ 1-£-96 2 








The above holds for |-0° /2- 0| <1, which in turn holds since @ is “sufficiently small”. 
And since @ is “sufficiently small”, 6° » 0 for k > 0, so that 


} 
cos @ — sind 


Sie x 


w 140+ 50 fora=1, b= 


d 
A707 (9740 N2012/1/8)(i) Apply — to e-y=(a+y)" 
L 
d d d 4 
1-H 2(esy)(14 2) =2(e 4 y) 42(e + y) EY = 20 + 2y + (2x4 2y) S. 
dy 1-22 -2Qy 4 9 
1+ —= = 1+ = —__~____ (for 2x + 2y +1 # 0). 
Rearranging, Coie * Fs Oye cor r+2y+1+#0) 


(ii) Apply d/dz to + to get 


2 ae, 4 : 
fy 2g gt) tts ay" 
dz (24 +2y+1) dx (2x + 2y+1) dx dx 


d 
(iii) By definition, the turning point is a stationary point and so occurs where 7 =, 
ip 


But at any such point, — (140) =1<0. 


dz" 
So, by the Second Derivative Test,* it is a maximum. 


"di 





683In List MF26 (p. 2), read off the first few terms in the Maclaurin series expansions for sine and cosine. 
°84Proposition 11. 
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Z 
A708 (9740 N2012/I/10)(i) The model’s volume is k * 12h + ie 


2 
Rearranging, h 2 a an So, the model’s external surface area is 
‘fs 


2 Z 
A= nr? +2nrh + 2nr? = nr (2h + 3r) 2 ar [2(— = 5] + 3r| ee. 
we 3 r 3 


dA 2k 10 1 (10 
Compute — = -—~ + —ar=— (srr ~ 2K, Now look for any stationary points r: 
dr rs: cae 
dA 2k 10 10 3k \"8 
alex pe — oe or ge — 2k =0 or r-(=| . 


So, this is the only stationary point. 

The minimum value of r is, of course, 0. The maximum value ra, occurs where h = 0 (and 
can be expressed in terms of k using 2), = pe 
0 Ff tam 
Since dA/dr < 0 for all r € [0,7) and dA/dr > 0 for all r € (7,7max], A indeed attains its 
strict global minimum at 7. And the corresponding value of h is 





Sign diagram for dA/dr (assuming k > 0): 


h 





2k Q_ eae 4a 
7 3 


_— r= 
mr 3 57 51 


-(8)"E(8) -J-@)" 6-3-8)" 


(ii) Plug A = 180 and k = 200 into 2 to get 





180 3 ze au 2 Ter? or =m —180r +400 £0. 
Solve 4 using your calculator: r * -6.76 (reject), 3.04, or 3.72. SI 
200 2 
The two corresponding values of h are he a 3” # 4.88, 2.12. 
r 


Since r < h, we conclude (r,h) # (3.04, 4.88). 
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A709 (9740 N2012/I/11)(i) For 6 + 0, 2m, 


dy dy dr i siné _ 2sin § cos § cos § 





dx dé dO 1-cosé — 2sin? ~ sin 8 


The gradient of C at 0 =7 is 2 is = cot 5 =). 


= cot =. 
2 


d d 
As 0>0, 7 > oo; as 0 > 27, 7 —+-oo. That is, the tangents approach verticality. 
x ay 


(ii) You’re supposed to just copy from your calculator. But here as an exercise, we won’t: 


At the endpoints, (6,7, y) = (0,0,0) and (6, x,y) = Ay 
(271, 27,0), and the tangents are vertical. At 
the midpoint, (0,7,y) = (7,7,2) and the tan- 
gent is horizontal. 

Between the endpoints, dy/dzx = cot (0/2) which 
is positive for 6 < 7 and negative for @ > 71. 


(0,2,y) 
= (71, 2) 








a 
Moreover, d?y/da? = —cosec? (6/2) which is al- > 
ways negative (if defined). So, the gradient is (0,2,y) (0,4,y) 

decreasing. % = (0,0,0) = (20,27, 0) 
(iii) We have y = 1-cos@ and dz/d6 2 1-cos9. So, the area is ® 


rid v=2Tt A dé O=27 — 
dg= : oF 46 
: oo - dey Pde Pe pp: a0 


as. fee 2 an 
2 f (1 - cos 6) ao= [ 1 + cos? 0 —2cos6 dé 


27 


2m 2 1 1 
-f 1+ REE ~ 2080 a0 = [59 + = sin 20 —2sind| = 371. 
0 


2 





0 


d 1 
(iv) The normal has gradient -1 + ak i and hence equation 
dz sin p 
cosp—1 ; 
(= = ~(p- 
POOR) =~ og een) | 


For the x-intercept, plug in y = 0 to get 





Ce ae ae (p-sinp)] or sinp=x-—p+sinp or =p, 


sin p 
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d a 
A710 (9740 N2012/I1/1)(a) 2 = f “Sax - [ 16 — 9x? dx = 16x — 323 + Ch. ® 
L x 


d 3 
y= f Pan= f 16x - 303 + Cy de = 80? - S04 + Cyr + Cp, ® 


(b) For 16 - 9u? # 0 or u# 4/3, we have dt/du = 1/(16 - 9u?) and so 


1 1 


: 7 1 q 1 [ 1 q 4A+3u 
= —— du=- ——.——_ du = — ————- |In 
16 - 9u? 9 (4/3)° ~ 2 92- (4/3) 


+ 
4-—3u 


4+3u 
4—3u 








1 
C=—lIn 


ary +C.® 














Plug in the initial condition (t,u) = (0,1) to find that C = -(In7) /24. Hence, 














; 1 (i 4+3u 1 7) Lj 4+3u f we 
=—|1In —- in = — In| =F or U mee 
24 \"|4—3u 24° |7(4—3u))’ 3 
dy dy dx 2 1 
A711 (9740 N2011/1/3)(i) Compute 4 = 2 + = -4 + (2) = -=. 


dx dt dt # 
2 I 9 oo 3 
So the requested tangent is y - — = a (x —D ) Or ys=— ae 
Pp Pp p Pp 
(ii) If y =0, then 0 = -a/p* + 3/p or x = 3p’, so Q = (3p”, 0). 
ie] 0, then y= 3/9, 50, .R =(0;4/p), 
(iii) The mid-point is (1.5p’, 1.5/p) and has equation xy? = 1.5p? (1.5/p)° = 1.5%. 


A712 (9740 N2011/1/4)(i) Plug in the Maclaurin expansion for cosine: 
6 
cos’ x =| 1 cs. =1+6( ee )+—>( ae a ) + 
= x og | = 5 ag mI 5 ag oe 
2,14 xt 2 4 
=1-32 ri +15 i +e--=]—3x°+47*+... 2% 


a a 4 4 
(ii)(a) [ cos’ cde + f 1-30? +4044... de=[e-a%+ 05 +...| =a- a" + ea? +... 
0 0 


m/4 3 4 5 
So, ifa= ig then f cos® x da a (=) + (=) x 0.540. 
4 0 4 4 5\4 





m/4 
(b) By our calculator, f[ cos® x dx * 0.475. ® 


Using the first few terms of the Maclaurin series as an approximation would work well if 
m/4 were near 0. But it isn’t and so this approximation doesn’t work well. 
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A713 (9740 N2011/I/5)(i) Where x <0, y = f (|x|) and f are reflections of each other 
in the y-axis. Where x > 0, they are identical. 


A A 
Y Y 


y= fies 


(0, 2) 








2=%| tore < 2,%, 
2-a| forz>0, “ y=|f(x)|= ™ 

y= f (\2|) = . %=2) for xe>2. 

2+2| forxz<0. 








Where f(x) < 0, y = |f(%)| and f are reflections of each other in the z-axis. Where 
f (x) = 0, they are identical. 


(ii) x ¢ [0,2]. 


1 0 1 a ey 1 
(iit) f f (jel dx= f 2+ede+ fi 2-adz =|20+ =| + [20-5] 13 
-1 -1 0 2 44 2 Jo 


For any a > 2, we have 


a 2 a v2 72 
dx = f 2-x)d 1. -2) dx =|20- =| 5 -22| 
f le @lae - i) z+ J (a jida: x a |, +s an 
1 2 2 
=(4-2-245)+(S-20-244)2 2042. 
2 2 2 2 
1 2 ar 5 
Set = and = to be equal: ade ie or a? - 4a -1=0. 
Ay (Ay S=4A(1) (0 
By the quadratic formula, Pec tie ae re 


2(1) 
We can discard a = 2-V/5<2. So, a=24+V5. 
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A714 (9740 N2011/1/8)(i) For 100 - v? #0 or v # 10, 









































10+ 
= ar +C4. 
noes oe ° 
dt 1 i 
ee 2 = = 
(ii)(a) For 100-v* +0 or v # 10,  Gopaee 1059-32" 
i) 1, |10 10 
So, t= f Sav= f 10 dy 2 =n | TY) 4 Cy or 2(t- Cy) = In ie = 
io v2 2 |1l0=% 
or, rae = eX(t-C2) — 9-2C2Q% St 10+v _ en 2C2e%t = Oye? 
10-v 10-v 
10+ 20 20 
=<] = C3e"" =10= 2 
on 1) = “T0-0 a C3074 +1 


Plug in the given initial condition (t,v) = (0,0) to get 0 = 10- 20/ (C3 +1) or C3 =1. 


20 
Hence, v(t) = 10- are (t > 0) and 








2 2 1 
y=5 > 10-5 a 5a ot +1=4 — > eM =3 — > t= =1n3. 
e7f +] e4f +] 2 
= 


(ii) (b) v (1) = 10- 





(ii) (c) lim v (= 10, 


A715 (9740 N2011/II/2)(i) The box has length 2(n-), breadth n-2z, height x, and 
hence volume V =2(n-2)(n-22)2=2 (n? — 3nz + Day? } a = 2n*2 -6nz? + 42°. 


(ii) Compute dV /dx = 2n? -12nx+ 12x”. Any stationary points % are given by 


dV 


aa 0 or Qn? —12nz + 1277 = 0 or n? —6nz + 62%? = 0. 
DU \x=2x 





By the quadratic formula, 


6n + + \/(-6n)° -—4-6n? _ 8n+ V9n? - 6n2 1 J3 (14S )n 


a 2-6 6 =o geo 





We discard the larger value of x because it implies negative breadth: 


bie pee oe 
2 3 | 
3 
Thus, the only stationary value of V is x = ( - a - 
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A716 (9740 N2011/II/4)(a)(i) Use Integration by Parts twice: 
° 2-22 2 = -22 = -2¢ 7 1 2-22 2a 
ff ate dx =|2?(—)e%*- f 22(—)c dz| =|-=2“e + f we da 
0 2 Z 0 2 0 
1 —] —] , 1 1 i 
= |-507 +2 (=) ote [ (=) ee de] = |-507" - ne - | 











ik fe ib 1 
= |-ze™ (20? +22 +1)| =--e°" (2n? +2n+1)+-. ® 
4 0 4 4 
(a) (ii) fo ee dz = lim |-ze (2n? +2n+1)+ | ae 
0 noo A 4 A: 
1 dt a... .¥ 
(b) From x = tan 6, we have 79 7 8° 6. The volume is 
1 i ae 2 1 4 2 1] 2 
nf ydx=n f (=) dx+n f Cant dra f Bey 
0 o \a?+1 0 (tan? +1) 0 (sec? @) 
2 z=1 16 tan? 6 do d-n/4 16 tan? 0 mya 
2 awed Yo ——"_" d@ = 16 [ "ade 
xf sec? 6 me Op, sec? 6 i | ia 
m/4 il m/4 
-8n f 1 - c0s20.d0 = 8r[1— > sin 28] =2n(1—2). ® 
0 0 
A717 (9740 N2010/1/2) (i) e” = Leos sa... and 1+sin2x=1+ 27+... 
: il 2 5 2 
So, e (1+ sin2x) = (140450 +... )(42n4...)=1430450 +... t 
4x\" 4 -1) (4r\’ 4 -1 
(ii) (1+=] =1enae 4 aD (2) peed Be ODay so 
4 -l 4 4 
Ss. opis] Ang , 2) = J 
a 4 9 9 2 


d 
A718 (9740 N2010/1/4)(i) Apply an to 2? -y? +2xy+4 +0 to get 
£ 


dy dy dy 2 dy x+y 
2x — 2y— + 2y + 2x— =0 2(c£-y)—=-2 — = —_. 
w-2yT + 2y + 2aT —> (x ¥) 5. (c+y) <=> co 
(ii) Tangent parallel to x-axis => dy/dx = 0 —? y 2-2. 


Plug = into = to get a — a? - 20? +4=0 or v? =2 or r=4V2. 


So, the two points are (+ 2, +2), 





Colt ge = y, then the given equation becomes 2x? +4 = 0 which has no real solutions. Hence, x # y and at 
2 
<> , it’s OK to divide by y - =. 
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A719 (9740 N2010/1/6)(i) 6 » 0.347, y ¥ 1.532 (calculator). x 
y 4 3 
(ii) [ a —3x+1dr -|F- 5742] ~ 0.781. ® 
8 42 8 
(iii) The curve and line intersect at 2 =—/3. So, the shaded area is 


0 ak 9 3. 9 
3_39d -|5-5 1 er ee 
Lig? Oe ge A = 


(iv) At the turning points +1, the corresponding values of y are 








Gy =3G)sies amd PHsi4i=<1, 


The equation x® — 32 +1 =k has three distinct real roots if and only if k is strictly between 


the above two values. That is, k € (-1,3). 


A720 (9740 N2010/I/7)(i) The differential equation is d0/dt = k (20 - 0). 
Plug the initial condition (t, 0,6") 2 (0,10, 1) into = to get 1=k (20-10) or k=0.1. 


dé dt 10 
So Gy = 0.1 (20 - 6) If 6 # 20, then — Frey ee Hence, 
——— d= - 10 f sy ae=-101 6-20] +C, 
rhe Iw 20-0" n|@-20|+ Ci 
or, 0.1(C, -—t) = In|@ - 20] or \0 — 20] = cOUCr-A) = 60-101 g-O.1t 
or, 6 — 20 = te0-1Cig-0-1t _ Coe Ot on 6 3 20) + Coe Olt, 


Plug 2 into 2 to get 10 = 20+ C.0 or Cy = -10. Hence, 6 = 20- 10e°". 





(ii) 0=15 — > 15=20-10e°" A 4 
p=an e70-1t =(5 
20 pesesressceesseseeseseesescessecesseceeseccessssensnsenseseaneecenseceasaes 
<> -0.1¢=1n0.5 
=> f= 10ln2. 0 (t) 
10 


_} —1Me-0-1t) — 
lim 6 (t)= lim (20 -10e~°") = 20. 


t 
re 
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A721 (9740 N2010/I/9)(i) The box has volume 3x7y = 300. Rearranging, 


The lid’s external surface area is 
2(kxy + 3xky) +322 = 8kry + 322 = poe + 327, 
x 


The box’s is 2 (xy + 3xy) + 3a? = Sry +322 = a ar 
r 


800 
So, the total external surface area is A=—(1+k) +62’. 
z 
dA 800 ce 
Compute an ee (1+k)+12¢ = 2 [122 — 800 (1+ kK): 


So, any stationary points 7 are given by 


dA 
ay 


=() or 127° — 800(1+k) =0 or r= [Maem] 
- als , 





L=2 


So, there is only one stationary point 2. 


Since dA/dx < 0 for all x € (0,%) and dA/dz > 0 for all x € (%, 0), we conclude that A 
attains a strict global minimum at 7. 








y 1 LOO 3 
(ii) Let y be the value of y that corresponds to z. Then - ‘ oe (1th) 
1 3 3.3 
iii) ke (0,1 1+ke(1,2 1 = EE 
(iii) ke (0,1) <= > 1+ <(L2) —= ayes) saan ‘lF5) 
iy) ey = L = f=— 
“a 2(1+k) 7 
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A722 (9740 N2010/I/11)(i) For t 0,1, 


dy _ dy dx _1+t? +1 
dx dt ' dt 1-t? #-1° 





So, provided p # 0,1, the tangent at point P has equation 


a a) RR ee ca 








6 2 2_4)2 2 
oe Pa me (ee a) a ee 
D D D 


(ii) Plug y = x into + to get (p? + 1)x - (2 — 1)x =Ap or 7 = 2p. 50, A= (2p, 2p). 
. 1 9 D) 2 2 2 
Plug y = -x into = to get (p +1)x+(p -1)x=4p or r=-—. So, B=[-,--}. 
Pp P Pp 
The lines y = x and y = —2 are perpendicular and pass through the origin O. So, OA and 


OB are perpendicular. Hence, the area of AO AB is 
1 1 | 8 
sloalion| = 5 }V en) + 2p)” - Vary | G4 


be 











which is independent of p. 


2 
(iii) Observe that 7+ y = 2t and x-y= - So, 27-y? =(x+y)(x-y) =4. 
This is an east-west hyperbola with «x-intercepts (+2,0) and asymptotes y = £2. 
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e  2+2)+e sr+4 
o/xr+2 oS +2 O/n+2. 


Any stationary points Z are given by y’(Z) = 0 or 3%+4 =0 or & = -4/3. So, this is the 
only stationary point (and hence also only possible turning point). 








A723 (9740 N2010/II/3)(i) 2 fave 


Since y’ (x) <0 for x < z and y'(x) >0 for x >@, Z is a strict local minimum. 


Since Z is both a stationary point and a strict extremum, it is a turning point.°*” 


(ii)(a) y2=22(x+2) — > yssrVr+2. 


3x +4 4 
an So, y’ (0) = +—= = v2. 
ci) 22 





From (i), ov =a! (xr) = 3 


(ii) (b) A (iii) A 


Vertical asymptote 
Re=2 









































| i Dee1~ 17, 3 2\ 1 
A724 (9740 N2009/I/2). [ d -| | i (1 | )- In3 
( Le Jy ogee 5 eal, ay 1. OP a ® 
2p 1 pl/2p 1 1f. 42 HP 4 iloa 
[ ———ax-=- f —— (7 = — sin cand =-—sln -=—. ® 
0 V1 - px? p Jo V/1/p? - x? D 1/P J, D 2 6p 
i 1 Att 27 
S hee epoca a 
7 4°" 6p >> 63 3in3 





686See Proposition 9 (First Derivative Test for Extrema). 
®87See Definition ??. 
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A725 (9740 N2009/1/4)(i) f (27) +f (45) = f (3)+f (1) = (2-3-1)+(7- 1?) =54+6= 11. 


(ii) (-4,7) ty (4,7) (8,7) 
(0, 7) 





(= 6) 


(-6,3) (-2,3) (2,3) (6,3) (1053) 








(iii) [ forar= [tears [pears [p@ar+ ft @ ax 
=f tayans [ r(ayars [ p() dee fo F(o) ax 
=2f- f(r)de+ [f(y det F(x) de 


2 4 3 
ao 7T-da+ f 2e-lde+ ff ede 
0 2 2 


Dale 2 2 
= 2| 70 - 50° + [22-2], + [22-2], = 225 +12-24+6-2 = 362. 
3° Jo : : 3 3 


A726 (9740 N2009/1/7)(i) f’ (x) = e°* (-sinz), f" (x) =e (-sin x)’ +e°* (—cosz). 
So, f (0) = e°° =e! =e, , f’(0) =e" (-sin0) = 0, and f” (0) =0-e!-1=~-e. Hence, 











0 ‘(O Pag - 
gone 2 AO) EC ee Oh Jap zet Ont Sate z0- Sa... at 
i 2b 8b2.x? 
ii) Let g(x) = ——_.. Then g’ (x) = -————,, and g” (x) =- 
(ii) oo a+ bax? oo (a +bx2) ) (a+bx?)? (a+ bx?)° 


So, g (0) = 1/a, g’ (0) = 0, and g” (0) = -2b/a?. 
Hence, f (0) = g(0) or a=1/e; and f” (0) = g” (0) or -e = —2b/a? = —2be? or b = 1/2e. 
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A727 (9740 N2009/I/11)(i) A % 














f 
(ii) 7’ (2) = e® +2e (-22) =e" (1 - 2x”). 
7 @)=0 <> eee : So, the two stationary points are (. : a 
= =+—. So, wi = ah 
v2 ° v2 V2 
Each is also a turning point because f’ changes sign at each point: 
| | 
i} 1 
v2 V2 
(iii) From u + x”, we have du/dzx 2 2x. Now, ® 


i nee: 1 ae 21 vn du 1 usn® _ 1 Jvage 2 
“det f "de? f ut -5f ide (oe =_ (tae), 
from dat fp cotda2s fo evizae=5 f, etdu=5l-ewl =5(1-e™") 








a 1 1 

Hence, the requested area is lim [ ze dx = lim = (1 - e™) =-, 

n->oco Ji NL Co 2 2 

2 2 2 2 il 2 
(iv) By symmetry, [ [ren aeS2 i. ne dx=2: 5 (1 -e” ) ee ee & 
23 0 
1 1 2\2 
(v) nf y’ dx = mf (xe ) dx x 0.363 (calculator). ® 
0 0 
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A728 (9740 N2009/II/1)(i) Observations: 
1. v=t?+4t=t(t+4). So, x is minimised at t = -2 and is strictly increasing for ¢ € [-2, 1]. 
2. y' (t) = 3t? + 2t=t(3t+ 2). So, y has turning points at t = -2/3,0 (or 2 = -20/9,0). 














(x,y) = (-4, -4) 


” dy dy da 3t?+2t dy 3:27+2-2 16 
Gomme Se 6g |, eee oy: A 
OD Sone ae a a ae Oma, ean 





y—(22+2?) =2[2-(2?+4-2)] or = y-12=22-24 0 or Sy 2-12. 


(iii) Plug x = t?+4t and y =t?+?? into 4 to get +47 =2 (? + At) ~12 or #8 -#-8¢+1220. 
We already know that t = 2 solves 2 because P is an intersection point. So, write 
t?-#?-8t+12=(t-2) (at? +bt+c). 
Comparing coefficients, a = 1, -2a+ b=-1 (so, b=1), and -2c = 12 (so, c= -6). Hence, 
t?-t?-8t+12 = (t-2)(t?+t-6) = (t-2) (t-2) (t+3). 
So, the only other intersection point is at t = -3 and 


Q = ((-3)? + 4(-3) , (-3)? + (-3)?) = (-3, -18). 
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d a 
A729 (9740 N2009/II/4) (i) = - <r dt = ‘i 10 - 6tdt = 10t-3#2+C. 
n= SH at= f ror are Odi= hr =1 2 CL 4D. 


The remainder of the answer for (i) is no longer in the current 9758 syllabus. 

Plug the initial condition (t,n) = (0,100) into = to get 100 = D. 

So, the family of curves is n = 5t? - t? + Ct+ 100. 

Sketched below are three members of this family, corresponding to C = 0, C= 1, and C = 2: 


A 


Y 


100 





C=s 







Three members of the family 
w= 5 =F YCt 100: 














1 
(ii) For 3- 0.02n + 0 or n + 150, we have ae eae and 
dn 3-0.02n 150-n 
dt 50 
t= [ —dn= in=-50 f — n= -50In|n - 150| + £, 
dn” J 150-n “750° ee 


or, In|n - 150] = (E -t) /50 or |n — 150| = ef? or n — 150 = 2 /eH/50 = Fre t/50, 
Plug in the given initial condition (t,n) = (0,100) to get -50 = F’. Hence, n = 150-50e!/°. 
As t > o, n > 150. So, the population will approach 150 thousand. 


2 
A730 (9740 N2008/I/1). The dotted area is - Jy dy = E i] = 3 (8-4). 


2 a ae 7 
The shaded area is [ ax? dx = 52" =3 (8-1) = = These two areas are equal: 
is 1 
2 sex: a on 
— — = _— = = ae x 2. . 
3 (8 a ) 3 = 8-a 5 —> a=|5 73 
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3 3 
<7 de = 5In(2? +1) +0. 


(ii) Plug in the initial condition (x,y) = (0,2) to get 2=C. So, y=1.5In(x? +1) +2. 
(iii) As x > +00, dy/dz > 0. 
(iv) 


A731 (9740 N2008/1/4)(i) y = [ oP a = i. 





Three members of the family 


y=Sin(o?+1)+C, 








A732 (9740 N2008/1/5) (i) 


V3 1 pilv3 1 i v3 1 m 
———d -= f —_—.—— dr = = [3tan13 = - tan! V3 =—. 
ree =5 J, (3422 a 9 © 


(ii) Use Integration by Parts: 


e 1 . a | 1 1 i 
f oe” inde = | In | - f —_y7"!_ dr = geet =.) = gel 
1 n+1 1 1 n+l a n+1 


(n+1) 
1 (gnt_ymet) Dem el emtt nett +1 


(ie i) Cie : 





1 n+1 _ 
n+l (n+1)° 











88Since x? +1>0, we don’t need the absolute value operator. 
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A733 (9740 N2008/I/6)(a) By the Law of Cosines,°°? 
AC? = AB? + BC? -2(AB) (BC) cos6 = 1+9-6cos6 = 10-6 cos8. 
So, AC = V'10 - 6cos 6.5 Plug in the small angle approximation cos @ » 1— 67/2 to get 
AC » 10-6(1- 5) = V4 + 362. 


And now, by the first “standard” Maclaurin expansion, 


1/2 
VI+3R = 2(1 +70") =2|1+ (5 0). |= 2+ 70 +.. % 
- 


(b) f(x) = 2sec? (20+ =| and f” (x) = 8sec? (20+ *) tan(2x +7). 
f (0) = tan 7 =1, f' (0) = 2aec? 7 = 2-24, and f"" (0) = 8sec? F tan 7 = 8-2-1= 16. 


162" : 
Hence, f(x) =1+4r+ +--=14+404+8r"4+... % 





A734 (9740 N2008/1/7). The straight parts have length «+ 2y. The semicircular part 
has length 7tr/2. So, the total time to build the wall is 


3 (a + 2y) + on = 180. 


Rearranging, y 2 30-- (2 + 37) 7 
Next, the total area is 


TT 
Az=xzy+— 


2 2 2 2 
5 (=) + 30x -(2+3n) +0 = 30x - (4+ 5m) —. 


2 
This is a quadratic polynomial with negative coefficient on x”. Hence, A is maximised at 


a eee Cae 7 
~ 8a -2(445n) 22/8 4450 7 


From é the corresponding value of y is 


y =30- ~ 12.6. 














2+37 120 _ ( - 7S) - 0S - +7 


s 602 
4 4+57 4+57 4+57 4+ 57 





®89Proposition 7(c). 
6°\We can discard the negative square root since length must be non-negative. 
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A735 (9740 N2008/II/1) (i), (iii) > A 







(ii) fsine =(1¢2+5+...)(e-...) 


7 
=B+E ++... aS 


(iv) If (x) - 9 (2) = 0.5 


a 
e* sin x — (2 +2? r =| =0.5 


—>> 








— x» —1.96 or x » 1.56. ae 


If (v7) -g(2)] <0.5 — > -1.96 <2 < 1.56. 








A736 (9740 N2008/II/2)(i) The upper half of the curve C’ has equation y = VzV1— 2. 


1 
And so, by symmetry, R = 2 [ rV1-adz x 0.999. ® 
0 
(ii) From u 21-2, du/dx 2-1. And the volume is ® 


1 1 1 v=1 
x yde=n f wJi-ade2n f (1-u) Jude Sn f CY Tee 
r= a6 


dé 
iv 2 ak > 25. A 
= mf uP — MP du = | 2u8 — 28] -n(=-=} = on 
(iii) Apply d/dz to £ to get 


dy _ 1 (vizz- 1 \- 1 a 2-32 
dt 9/ 2/1 —a 2V/1-2} Afl-aVaeV1-2 4/1 -aVnJ/1—2 


So, dy/ de| _=0 <> £=2/3. This is indeed the global maximum point because dy/dz > 0 
for z € (0,£) and dy/dx <0 for x € (, 1). 
A737 (9233 N2008/I/2). For small x, cos 2x” x 1- (2r)* /2! = 1— 2x? and 














1 / =| 1 
=(1+2? a 1 (= )o?=1- 52° 
a oo a 
cos 2x | Es) 
BO, wx (1-22? (1-50) » L277 af 
V1 +2? ( ) 2 “2 
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A738 (9233 N2008/1I/3). Use Integration by Parts: 


1 1 1 1 1 1 i 1 1 3 
-22 = —22 —22 = -2 —22 = -2 
Fi “Le dz =—-5 [we hts e dz = se "7 [e | = 1 mn . 


A739 (9233 N2008/I/4). From t =Inz, we have dt/dx = 1/2. So, 


3 


e i 12 re 1 dt t=3 J 1 2 
ee 2 f zat=|-s| a2. 
I xv (In xv) i xr=e Rag = fi. : 


A740 (9233 N2008/1/6) (i) 

















for x >a, 
for © <a. 
Z 
> 
Se 1 2 
(ii) This is simply the area of two triangles: 5 la ee) ees 5 (b- a) =a +. 
A741 (9233 N2008/1/8) [- a 5 ay re ® 
, x an’ —| =—-—tan™~ -. 
4+? 2 2le 4 2 Z 
V3/2 32a na Tt 
= | ail es ere 
te sz de = [sin tlh aia te & 
If the two expressions are equal, then 
1 7 
qo ptan toss —= = = tan” 5 —=> a= 2tan 7 = = 5V3. 
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d d 
A742 (9233 N2008/1/10)(i) From ys xz, we have 7 24 oe. 
x x 


dy 3 9 


Plug + and 2 into ry, = 27+ y? to get 


vz (2+e) =a? +022 or 0=2 (2422 -2-1)-2 (22S -1). 
ade dx 


d 
So, if x +0, then ae =1. (If « = 0, then 3 becomes y=O) 
x 


dz 1 
(ii) Assume x #0. Rearrange £ to get 2 =—. And now, 
a a 


[ ae = f <a or f[ edz= f ae, 

£ z 
or, z* =2In|z|+C;, or z2 4/2 In|2|+C. 
Plug 2 into = to get y= +42\/2In ja|+C. 


Plug in the initial condition (7, y) = (2,6) to get 6=+2V21In2+C. Since 6 > 0, it must be 
that y = 7/2In|z|+ C. And now from 6 = 2V21n2+C, we also have C = 9 - 21n2. 


0, for = Q), 


xv/2|n |r| + 9- 21n2, for x + 0. 


Altogether, Y= 


= 24(—oj 
A743 (9233 N2008/1/13) (i) << eo ao’ ay 
dy dt = dt 3sin* t cost sint 


cost 

So, the normal is y - sin? t = ad (x - cos’ t) or xcost—ysint Z cos*t — sin‘ t. 
sin 

(ii) cos*t - sin*t = (cos? t + sin” ¢) (cos? t - sin? t) = (1) (cos 2t) 2 cos 2t. 

(iii) Plug 2 into + to get xcost—ysint 3 cos 2t. 


If y =0, then xcost = cos 2t or x = cos 2t/ cost. So, A = (cos 2t/ cost, 0) 
If x =0, then -ysint = cos 2t or y = — cos 2t/sint. So, B = (0,-cos 2t/sint). 


cos 2t\? cos 2t\? 1 1 sin? t + cos? t 
|AB| = ( +(- : ) = cos 2ty / —- + = cos 2ty | ———— 
cost sint cos2t sin? t sin’ t cos? t 
cos 2t 


1 
= 2t ————_- = ————__ = 2cot 2U. So, = 2. 
a sintcost 0.5sin 2t i (So, ) 








(Note that since ¢t € (0,71/4), sint, cost, cos 2t € (0, 1).) 
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A744 (9233 N2008/1I/14)(i) Let P (k) be this proposition: 
L=(a1)a she 
(1-2)? 
i (e sie tate (ead): 4 
(1-2) (=a) (=z) 
We next show that for all 7 ¢ N, if P(7) is true, then P (7 + 1) is also true: 





14274327 to pke = 





We show that P (1) is true: 


-PY) 1-(j +1) 2 + jas} 








14204347 +---4+ jrIt+(j4+1) 2! (i-2)2 + (j+1) 2 
— 14 (+a [(1-2)?-1)4 jai! 14 (9 +1) 2d (x? - 2x) + jai"! 
- (l-a)* - (1-2)? 
1-(j +2)adtt + (j +1) a3? 
= x 
(laa)? 


d 
x 











1- nm = n — nm 
= © (era? 448 +42" 4C) = = (« = +C)=- = ve [na Sa Wiese | 








ole dz \" 1-2 l-«2 aa ae. 
__ (1 - 2") (loa) -na" (1-2) + £3") © L-(n+l)ar+nart 
(1-2)° Gaz 


4+6 4-6 
A745 (9233 N2008/II/1). cos 4a -—cos6x = -2sin (2) sin (=) 


= -2sin 5x sin (-x) = 2sin5xsinz. 

















n/3 1 73 iTand 7/3 
f[ sin dz sing dr = 5 f[ cos 4x — cos 6x dx = 5 EE a2 |, = ae @ 
A746 (9233 N2008/II/5)(i) Define f : (-1,00) > 0 by f(x) =In(1+2) - =. 
He 
1 (w+2)-2-2¢-1 (x+2)’-4(1+2) x? 
So, f (x) = > 2 = 5 = a 
1l+z (x +2) (1+2) (x +2) (1+2)(r+2) 


Indeed, f’ (x) > 0 for all x ¢ Domain f = (-1, 00).°9! 
(ii) Observe that f (0) =0. Since f’(x) 20 for all x ¢ Domain f = (-1, co), for all x > 0, 





ou 25: 
f («) =f (0) or In(1 +2) - —~ 20 or In(1+2) > 
GZ G2 
61T¢ turns out that more generally, the domain of the natural logarithm function In is C \ {0}, that is, 
the set of all complex numbers excluding 0. In which case, it is perfectly possible that 1+a <0 and the 
conclusion to which this question leads is false. 
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A747 (9740 N2007/1I/4)(i) For 2-3/ #0 or I # 2/3, we have dt/dI = 4/(2-3I). So, 


dt 4 4 

—al= f ar=-4 f —ar- -* in |3I - 2+ Cy, 

dI ay 37 =9 pera ee 
or, In |3I - 2| = 3(C, -t) /4 or 3 - 2| = e(Or-t)/4 = Q3Cr/4Q-34/4 
or, Sf Deseo =C56" or T=Cye"" # 2/8. 


Plug in (t, I) = (0,2) to get 2 = C3 + 2/3 or C3 = 4/3. So, [= 2(2e°4 +1) /3. 


(ii) Jim T= 2/3. 


A748 (9740 N2007/1/11)(i) > ya 
When t = 71/2, 


(x,y) = (0,1). 






(i 1) < a _dy , dx _ 3 sin? tcost _ 3 
dt dt ~ 2cost (—sint) - 
So, the eat is 





sin t. 


3 
y —sin® 6 = 5 sin €: — cos? 0). 
x = cos’ t, y= sin? t, 


If y = 0, then —sin? 92S sin 8 (19 - cos? 0), for 0 <t< 1/2. 








2 1 
rq = 3 sin 7 + cos” 6 = = (2sin’ 0 + 3cos* 0). : 
= = 13 2 » 
If « = 0, then yr —sin®@ = 5 sind (- cos 0), Whent=0: 
il ay) =(1,0). 
snsabbe Lett =haninabaciate (2,9) = (1,0) 
So, the area of AOQR is 
i Th 5, a a4) 12 2 2 i a 29)? 
~rQyr = ~~ (2sin” 6 + 3cos”) — sin @ (2sin? 6 + 3cos?@) = — (2sin? 6 + 3cos’6) . 
2 23 2 12 
z=1 dr dt t=0 qd 
(iii) The area is f ydz = i, 7 yo ra = ya 
= f sin? t-2cost(—sint) dt = 2f™ cost sin* t dt. 
7/2 
From u = sint, we have du/dt 2 cost and 
m/2 4 12 t=n/2 au y) 
2 [ costsin* tdt = 2 df = 2 u‘ du = 2| =u | = ® 
0 t=0 Z 
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A749 (9740 N2007/II/3)(i) Let f(x) =(1+ 2)". 
f'(v)=n(1+2)"", f" (2) =n(n-1)(1+2)"”, and f” (x) =n(n-1)(n-2)(142)"”. 
f (0) =1, f’ (0) =n, f” (0) =n(n-1), and f” (0) =n(n-1) (n- 2). So, 


Gea =a me 8 pi Eo 








2! 3! 
3 
(ii) (1+ 2a”)? =141.5(207)+---=14307+... a 
(4-2)#=8(1-72)! 
31 2 31-1 3 
= 1+5(-3x)+ 22 (-5") + 222 (-5°) $n 26 8ne le ee, a 
2\ 4 2!\ 4 2 A 16 128 
So (4-0) } (14202)? = (8-304 Fu? + os )(. +32? ) 
, < 58 a or 
3 1 3 127 
= 8- 30+ Tea’ + Toga" + 24a" — Oa" +--- = 8- 3a + 24-a Bowl +... ra 
(iii) The expansions are valid for |-7/4| <1 AND |22?| <ll or |x| < 1/V2. 
A750 (9740 N2007/II/4) (i) 
5/3 5m/3 1 — | ji 5n/3 5 3 
f[ sin?arde = a ae = |e Fain 2c] a @) 


pa 5/3 5 3\ 5 3 
f[ cot ade =f L-sintaede = fe]gn® - (4) - @) 


(ii)(a) Use Integration by Parts twice: 


m/2 n/2 m/2 m/2 
R= [ x’ sin x dx = a (—cosx)], -f 2x (-cos x) dx=0+2 f x cos x dx 
0 0 0 


= 2[esine - [ sinwae] 3? = 2[xsinz + cosx] a? = 2(2 +0 == 1) =71-2. ® 
m/2 
(ii) (b) mf (2? sin r) dx » 5.391 (calculator). ® 


A751 (9233 N2007/1/2). (4+ 3x)? = 42 (1+ 32/4)?. 
The first negative coefficient is the 4th Maclaurin coefficient: % 





53°3°3°(-3) (3) =25. BP (8) 8 hee on 


4 a) \4) 12 256 4256-1024" 
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A752 (9233 N2007/1/3). V ~~ 2d oe d ® 
( jane): =n f : vem f 14422" 


v3/2 1 
——,——- dr = 
2 (1/2) +2? 4 


it =i stents {oO Tw 2 
= (tan! V3- tan 1)=5( )- 6o1e 


A753 (9233 vayon() From t= sinu, u2sin7!t and du/dt 2 1/cosu. Now, 











| 
f (sin™' ¢) cos| (sin!) t) pW? _ucosu _ ucos u? re: f ucos u? a4 
V1-#? V1-sin? - Jcos u| coved 
du 1 2 
ie eaten +C. ® 


(At =, we can remove the absolute value operator because cos u € (0, 1).) 


1 (sin t) cos (sin t) | il ra | \2 1 7 
(ii) f[ at = | 5 sin (sin) | -5]sm(3) -sino?] = sin =. ® 


1-# 0 1 


A754 (9233 N2007/I/10)(i) For x € [0,27], cosx = sing <> ax =7/4,57/4. From a 
sketch, we see that cosx > sinx on the left of the first intersection point and the right of 
the second—i.e., x € [0, 71/4) U (57/4, 271]. * 


Qn Es an Qn 
oe . 4 . 4 . . 
(ii) [ |cos x - sin z| de = [ cose sine da + fi sina ~coseda+ [, cos x — sinxdx 
0 0 a lus 
4 


4 


= [sin x + cos rae +[-cosa - sin a + [sin x + cos lee 


V2 V2 VJ2 2. 9/2 4/2 V2 9/2 
= 0-1 ere Eon er nav ® 


5a +4 A _ Bare 

(x-5)(a2+4) 2-5 2 +4 
Az? +4A+ Ba?-5Bx+Cx-5C (A+ B)a?+(C-5B)r+4A-5C 
(x — 5) (a? +4) 7 (x — 5) (a? + 4) 


A755 (9233 N2007/I/11). 





Comparing coefficients, A+ B 2 0,C-5B 2 5, 4A -5C' 34. 


Solving, we have C =0, B=-1, and A=1. So, % 
7 5x +4 | & 1 : 
we | -f —— - ——d -(1 ~5|-=In(2? ‘)| 
(x — 5) (a? +4) Yi e-5 g4+4~ Bel pints * )I, 


J5 1 
=In—-=-l1n8. @) 
4../20 8 
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1 1 
ae ee ne 






































d 1 d? 
A756 (9233 N2007/1/13)(i) 7 = ; seca tana = tang. 3 = sec’ x. 
d3 d?y d 
sag = 2s0c” x tan xv = 2 
d‘ d+ 
(ii) 4 = 2(2sec” x tan? x + sec! zr). So, sal, =2(0+1)=2, 
(iii) y= 0 Foleo 72? agaleag = 1 and qoal,_. 70: So, 
1 2 dl | 
In (secr) = 0+ 024+ —27+04+ Sat+---= <9? + —a4+.... at 
2! A! 2 12 
Gi) 1 (n\* vw mt 
Ind =21 al — —|— —.. 
(analy? n (see F) "= 25 (7) +3 (9) |- 16 1536 
A757 (9233 N2007/1/14)(i) Apply d/dz to x? -y? = Az to get 
dy d = d og — ew 2 42 
2x - dy = A or 2¢- ee y" or = oe (for x, y #0). 
dg dx ve da 2y ey 
(ii) From y 2 vx, dy/dx 2 xdu/dx +v. Now, 
dvsdy ary ye Ue --( 2u : )--2*8 
dx dx wtt+y aw 4u2ax2? 1 + 0? Ley? 


ldx 4 1+v? 
iii) R a eae al eee _f fave f aac8 Ine +C 
(iii) Rearranging sae Spee ar g=Inxe+C. 
lidg 1+v? 

Al ——dv= | - 
J edu 30 + v3 





dv £ -;In Bu + v"| +C. 


| 
Putting 2 and 2 together, Inz + C3 = sr In )3u + v"| or edn e+Cs) — |3v + v|, 


3 


= +e 3(ina+C3) =~ +e 33-3 = Cen, 


or, 3U+U 


or, C= (30 + v) outs (3v + v?) av? = 30x? + va? = 8a7y + y?. 


A758 (9233 N2006/1/7). Let V (t), h(t), and r(t) be the volume (cm?), height (cm), 
and radius (cm) of the cone formed by the liquid t — after the start of the experiment. 


Given the angle 45°, h =r. So, V(t) = or (t)]? A(t) = =(h (ty). 
1/3 90 1/3 
At the 3-minute mark, V (180) = 390 — 180 x 2 = 30 and A (180) = E V caso) = -(=) 


Next, compute V’ (t) = 1[h(t)]’h’ (t). We are also given V’ (t) = -2 for all t. So, 


viqso) 2 
m[h(180)]? 7 (90/r)7/3 ~—-71/8902/8 





h! (180) = 
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d 
A759 (9233 N2006/I/8). Apply —— to 322 + ry + y? + 33 to get 
45) 





dy dy dy 6n+y 
6 —+2 0 —=- ; 
eee a dg. = dx x+2y 
dy 2 
So, ano —> 62+ y=0 —> y = -62. 
a 


Plug 2 into = to get 322 +2 (-6x) + (-6r)* = 33 or Box = 33 or x=. 
So, the two points at which the tangent is parallel to the x-axis are (+1, +6). 
d 1 —sin@ 1 sind 


A 233 N2006 /I is _ _ 7 
760 (9233 N2006/ /9)(i) 5 vs0c 8 = aca a ee sec 0 tan 0 








d 
(ii) From « = sec 6 — 1, i 2 secOtand and x? +2r 2 (x + 1) —~1=sec? 6-1. 


1 1 1,3 1 } 
— tg [ a= f —__—____ da 
i (x+1)Vax? 42x V2-1 sec OV/sec? 6 - 1 V2-1 secOV tan? 6 


1 i yl l > rel dé b=n/3 x3 7 
= sec 6 |tan 0| . V2-1 sec O tand . r=V/2-1 a p=n/4 Jr 1 


(At 5 we can remove the absolute value operator because tan 0 > 0.) ® 


l+e-27% £#A  Bz4+C 
(2- z)(l+@2) 2-2 1+2? 
_A+2C+(2B-C)a+(A-B)2? 
7 (2-2) (1+27) 


Comparing coefficients, A+ 2C =1,2B-C=1, and A- B=-2. 





A761 (9233 N2006/1/12)(i) 


lag=2e 4 4 ott 
(2-a2)(1+a2)) Q-a@ 142? 


(ii soe -$(t-§) =-gf+cn(-4)- C Oe =) (- Sy | 





Solving, A=-1, B=1, and C=1. So, 





a a 
S----y-=7°-.... % 
Sa 8 
rt 9 9 3 2 
Te FD DSC Des +...J=(v+1)(1-2 Pecg) MERE ya55 * 





Plug 2 2 and 2 into = to get 





i — 2x? i oe | 1 1 
— -(- ~ 7 - va ..)+(leena?+...)=540- Fate... at 
(Q-z)(1+@) \2 4° 8 2° 4 


(iii) It is valid if |-7/2| <1 AND |x?| | or simply || < 1. 
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yr-yq _clr-clq_Afr-l/q_(a-r)/qr)_ 1 


A762 (9233 N2006/I/14) (i ' 
( aa) TR=io- <r = cg r-q r-q qr 


(ii) The described line has gradient gr and passes through P (cp,c/p). So, its equation is 
C1 
y--=qr(x-cp). 
Pp 


This line passes through V. So, plug (z,y) = (cv,c/v) into = to get 


a 11 p-v J 
—--—=qr(cu-cp) or —-—=aqr(v-p) or 
vp vp oe hd 





d d 
(iii) Observe that ry =c’. Apply — to get any +e=0. 
dy dy 


dx x C 
Rearrange to find that the requested gradient is -—| = a8 ag? 
dylizp ylizp c/p 


(iv) The normal at P is y—c/p 2 p? (x -cp). 
This line passes through 9. So, plug (z,y) = (cs,c/s) into = to get 


1 1 — 
<_ © <2 (es — cp) or —--=p*(s-p) or a 
Ss Pp Ss Pp Sp 





=p*(s-p) or oe 


(v) Since QP 1 PR, their gradients must be negative reciprocals of each other. 
Using (i), QP has gradient -1/qp and PR has gradient -1/pr. So, 


1 
——=pr or —_— 
qp qr 
Again from (i), -1/gr is the gradient of QR. From (iii), p” is the gradient of the normal 
at P. Hence, 3 says that QR is parallel to the normal at P. 


A763 (9233 N2006/II/2)(i) Use the Quotient Rule: 


dz Va? + 32 - x (4) (22 + 32)? (22) _ ree 32 


dir x + 32 (0? +329? (92 +32)? 


7 
Gi) f/-—+ ae9 S|] |4-4] -40-3)-4 ‘ 
2 (22 +32)9? 32| (22432)?|, 32LV8l V36], 3219 6/ 72° 


2 
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155.6. Ch. 138 Answers (Probability and Statistics) 


A776 (9758 N2017/II/5). XXX 
A777 (9758 N2017/II/6). XXX 
A778 (9758 N2017/II/7). XXX 
A779 (9758 N2017/II/8). XXX 
A780 (9758 N2017/II/9). XXX 
A781 (9758 N2017/II/10). XXX 
A782 (9740 N2016/II/5). XXX 
A783 (9740 N2016/II/6). XXX 
A784 (9740 N2016/II/7). XXX 
A785 (9740 N2016/II/8). XXX 
A786 (9740 N2016/II/9). XXX 
A787 (9740 N2016/II/10). XXX 


A788 (9740 N2015/II/5)(i) The manager may not have all the required information to 
properly implement stratified sampling. For example, he may not know what proportion 
of the sampling population each age group composes. 


(ii) Decide what the age groups are and how many he wishes to survey from each group. 
(That is, for each age group, set a quota of respondents to be surveyed.) Then simply go 
around surveying customers he sees in the supermarket, until he meets the quota for each 
age group. 


(iii) The manager may unconsciously gravitate towards customers that look more friendly. 
He may thus not get a representative sample of his customers (many of whom look un- 
friendly). 


A789 (9740 N2015/II/6)(i) Let X be the number of red sweets in the packet. 
PCY S421 = PCC ea) at Per 30) PC St) PC 9 SS) 


21-075 = ( . }oxstoa = ( 7 }ozsoas = ( ; Jozsoas 


= 1-0.75!0 ( 7 Joxstoa = ( > Jozsvas a ( * Jozstoas 


« 0.247501 


(ii) X ~ B(100,0.25). Since np = 25 > 5 and n(1->p) > 5, the normal approximation 
Y ~N (25, 18.75) is suitable. Hence, using also the continuity correction, 


29.5 -—2 
P(X 230) =1- P(X <30)#1-P(Y <29.)=1- 0 (23) 


18.75 
x 1 - ®(1.039) x 1 - 0.8506 = 0.1494. 


(iii) Let p = P(X > 30) » 0.1494 and q = 1- P(X > 30) * 0.8506. Then the desired 
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probability is 


15 15 15 14 15 918 15 312 
+ + a. ~% 0.8245. 
( 0 ) ( 1 } 9 pd 3 Pa 


A790 (9740 N2015/II/7)(i) The rate at which errors are made is independent of the 
number of errors that have already been made. 


The rate at which errors are made is constant throughout the newspaper. 
(ii) Let E ~ Po(6- 1.3) = Po(7.8). Then 





Lo r 7.8! (ec 


ai = a 2 te | 160 Os, 
ee ee er i to! 


) ~ 0.164770. 


(iii) Let F' ~ Po(1.3n). We are given that P(F < 2) < 0.05. That is, 





1.3n)®  (1.3n)} 
ertan(! on) re a) ) <0.05 or e13"(1+1.3n) < 0.05. 


Let f (n) = e18"(1+1.3n). From calculator, f (1), f (2), f (3) > 0.05 and f (4) < 0.05. 
Hence, the smallest possible integer value of n is 4. 


A791 (9740 N2015/II/8) (i) 


= 0.8825, 





ze 0.80 + 1.000 + 0.82 + 0.85 + 0.93 + 0.96+0.81+0.89 >) 2; 
n 


8 


.-#)? (0.80 - 0.8825)” + (1.000 - 0.8825)? +--+ (1.000 - 0.8825)? 
go = VO =F)" _ (0.80 ~ 0.8825)" + (1.000 = 0.8825)" +++ + (1.000 = 0.8825)" nar gage” 
n-1 fg 
(ii) The null hypothesis is Ho : 9 = 0.9 and the alternative hypothesis is H4 : [Wg < 0.9. 


_E-po _  0.8825-0.9 


= 0661800; 
s/\/n — 0.005592857/./9 





Since, |t| < t7,91 = —1.415, we are unable to reject the null hypothesis at the 10% significance 
level. 


A792 (9740 N2015/II/9)(i) By indep., P(B|A) = P(B) = 0.4. 


(ii)P(AU BUC) = P(A) + P(B) + P(C) - P(An B) - P(AnC)-P(BnC)+P(AnNBnC) 
= 0.45 +0.4+0.3-0.45-0.4-0.45-0.3- P(BnC)+0.1 = 0.935 - P(BnC) 


==> P(A'n B'nC"’) =1-P(AU BUC) =0.065+ P(BAC). 


The above is true even if B and C’ are not independent. 

And if B and C are independent, P(B nC) =0.4-0.3 = 0.12 and P(A’n B’nC") = 0.185. 
(iii) We know that P(An B’nC) = P(AnNC) - P(An Bn C) = 0.135 - 0.1 = 0.035. 

We want to find lower and upper bounds for P(BnC). Refer to diagram below. 
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At one extreme, it could be that P(A'’N BNC) =0, in which case P(BnNC) = 0.1. 

At the other extreme, it could be that P(A’n B’nC) =0, in which case P(BNC) = 0.265. 
Altogether, P(A’n B’nC’) = 0.065 + P(B nC) € [0.165, 0.33] 

In P(B nC) = P(An Bn) =0.1, PLBnC)< PLS) =04, PLB nec) <P(C) =0.2. 


PAN BNC’) P(A) = 0.45 


P(ANB' NC) 


P(A'NBNC') PA NB nc) 
P(A'NBNC) 





P(B) = 0.4 P(C) = 0.3 


A793 (9740 N2015/II/10)(i) 


30 


or 





() 


X 


() 5,000 10,000 15,000 20,000 25,000 30,000 35,000 40,000 45,000 


(ii) (a) PMCC x -0.9807. 
(ii) (b) PMCC » -0.9748. 
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(ii) (c) PMCC » -0.9986. 

(iii) We are apparently supposed to presume that the greater the PMCC, the “better” or 
the “more appropriate”. So we are supposed to use (c) from part (ii). 

The estimated regression equation is y—y = b(a-2), where b = > a » ai > #?. So in this 
case, the estimated regression equation is 


P - 14.083 = -0.147 (Va Z 140.986). 


(iv) Let x be the height given in metres. Then 32 = h. Thus, the above equation may be 
rewritten as 


P - 14.083 = -0.147 (V3x a 140.986) 


A794 (9740 N2015/II/11)(i) 8!/ (2!2!) = 10080. 


(ii) There is only one arrangement where the letters are in alphabetical order, namely 
AABBCEGS. Hence, the number of these arrangements in which the letters are not in 
alphabetical order is 10080 - 1 = 10079. 


(iii) Treating the two A’s as a single unit and the two B’s as a single unit, we have 6 units 
altogether, so there are 6! arrangements. 


(iv) Treating the two A’s as a single unit, we have 7 units altogether, so there are 7!/2! 
arrangements. 


Treating the two B’s as a single unit, we have 7 units altogether, so there are 7!/2! arrange- 
ments. 


Hence, the number of arrangements where there are at least two adjacent letters is 7!/2! + 
7!/2! - 6! = 7! - 6!, where the subtraction of 6! is to avoid double counting. 


Hence, he number of different arrangements with no two adjacent letters the same is 
8!/ (2!2!) — (7! - 6!) = 5760. 


A795 (9740 N2015/II/12)(i) Let Aj, A2,A3,A4,A5 be independent random variables 
with the identical distribution N (300, 207). Then F = Aj+A9+A3+A4+As ~ N (5-300, 5-207) 
and 


1600 - 1500 
P(F > 1600) =1- P(F s 1600) = 1- 0 


520 
=1- (V5) #1 - (2.236) » 1 - 0.9873 = 0.0127, 


(ii) Let P ~ N(200, 15). Then E = P, + Py +---+ Py ~ N(8-200,8-15°). Then F - E ~ 
N (5-300 - 8 - 200, 5- 20? + 8-15”) = N (-100, 3800) and 


P(F> E) = P(F-£>0)=1-P(F-£s0)=1-9 (2) 


/3810 
1 
=1- (=) x 1 — ®(1.622) x 1 - 0.9476 = 0.0524. 
/38 


(iii) 0.85F'+0.9E ~ N (0.85 -5-300 + 0.9 - 8 - 200, 0.852 - 5 - 20? + 0.98 - 15) = N (2715, 2903). 
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2750 — 2715 


P(0.85F + 0.9E < 2750) = ® 
2903 


x ©(0.650) © 0.7422. 


A796 (9740 N2014/II/5)(i) Arrange these 10000 customers by name, alphabetically. If 
two customers have the exact same same, then randomly pick one to precede the other. 


From this list of alphabetically sorted customers, pick every 20th customer to survey. 
(ii) Advantage: Each customer has equal probability of being surveyed. 


Disadvantage: There is the small risk that there is some periodic pattern that could bias 
the sample. For example, it could be that the customers are all in some country (or 
concentration camp), where each person has a 9-digit number for a name (e.g. 001533123) 
and only the most-privileged persons have 7 as the last digit of their name. If so, our 
proposed method would omit all the most-privileged persons. 


Such a pattern is obviously contrived and absurdly unlikely. In practice, it is unlikely that 
my proposed method of systematic sampling is any different from purely random sampling. 


A797 (9740 N2014/I1/6) (i) ( ' \( ; )( ; \( ; |. 31500. 


(ii) Ways to include only the midfielder brother = ( ' ( ; ( ; ( j } 


Ways to include only the attacker brother = ( : ( ; | ( ; | ( ; } 


In total, 16800 ways. 


(iii) The club now has 3 goalkeepers, 8 defenders, 3 midfielders, 5 attackers, and one player 
(call him Apu) who can either be a midfielder or a defender. 


Ways to form a team without am=(4 \( : \( : )( ?) =ais0 


Ways to form a team with Apu as a midfielder = ( : ( ; | ( : ( j =3 150, 


Ways to form a team with Apu as a defender = ( ; | ( ; ( : ( ; =2520). 


In total, 8820 ways. 
i] 
A798 (9740 N2014/II/7)(i) Let X be the number of sixes rolled. Then X ~ B (10. *). 


10 \/1\2/5\" 10!57 
And P(X =3)= =) t-)< = 0.155045. 
et ECS3) ( 3 \(2) (=) 31610 





i 
(ii) Let Y be the number of sixes rolled. Then Y ~ B (60, >). We have np > 5 and 


50 
n(1-p)>5. SoZ=N (10. =| is a suitable approximate distribution for Y. Using also the 


continuity correction, we have 
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8.5 -10 4.5 -10 
P(5<Y <8) x P(45<Z<8.5)=® ~ © ~ &(—-0.520) — &(-1.905) 
50/6 50/6 


= 6(1.905) — ©(0.520) » 0.9716 — 0.6985 = 0.2731. 


(Without using an approximation, P(5 < Y < 8) * 0.291854.) 
1 
(iii) Let A be the number of sixes rolled. Then A ~ B{ 60, =). We have n > 20 and np < 5. 
So B = Po(4) is a suitable approximate distribution for A. 
AG AY. Ae 


4(# 
P(5<A<8)xP(5< BK<8)=e (S4e a + =) # 0.349800. 





(Without using an approximation, P(5 < A < 8) » 0.353659.) 
A799 (9740 N2014/II/8)(a) Case (i) is in red and case (ii) is in blue. 





(b) (i) (A) PMCC « -0.9470452. 

(b) (i) (B) PMCC » -0.974921. 

(ii) It’s not at all clear which is the better model. But apparently we are supposed to say 
that since the second model is better because the magnitude of its PMCC is greater. 

In general, the estimated regression equation is y—¥y = b(a-Z), where b = » ae > Gi / > ie 
So in this case, the estimated regression equation is 


P — 72590 « -33659.728 (In m — 3.657) 
<=> P x -33659.728 lnm + 195693.560. 


(iii) P(50) » -33659.728 In 50 + 195693.560 ~ 64016. 


A800 (9740 N2014/II/9)(i) Let X be the number of minutes a bus is late after the new 
company has taken over. We’ll assume X ~ N ( ibs o*). 
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Our null hypothesis is Ho : js = fo = 4.3 and our alternative hypothesis is Hy: ps < pio = 4.3. 
(ii) The null hypothesis is not rejected if t > po—too.1-k/ Vn = 4.3-1.383- V3.2/V10 » 3.518. 
(iii) The null hypothesis is rejected if t < pg —to91-k/ Vn or 4.0 < 4.3 - 1.383 -k/V10 or 
k > 0.310/1.383 or k? > 0.3? - 10/1.383? » 0.471. 

A801 (9740 N2014/II/10)(i)(a) 0.1-0.2-0.1 = 0.002. 


(i)(b) The probability that no * is displayed is 0.9-0.8-0.9 = 0.648. And so the probability 
that t least one * symbol is displayed is 1 — 0.648 = 0.352. 


(i)(c) P(x x +) =0.3-0.1-0.2, P(x + x) = 0.3-0.3-0.4, P(+ x x) =0.4-0.1-0.4. 
Thus, the desired probability 0.006 + 0.036 + 0.016 = 0.058. 


(ii) The probability that there is exactly one « is P(« ##)+P(f * #)+P(f# *) =0.1-0.8- 
0.9+0.9-0.2-0.9+0.9-0.8-0.1 = 0.306. 


The probability that the symbols are *, +, © (in any order) is 


P(*+O)+P(*Ot+)+P(4+*O)+P(O*+)4+P (40%) +P(O+*) 
= 0.1(0.3-0.3 + 0.4-0.2) + 0.2(0.4-0.3 + 0.20.2) +0.1(0.4-0.4 + 0.2-0.3) 
= 0.017 + 0.032 + 0.022 = 0.071 


Hence, the desired probability is 0.071/0.306 = 71/306 ~ 0.232026. 
A802 (9740 N2014/II/11)(i)(a) Let O ~ Po(2) and P ~ Po(11). Then 


11° 11! 118 
P(P>8)=1-P(P<8)=1-e" (+ 4-4 =) 0.768015, 
(i)(b) O+ P ~ Po(13). So 


= -13 eee Se ec aaa caer nN 
P(O+P<15)=e (= oe ll oF +) 0.675132. 


(ii) Let Q ~ Po(2n). We are given that P(Q < 3) < 0.01. That is, 


(2n)” Qn)! | ny? 
0! 1! 2! 





P(Q <3) ao, ) =e" (1+ 2n+2n?) <0.01; 


Let f (n) =e?" (14 2n+2n?). From calculator, f (1), f (2), f (3), f (4) > 0.01 and f (5) < 
0.01. Hence, the smallest possible integer value of n is 5. 


(iii) Let R ~ Po(52-11) = Po(572). Given a large sample, we can use the normal distribution 
S ~ N (572,572)as an approximation. Hence, using also the continuity correction, 


590.5 — a 


P(R > 550) « P(S > 550.5) = 1- P(S < 550.5) = 1-6 
( pa ) ( ) ( Wap 


~ 1 — 6(-0.898960) = ©(0.898960) x 0.8158. 


(iv) Sales may be seasonal—e.g. it may be that art collectors make most of their purchases 
in the northern hemisphere’s summer months. 
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The sales of originals and prints may not be independent of each other. E.g., an art collector 
who buys an original Picasso might wish to also buy a few copies thereof. 


A803 (9740 N2013/II/5)(i) Use a computer program to randomly sort the 100000 
employees into an ordered list. Pick the first 90 employees on the list. 


The Chief Executive’s idea of a representative sample might be to have each country’s 
employees proportionally represented. For example, if 10% of employees are from India, 
then she may want 9 of the invited employees to be from India. 


(ii) Stratified sampling is more appropriate. If say 10% of employees are from India, 30% 
from China, 20% from Thailand, and 40% from Singapore, then we could instead pick 
from the list the first 9 Indian employees, the first 27 Chinese employees, the first 18 Thai 
employees, and the first 36 Singaporean employees. 


2a - 2a - 

A804 (9740 N2013/II/6). P(Y < 2a) =P (z < ——t) = 0.95 —> oP a 1.645 
O O 

2a- [1 





20= ph 
1.645 











P(Y <a) = P(Z< WTF) 20.25 = Fn -0.674 <> pa 0.6740 # 0.674 
O O 
7) : (1 | 20.674 
1.645) — 1.645 


A805 (9740 N2013/II/7)(i) The probability that one packet contains a free gift is 
independent of why another packet contains a free gift. 








—> u(t + Ja <> w*1.29a. That is, k x 1.29. 


There is no possibility that any one packet contains two or more free gifts. 


1 20 \/1\(19\” 
ii) Let fF ~ B{ 20,—]. Then P(F' = 1) = —<]{(ax] 8 0.877354. 
epee (20,55): Then POF =1) ( 1 } 5) (33) 


1 
(iii) Let F ~ B (60. =): Since n = 60 is large and np = 3 is small, a suitable approximation 
for F' is G ~ Po(3). 


a oe Be oS Be a 
+—+—+—+4+—+4+ 
O! ib 2! al at 





P(F 25) 8 P(G25)=1-6 ) = 0.184737. 


(By comparison, the actual probability is P(F' > 5) » 0.180335.) 

A806 (9740 N2013/II/8)(i) P(Bn A’) = P(BJA)P(A’ = 0.8 x 0.3 = 0.24. 

(ii) P(A’n B’) = 1-[P(A) + P(Bn A’)] = 1-0.7- 0.24 = 0.06. 

(iii) P(A‘|B’) = 1- P(A|B’) = 0.18. 

P(A'n B’) 0.06 _ 

P(A"B’) 0.12 
P(An B) =1-[P(A’) + P(An B’)] =1-0.3- P(An B’) 

= 0.7 — P(A|B’)P(B’) = 0.7 — 0.88 x 0.5 = 0.26. 


P(B’) = 0.5 


A807 (9740 N2013/II/9) (i) 
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ya — 14.04 12.5 + 11.0 + 11.04 12.5 + 12.6 + 15.6 + 13.2 








== = 12.8, 
n 8 
,-#)2 (14.0- 12.8)? + (12.5 —12.8)2 4+ --- + (13.2 — 12.8)? 
op RAAT ROT TS) CR RS UE SY’ 2.00574. 
n— 


(ii) The necessary assumption is that the population is normally distributed. 


The null hypothesis is Ho : Wp = 13.8 and the alternative hypothesis is H4 : pig < 13.8. 


Z—py  12.8-13.8 
t= = x - 1.862697. 
s[/n  ,/2.305714/8 


Since |t| < t7,9.95 = 1.895, we are unable to reject the null hypothesis at the 5% significance 
level. 

A808 (9740 N2013/II/10)(i) In blue is case (A), in red is case (B), and in green is 
case (C). 











@ 
o 
© 
e 
i aaa 
® $ . 
@ Y 
@® > 
(ii) 
bed ry 
150 & Distance, y ®©e@e @ 
@ 
@ 


Speed, x 





~ 


Or 


90 105 ROS 185 —si150 


() 15 30 45 60 


(iii) As a function of speed, the distance travelled decreases at an increasing rate. So (A) 
is the most appropriate. 


PMCC » -0.939203. 
(iv) In general, the estimated regression equation is y-y = b(a-Z), where b = De a | >. #2. 


So in this case, the estimated regression equation is 


y — 135 x —0.00461978 (x? — 11850.66667) 
<=> yx —0.004619782" + 189.747528. 
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Thus, y(110) * —0.00461978(110)? + 189.747528 « 134. 


A809 (9740 N2013/II/11)(i) The total number of ways to choose a code is 26°97 
(1423656). The number of ways to choose a code with three different letters and two differ- 
ent digits is 26-25-24-9-8 (1123200). Hence, the desired probability is 26-25-24-9-8/ (26° - 9”) = 


400 
—— # 0.78895. 
507 


(ii) The number of ways to choose the two digits so that the second digit is larger than the 
4 
first is 1+2+---+8=36. Hence, the desired probability is (1 +2+---+8)/9? = 9° 0.4. 


(iii) The number of ways to choose a code with exactly two letters the same, but not two 
digits the same is 


Arrange these three letters 


Repeated letter Third letter a Two digits 
— —— — 
26 x 25 x = x 9-8 = 26-25-3-9-8 = 140400. 


The number of ways to choose a code with exactly two digits the same, but not exactly 
two letters the same is 


All three All three 
Repeated digit letters different — letters same 
as a as 
9 x 26-25-24 + 26 =9-26-601 = 140634. 


26-25-3-9-8+9-26-601 25-3-8+601 1201 
263 - 92 262-9 6084 

(iv) There are 4 ways to choose the even digit, 5 to choose the odd digit then 2 ways to 

arrange these two digits. Hence, there are 4-5-2 = 40 ways to choose the two digits. 


x 0.197. 





Hence the desired probability is 


There are 5 ways to choose the vowel. There are 21? ways to choose the two consonants. 
We can now slot in the vowel amidst the consonants in 3 different ways. Hence, there are 
5+ 217-3 ways to choose the three letters. 


Altogether then, there are 5-21?-3-4-5-2 ways to choose a code with exactly one vowel 

and exactly one even digit. 

5+217-3-4-5-2 5-7?-5 1225 
26392 ~ 138-3 6591 

A810 (9740 N2013/II/12)(i) #1. The number of people sick on a particular day is 


independent of how many were sick the previous day. #2. The average number of illnesses 
in any span of 30 days is the same, throughout the course of the year. 


* 0.18586. 








Hence the desired probability is 


Condition #1 may not be met if the illness is contagious. If so, we’d expect the number of 
people sick on a particular day to depend (positively) on how many were sick the previous 
day. 


Condition #2 may not be met if the illnesses are seasonal. For example, due to influenza, 
illnesses may be more common during the winter than during the summer. 
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Let A ~ Po(1.2) and M ~ Po(2.7). 


(ii) Let B ~ Po(1.2n). Then P(B = 0) < 0.01 — > e 1?" < 0.01 — > n> (In0.01) /(-1.2) » 
3.8. Hence, the smallest number of days is 4. 


(iii) Let C be the total number of days of absence across both departments, over a 5-day 
period. Then C’ ~ Po(19.5) and 


at 19,5) 
P(C > 20) =1-P(C'< 20) = 1- °° ¥) —— w 0.396583. 
xo & 


(iv) Let D be the total number of days of absence across both departments, over a 60-day 
period. Then D ~ Po(234). Since Ap = 234 is large, the normal distribution is a suitable 
approximation. Let E ~ N (234, 234). Then, 


250.5 — 234 199.5 — 234 
P(200 < D < 250) » P(199.5 < E < 250.5) = ® (a) - ed 


V 234 V 234 


~ ©(1.0786) — ®(—2.2553) x 0.8597 — 0.0120 = 0.8477. 


A811 (9740 N2012/II/5)(i)(a) Let +, -, D, and N denote the events “positive result”, 
“negative result”, “has disease”, and “no disease”. Then, 


P(+) = P(+|D)P(D) + P(+|N)P(N) = p-0.001 + (1 - p) -0.999 = 0.999 — 0.998p = 0.00599. 
(i)(b) P(D|+) = P(D n+) = P(+) = P(D)P(+|D) = P(+) = 0.001p + 0.00599 » 0.166110. 
(ii) asP(D|+) = 0.75. But 


0.001p 
0.999 — 0.998p 





P(D|+) = 


So 3(0.999 — 0.998p) = 4(0.001p) or 2.997 = 2.998p or p = 0.999666. 
A812 (9740 N2012/II/6)(i) Ho : pup = 14.0, Hy: po # 14.0. 


(ii) Hp: Z~N (14.0, 3.87). Since Zp.925 = 1.96, the values of % for which the null hypothesis 
would not be rejected are 


o o 3.8 3.8 
Z€| U- Zo025—, LH + 20.025 — | = | 14.0 - 1.96 ——, 14.0 + 1.96 —— ] & (12.335, 15.665). 
(0 a Lt om) ( J20 =) ( ) 
(iii) The null hypothesis is rejected. 

A813 (9740 N2012/II/7)(i) There are 15! ways to arrange the 15 individuals. 


There are 2 ways to arrange the 2 sisters as a single unit. Counting the 2 sisters as a single 
unit, we have 14 units total, and there are 14! ways to arrange these 14 units. So, there 
are in total 2-14! ways to arrange the 15 individuals so that the two sisters are together. 


Hence, the probability that the sisters are next to each other is 2-14!/15! = 2/15 = 0.13. 


(ii) There are 3! ways to arrange the 3 brothers as a single unit. Counting the 3 brothers 
as a single unit, we have 13 units total, and there are 13! ways to arrange these 13 units. 
So, there are in total 3!- 13! ways to arrange the 15 individuals so that the three brothers 
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are together. We do not want the three brothers to be together. 

Hence, the desired probability is 1-3!-13!/15! = 1-6/ (14-15) = 1-1/35 = 34/35 » 0.97142857. 
(iii) There are 2 ways to arrange the 2 sisters as a single unit and 3! ways to arrange the 3 
brothers as a single unit. Counting the 2 sisters as a single unit and also the 3 brothers as 
a single unit, we have 12 units in total, and there are 12! ways to arranges these 12 units. 
So, there are in total 2-3!-12! ways to arrange the 15 individuals so that the 2 sisters are 
together and the 2 brothers are together. 

Hence, the desired probability is 2-3!-12!/15! = 12/(13-14-15) = 2/(13-7-5) = 2/455 » 
0.0043956. 


(iv) Let A and B denote the events that “the sisters are next to each other” and “the 
brothers are next to each other”. Our desired probably is P(Au B). 


Zz. 2 91-2 13 Z 
. = 


a a ae ae ae ae 





_ 91-2 33. 43 4B 
3-455 3-455 3-91 243 


A814 (9740 N2012/II/8) (i) 


~ 0.17695. 





100 
80 


Percentage mark, y 





(ii) The trend is one of steady improvement. After a terrible performance in Week 1, Amy 
resolves to work hard. Her work pays off, with her mark improving week after week. 


The only deviation from trend occurs on Week 5, because Amy happened to be experi- 
menting with drugs that week. 


(iii) A linear model would suggest that she eventually breaks the 100% barrier, which is 
quite impossible. 

A quadratic model would suggest that her mark eventually starts falling and moreover at 
an increasing rate, which is quite improbable, unless of course she gets hooked on drugs. 
(iv) PMCC » -0.929744. 


(v) We are supposed to say that the most appropriate choice is wherever the magnitude of 
the PMCC is the largest. Hence, L = 92 is the most appropriate. 


(vi) In general, the estimated regression equation is y-y = b(a-Z), where b = > a; > 4; j > o. 


So in this case, the estimated regression equation is 
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In (92 — y) — 3.125912 » -0.279599(x — 3.5) 
<=> In(92-y) » -0.279599x + 4.104510. 


y>90 — > -0.287+4.10<In2 == x >12.2. So she'll get at least 90% in Week 13. 


(vii) As x > oo, y > L. An interpretation is thus that L is the best mark she can ever 
hope to get, no matter how long she spends studying. 


A815 (9740 N2012/II/9)(i) The choice must be binary—a voter must be said to either 
support the Alliance Party or not support it. 


The probability that any one polled voter supports the Party is independent of whether 
another polled voter supports the party. 


(ii) P(A =3)+P(A=4) = ( 7 ae _p)" 4 ( a ae — p)?6 = 0.373068. 


(iii)(a) np = 16.5 > 5 and n(1->p) = 13.5 > 5 are both large and so yes, the normal 
distribution N(16.5, 16.5-0.45)would be a suitable approximation for A. 


(iii)(b) p is large. And so, while it is certainly possible to use the Poisson distribution as 
an approximation, it would fare poorly. 


(iv) P(A = 15) = ( . xe — p)'? ~ 0.06864. 


1/15 
Thus, p(1-p)=p-p?* oosay| i ) ~ (0.237900. 


Rearranging, p* — p + 0.237900 = 0. By the quadratic formula, p » 0.39,0.61. Given that 
p<0.5, we have px 0.39. 


A816 (9740 N2012/II/10)(i) The number of gold coins in a randomly chosen square 
metre is independent of how many gold coins there are in the square metre to its left. 


No two coins are stacked exactly on top of each other. 
(ii) Let G ~ Po(0.8). Then, 


0.89 0.8! 0.8? 
+ ~ 
0! if 2! 





P(G>3)=1-e°%8 ( ) ~ 0.0474226. 


(iii) Let H ~ Po(0.8x). Then P(H = 1) = e°**(0.82) = 0.2. 

By calculator, x * 0.323964, 3.1783. So x » 0.323964. 

(iv) Let J ~ Po(80). Since A is large, the normal distribution J ~ N(80,80) is a suitable 

approximation. Using also the continuity correction: 

89.5 — 80 
V80 

(v) Let P~ Po(3). Let Z be the number of gold coins and pottery shards found in 50 m?. 


Then Z ~ Po(190). Since A is large, the normal distribution Q ~ N(190, 190) is a suitable 
approximation for Z. Using also the continuity correction, 


P(I > 90) » P(J > 89.5) =1- a x 1- ®(1.062) x 1 - 0.8559 = 0.1441. 
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199.5 — 190 
V 190 


(vi) Let X and Y be, respectively, the numbers of gold coins and pottery shards found in 
50m?. Then X ~ Po(40) and Y ~ Po(150). Our goal is to find P(Y > 3X) = P(Y-3X > 0). 
Since Ax = 40 and Ay = 150 are both large, the normal distributions A ~ N(40,40) and 
B ~ N(150,150) are suitable approximations for X and Y, respectively. And in turn, 
B-3A ~ N(150 - 3-40, 150 + 37-40) = N(30,510) is a good approximation for Y - 3X. 
Hence, using also the continuity correction, 


P(Z > 200) » P(Q > 199.5) =1- a x 1 — ®(0.6892) » 1 — 0.7546 = 0.2454. 


~0.5 - 30 30.5 
P(Y -3X >0) x P(B-3A>-0.5) = 1-0(=—*) -0( 23 


x ©(1.3506) x 0.9116. 
510 =) ( ) 


40.0 - 40.0 - 
A817 (9740 N2011/II/5)(i) P(X < 40.0) = P(z Z ———*) ~ 0.05 gl ae ee 
O O 
1.645 <> 81.6450 + 40.0 


70.0 - 70.0 - 
P(X < 70.0) =P(z< can -0.975 << casket 2 1.96 <> u® —1.960 + 70.0. 
Oo Oo 


Comparing # and &, we have 1.6450 + 40.0 * -1.960 +70.0 <=> 3.6050 ~ 30.0 <> o #83 
and js ® 53.7. 


A818 (9740 N2011/II/6)(i) Decide what the age groups will be. Decide how many 
from each age group are to be interviewed (these are our quotas). Then pick, at random, 
residents on the street to be interviewed, until the quota for every age group is fulfilled. 


(ii) Residents who are on the street may not be a representative sample of the population. 


(iii) Random sampling. Acquire a complete list of the city suburb’s population. Use a 
computer program to randomly pick a sample. Interview this sample. 


No it is not realistic. First, one may be able to acquire a complete list of the city suburb’s 
population. Second, one may not be able to contact every member of one’s sample. 


A819 (9740 N2011/II/7)(i) #1. I do indeed make an actual attempt to contact n 
different friends. 


#2. The probability that one friend is contactable is independent of whether another friend 
is contactable. 


(ii) Assumption #1 may not hold because if say n = 100, I may run out of time before I 
attempt to contact all 100 different friends. 


Assumption #2 may not hold because my friends probably know each other and so they 
might be watching a movie together and their handphones are switched off. This would 
mean that the probability that one friend is contactable is dependent on whether another 
friend is contactable. 


5) 5 
(iii) P(R>6) =1-)>P(R=i)=1-> ( 2 Jor03 ~# 0.551774. 
i=0 i=0 t 


(iv) Since np = 28 > 5 and n(1-p) =12>5 are both large, a suitable approximation to R 
is the normal distribution S ~ N (28,8.4). Using also the continuity correction, we have 
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24.5 — 28 


JV84 
= 1- (1.2076) » 1 - 0.8863 = 0.1137. 


P(R < 25) » P(S < 24.5) - 0 } = (-1.2076) 


A820 (9740 N2011/II/8) (i) 





(ii) The PMCC is » -0.992317 which is very large in magnitude. But this merely means 
that the correlation between x and y is very strong. It does not also imply that their true 
relationship is definitely linear. Indeed in this case, it appears that the relationship is not 
linear. 


(iii) We are supposed to say that the larger the magnitude of the PMCC, the better the 
model. In this case, the PMCC of y and 2? is —0.999984. And so we’re supposed to conclude 
that y = a + bx? is the better model. 


(iv) In general, the estimated regression equation is y-y = b(a-Z), where b = )> 2; ) Gj / > a2 
So in this case, the estimated regression equation is 


y — 10.885714 » -0.856210 (x? — 13.25) 
<> y x -0.856210x7 + 22.230492. 


y(3.2) = -0.856210 - (3.2)? + 22.230492 ~ 13.5. 
A821 (9740 N2011/II/9)(i)(a) 0.6 - 0.05 + 0.4 - 0.07 = 0.03 + 0.028 = 0.058. 
(i)(b) 0.03/0.058 = 15/29 » 0.517241. 


(ii)(a) P(Exactly one faulty) = P(First faulty, second not) + P(Second faulty, first not) = 
0.058 (1 — 0.058) + (1 — 0.058) 0.058 = 2- 0.058 - 0.942 = 0.109272. 


E 
(ii) (b)P(Both made by A|Exactly one faulty) = ae 


But P(En F) = P(E)P(FIE) = 0.62(0.05-0.95 + 0.95- 0.05) = 0.0342. Hence P(E|F) = 
0.0342/0.109272 ~ 0.312980. 


A822 (9740 N2011/II/10)(i) We are given that T ~ N(5.0, 38.0). 


Let X be the time taken to install the component after background music is introduced. 
Assume that X remains normally distributed with standard deviation 5.0 (these are ques- 
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tionable assumptions, but without these we cannot proceed). That is, X ~ N (110, 50"). 
The null hypothesis is Ho : Wp = 38.0 and the alternative hypothesis is H4 : pg < 38.0. 


(ii) Zo.05 * 1.645. So to reject the null hypothesis, we must have t < jug — Zo.o50/V/n = 
38.0 — 1.645 - 5.0/V50 * 36.8. 

(iii) Since the null is not rejected with ¢ = 37.1, we must have t = 37.1 > fg — Zo.950/V/n = 
38.0 — 1.645 -5.0/\/n. Rearranging, n < (1.645 - 5.0/0.9)? » 83.5. Thus, n ¢ {1,2,...,83}. 


A823 (9740 N2011/II/11)(i) There are in total C(30, 10) ways to choose the committee. 
There are C(18,4) x C(12,6) ways to choose a committee with exactly 4 women. Hence, 
the desired probability is 


18 \/ 12 30 \_ [(18-17-16-15) /4!][(12-11-10-9-8-7) /6!] 
4 6 | 10 (30-29-----21) /10! 





17-48 


= ———_—_ x 7..9410679. 
29-13-23 


(ii) The number of ways to choose a committee with exactly r women is 
18 12 
if 10-r ] 
And the number of ways to choose a committee with exactly r + 1 women is 
18 12 
r+l1 Q-r] 
We are told that the first number is greater than the second, i.e. 
18 12 . 18 12 
‘i 10-r r+1 9-r 


18! 12! " 18! 12! 
(18-r)!lr!(2Q+r)!0-r)! (A7-r)r+1)!(+r)(9-7r)! 





<> (17-r)'(r+1)'(84r)(9-r)! > 18-r)!Iril(24+r)!10-r)! (as desired). 


Continuing with the algebra, we have (r+1)(3+r) > (18-r)(10-r) =< 1r?+4r+3> 
r?—28r+180 <= 32r>177 — > r>5+17/32. 

We have just proven that P(R =r) > P(R =r+1) if and only if r = 6,7,8,9. That is, 
we have just shown that P(R = 6) > P(R = 7) > P(R = 8) > P(R = 9) > P(R = 10), but 
PGi=0)< Pel) < Piha?) <P(h=3)< PUie4) < PUi=5) <PUA =6). 

We have thus shown that 6 is a most-probable-number-of-women and that 7, 8, 9, 10 are 


not. We must rule out that 5 (or any smaller number) is a most-probable-number-of-women. 
But clearly, 6!4! # 5!5!, so that 
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CCT) YG} 


Hence, it is indeed the case that P(R = 5) < P(R = 6). Thus, 6 is indeed the unique 
most-probable-number-of-women. 


A824 (9740 N2011/II/12)(i) Let X be the number of people who join the queue in a 
period of 4 minutes. Then X ~ Po(4.8) and 


7 a 
P(X >8)=1-P(X <7)=1-e*8y* = s 0.113334. 
imo 


(ii) Let Y be the number of people who join the queue in a period of t minutes. Then 
Y ~ Po(1.2t/60) = Po(0.02t). We are told that P(Y <1) =0.7. That is, 


PUY aie (1002) = 0:7, 


By calculator, t * 54.8675. 

(iii) Let Z be the number of people who leave the queue over 15 minutes. Then Z ~ Po(27). 
Let B be the number of people who join the queue over 15 minutes. Then B ~ Po(18). 
We wish to find P(35+ B- Z>24)=P(Z-B<11). 


Since Az = 27 is large, a suitable approximation for Z is the normal distribution is A ~ 
N(27, 27). Since Ag = 18 is large, a suitable approximation for B is the normal distribution 
is C ~ N(18,18). In turn, a suitable approximation for 7 - B is A- C ~ N(9,45). Hence, 
using also the continuity correction, 


11.5-9 
V 45 


(iv) There might be certain periods of time when more planes arrive and other periods when 
fewer arrive. So the rate at which people join the queue will probably not be constant. 


A825 (9740 N2010/II/5)(i) Say we wish to stratify the spectators by age group. One 
problem is that we may not know what proportion of the spectators belongs to each age 
group. As such, it would may be difficult to get a representative sample. 





P(Z-B<11)xP(A-C<115) = a ] x &(0.3727) x 0.6453. 


(ii) Order the spectators by their names, alphabetically. Choose every 100th spectator on 
the list to survey. 


A826 (9740 N2010/II/6) (i) 


pa ai 
n 11 


goles (Xt) /11 _ 18779.43 - 454.32/11 


= 1.684. 
n-1 10 


(ii) The null hypothesis is Ho : 9 = 42.0 and the alternative hypothesis is Hy : pug # 42.0. 
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E-py  41.3-42.0 


7 = ~ —1.789. 
s[//n_ \/1.684/11 








Since |T| < t10,0.95 = 1.812, we are unable to reject the null hypothesis. 
A827 (9740 N2010/II/7)(i) P(An B’) = P(A|B’)P(B’) = 0.8- 0.4 = 0.32. 
(ii) P(AU B) = P(B) + P(An B’) = 0.92. 

(iii) P(B’|A) = P(B’n A) = P(A) = 0.32 + 0.7 = 16/35 » 0.457142857. 

(iv) P(A’nC) = P(A’)P(C) = 0.3-0.5 = 0.15. 

(vy) P(A’ Bod) < 0.15. 


A828 (9740 N2010/II/8)(i) The probability that the number is greater than 30000 is 
the probability that the first digit is 3, 4, or 5. Answer: 3/5 = 0.6. 


(ii) The first three digits are odd and there are 3! ways to arrange them. The last two are 
even and there are 2! ways to arrange them. The total number of ways to arrange the five 
digits is 5!. Answer: 3!2!/5! = 1/10 = 0.1. 


(iii) If the first digit is 3, the last digit must be lor 5, and in each case, there are 3! ways 
to arrange the middle 3 digits. 


Similarly, if the first digit is 5, the last digit must be 1 or 3, and in each case, there are 3! 
ways to arrange the middle 3 digits. 


If the first digit is 4, the last digit can be 1, 3, or 5, and in each case, there are 3! ways to 
arrange the middle 3 digits. 


Altogether then, there are 7-3! ways to get such a number and the desired probability is 
(23)/0)=7/20= 0,35, 


A829 (9740 N2010/II/9)(i) Our desired probability is P(Y > 2X) = P(Y - 2X > 0). 
Now, Y - 2X ~ N (400 - 2-180, 60? + 27307) = N (40,7200). So 


0 — 40 
V 7200 
(ii) Our desired probability is P(0.12X +0.05Y > 45). Now, 





P(Y -2X>0)=1 - 4 J ®(0.4714) x 0.6813. 


0.12X +0.05Y ~ N (0.12- 180 + 0.05 - 400, 0.12? - 30? + 0.05? - 60?) = N (41.6, 21.96) 
45 — 41.6 
21.96 


(iii) Our desired probability is P (0.12X, + 0.12X2 > 45). Now, 


— > P(0.12X +0.05Y > 45) =1-® ( x 1 - ®(0.7255) » 1 - 0.7658 = 0.2342. 


0.12.X1 + 0.12X ~ N (0.12- 180 + 0.12 - 180, 0.12? - 30? + 0.12? - 30?) = N(43.2, 25.92) 
45 — 43.2 


P (0.12 +0.12Xy > 45)=1-® 
( ; ae) ( 25.92 


# 1 — ©(0.3536) » 1 - 0.6381 = 0.3619. 


A830 (9740 N2010/II/10) (i) 
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(ii)(a) PMCC » 0.986024. 
(ii)(b) PMCC » 0.990681. 


(iii) We are, as usual, supposed to say that the larger the magnitude of the PMCC, the 
better the model. So F = c+ dv” is the better model. 


(iv) In general, the estimated regression equation is y—y = b(a-2), where b = )° 2; ai >, ie 
So in this case, the estimated regression equation is 


F — 14.25 ~ 0.0242420 (x? — 456) 
<=> F x 0.0242420r? + 3.195652. 


And F = 26.0 <> 2 \/ (26.0 - 3.195652) /0.0242420 x 30.7. 


To predict a value of v given a value of F’, it would be more appropriate to use a regression 
where v (or a function of v) is the independent variable and F (or a function of F’) is the 
dependent variable. 


A831 (9740 N2010/II/11)(i) Let X be the number of calls received in a randomly 
chosen period of 4 minutes. Then X ~ Po(12) and 


128 
P(X =8) = ae ~ 0.0655233. 


(ii) Let Y be the number of calls received in a randomly chosen period of t seconds. Then 
Y ~ Po(3t/60) = Po(0.05t) and P(Y = 0) =e" = 0.2. So t = (In0.2) /(—-0.05) » 32. 


128 
P(Y = 0) =e? = 0.0655233. 


(iii) Let Z be the number of calls received in a randomly chosen period of 12 hours. 
Then Z ~ Po(2160) and a suitable approximation therefor is the normal distribution A ~ 
N (2160, 2160). Hence, using also the continuity correction, 


2200.5 — 2160 
V 2160 





P(Z > 2200) » P(A > 2200.5) =1-© ( ] ~ 1— (0.8714) » 1 - 0.8082 = 0.1918. 


(iv) ( : }o.soiso sos x 0.2354. 
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(v) Let B be the number of busy days out of 30. Since np x 5.754 >5 and n(1-p)>5,a 

suitable approximation to B is the normal distribution C ~ N (5.754, 4.650). So using also 

the continuity correction, 

10.5 — 5.754 
4.650 


(Without using any approximation, P(B < 10) » 0.980906.) 


A832 (9740 N2009/II/5). Simply survey people standing outside the theatre waiting 
for the movie to start. Stop once the quota of 100 persons is met. 


P(B< 10) P(C< 10.5) - 0 } = o@.201) x 0.9861. 


A disadvantage is that this may not be a representative sample. For example, there will be 
no late-comers in our sample of 100. 


A833 (9740 N2009/II/6) (i) 


X 
> 





(ii) No. A linear model would imply that several centuries hence, the time taken to run a 
mile would be negative, which is clearly impossible. 


The scatter diagram similarly suggests that the rate of improvement is tapering off, rather 
than linear. 


(iii) A quadratic model would imply that the world record time taken to run a mile eventu- 
ally bottoms out, then starts increasing. But by definition, it is impossible that the world 
record time increases. 


(iv) In general, the estimated regression equation is y—y = b(a-Z), where b = 9° &; ) Gj / > ae 


So in this case, the estimated regression equation is 


Int — 3.161647 x -0.0161280(zx - 1965) 
<=> Int» -0.0161280x + 34.853071. 


t(2010) @-0-0161280(2010) +34.853071 
2010 is 3m 41.4 s. 


Our range of data is 1930-2000. We are extrapolating our data, which might not always 
work out reliably. 


A834 (9740 N2009/II/7)(i) Let E and F be the events that “a randomly chosen com- 
ponent that is faulty” and “a randomly chosen component was supplied by A”. Then, 


# 11.4. So the predicted world record time on Ist January 
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P(E) = 0.01p-0.05 + 0.01(1 — p)0.03 = 0.03 + 0.02 - 0.01p = 0.035 


P(FoE) 0.01p-0.05 —— 0.05p 7.9 


ii =P(FIE) = = = dD 5 
OY) TD) = BIE) F(E)  0.03+0.02-0.01lp 3+0.02p 3 + 0.02p 

f' (p) = 7.5(3 + 0.02p)-?(0.02) > 0. This shows that the probability that a randomly chosen 
component that is faulty was supplied by A is increasing in the percentage of electronic 
components bought from A. Which is not very surprising. 


A835 (9740 N2009/II/8)(i) We have 8 letters total, 3 of which are repeated. Hence, 
there are 8!/3! = 6720 possible permutations. 


(ii) Let TD or DT be a single letter. Then we have 7 “letters” total, 3 of which are 
repeated, so there are 2! x 7!/3! possible permutations that we do not want. So there are 
6720 — 2! x 7!/3! = 5040 possible permutations that we do want. 


(iii) The 4 consonants by themselves have 4! possible permutations. The 4 vowels by 
themselves have 4! +3! = 4 possible permutations. The first letter can either be a consonant 
or a vowel. Hence, there are in total 2 x 4! x 4 = 192 possible permutations. 


(iv) There are only four broad possibilities: E E E_,E E E,E _ 
E _E,and_ E E E. Each of which have 5! possible permutations. Hence, 


there are in total 4 x 5! = 480 possible permutations. 








- ule - 2.53 — 2. 
A836 (9740 N2009/II/9)(i) M ~N (2.5 —). So P(M > 2.53) =1-@ (22) 
n 


0.12/n 
1-©(0.3Vn) = 0.0668 <> ©(0.3Vn) = 0.9332 <> 0.3/n=15 <— n=25. 
(ii) Assuming the thicknesses of the textbooks are independently distributed, 


X = M,4+---+Mo14+5S14+...So4~N (21 -2.54+24-2.0,21-0.17 +24- 0.087) = N (100.5, 0.3636) . 


100 — 100.5 
V 0.3636 


(iii) Again assuming the thicknesses of the textbooks are independently distributed, our 
desired probability is P (S1 + So +53 + $4 < 3M) =P (S$, +S) +53+5,-3M <0). Now, $1+ 
So + S3+S4-3M ~N(4-2.0-3-2.5,4-0.08? + 3?-0.17) = N (0.5,0.1156). Hence, 


Now, P(X < 100) = 0 ) ~ 1- ® (0.8292) « 1 - 0.7964 = 0.2036. 


0-0.5 
P(S, + 55+ $,+5,-3M <0) = ®| ———— |] x 1- 6(1.4706) » 1 — 0.9293 = 0.0707. 
(5) + S9+ 53+ S4 ) ( =*2 ( ) 


(iv) The thicknesses of the textbooks are independently distributed. 
A837 (9740 N2009/II/10) (i) 





Pome ee > x)’ [n _ 835.92. - 86.42/9 | 
7 n-1 7 8 . 


0.81. 
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(ii) A necessary assumption is that X is normally distributed. The null hypothesis is 
Ho: po = 10 and the alternative hypothesis is Hy : lo + 10. 
_ X- Lo _ 9.6-10 _ 4 


"Siva Joss 3 








Since |t| < tg.o.925 = 2.306, we are unable to reject the null hypothesis. 


The sample size is small. And so we are unable to appeal to the CLT and claim that a 
normal distribution is a suitable approximate distribution for Z. 


(Author’s remark: It actually makes no sense to say that “the CLT does not apply in this 
contert”. The CLT certainly applies. It is merely that the normal distribution is a poor 
approximation for the sample mean.) 


(iii) We’d use the Z-test instead. 


A838 (9740 N2009/II/11)(i) The probability that any observed car is red is independent 
of whether any other observed car is red. 


Each car is either strictly red or strictly not red. 


(ii) P(4< R<8) 
2 ( 1 Jousioss a . Joustoss ( > Joust ( . Jousross" 


® 0.346354. 


(iii) Since np and n(1-p) are large, a suitable approximation to R is the normal distribution 
X ~N(72,50.4). Hence, using also the continuity correction, 


59.5 — 72 
V 50.4 


(iv) Since n is large and p is small, a suitable approximation to R is the normal distribution 
Y ~ Po(4.8). Hence, 


P(R<60) x P(X <59.5) = a ~ 1- (1.761) x 1 - 0.9609 = 0.0391. 


(v)P(R=0)+P(R=1) -( 7 ie =o . Jan” 
=(1-p)!9(1-p+20p) = 0.2. 


By calculator, p » 0.142432. 


A839 (9740 N2008/II/5)(i) Take any ordered list of the 950 pupils. From the list, pick 
every 19th student. 


(ii) We might want each level to be equally well-represented. For example, we might like 
approximately one-sixth of the sample to be from Primary 1, another sixth from Primary 
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2, etc. 


In which case we’d probably prefer to do a stratified sample. The method might be some- 
thing like this: Pick from the aforementioned ordered list the first 108 Primary 1 students, 
the first 108 Primary 2 students, etc. 


A840 (9740 N2008/II/6). Let the mass of calcium in a bottle (after the extreme 
weather) be X ~N ( LLo, oO}: (We have made the necessary assumption that X is normally 
distributed. ) 


The null hypothesis is Ho : wp = 78 and the alternative hypothesis is Ho : jug # 78. Now, 
E — Lo ~x/n-78 
t —= — 
SVM \/[S02-(S2)' /n]/(n-Diva 


Since |t| < t14,0.025 * 2.145, we are unable to reject the null hypothesis. 

A841 (9740 N2008/II/7)(i) Let A; denote the event that A wins the first set. Similarly 
define Ao, As, By, Bo, and Bs. P (Az) =? (A; MN Ag) +P (By M Ag) = 0.6-0.7+0.4-0.2 =0.5. 
(ii) P (A wins) = P (A, n Ag) + P (A, MN Bo M A3) + P (By Nn A» n A3) = 0.42+0.6-0.3-0.2+ 
0.4-0.2-0.7 = 0.42 + 0.036 + 0.056 = 0.512. 

(iii) P (B, n Ayn As) /P (A wins) = 0.056/0.512 = 0.109375. 

A842 (9740 N2008/II/8)(i) PMCC * 0.9695281468. This large PMCC merely suggests 


that there is a strong (positive) linear relationship between x and t. However, the true 
relationship between x and t could be something other than linear. 


(ii) 





= 1.207, 





(iii) Without P, it appears that t is increasing, but at a decreasing rate. So a log model 
might be appropriate. 


(iv) In general, the estimated regression equation is y — y = b(a - Z), where 
b= Yai af D2. 
So in this case, the estimated regression equation is 


t — 6.45 4.396563 (In x — 1.143002) 
<=> tx 4.396563 In x + 1.424722. 
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So for the model t = a+ bln z, the least square estimates are a x 1.4 and b » 4.4. 
(v) t(2 = 4.8) » 4.41n(4.8) + 1.4» 8.3. 

(vi) This would be an extrapolation of the data, which may or may not be wise. 
A843 (9740 N2008/II/9) (i) X ~ Po(1.8). 


18° La 1.8? 
+ + 
0! 1! 2! 





P(X > 4) =1-P(X <3) = 1-e18/ ) = 0.108708. 


(ii) Let Y be the total number of pianos sold in a given week. Then Y ~ Po(4.4). P(Y = 
4) =e *44.44/4! » 0.191736. 


(iii) Let Z be the number of grand pianos sold in 50 weeks. Then Z ~ Po(90). Since Az 
is large, a suitable approximation is the normal distribution A ~ N (90,90). Hence, using 
also the continuity correction, 


19.0 =90 
V90 


(iv) An organisation might buy a relatively large number of grand pianos on any given day. 
So it is not likely that the rate at which grand pianos are sold is constant throughout the 
year. 


A844 (9740 N2008/II/10)(i) ( : \( ; \( ; |. 3-4-10 = 120. 


| 9) _ 

ay (2)-0 

ain (1)(7)<(2) (4 J-5:29 215-200 
A A 4) 3 


(iv) The number of ways to have 


P(Z < 80) x P(A< 79.5) = a ] ~ 1 — (1.1068) » 1 - 0.8657 = 0.1343. 


¢ No diplomats from K (i.e. only diplomats from L and M) is ( : } 


¢ No diplomats from L is ( : } 


¢ No diplomats from M is 0. 


_ {| 12 
The total number of ways to choose the diplomats is /.: Hence the number of ways to 


have at least 1 diplomat from each island is ( ‘ - : + ( : ) = 495 -(9+1) = 485. 
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A845 (9740 N2008/II/11)(i) X; +X2 ~N(100,2-87). So, 


120 — 100 


P(X + Xa> 120) =1- 8 Pee 


] w# 1 — 0(1.768) » 1 - 0.9615 = 0.0385. 


(ii) X; — X_ ~ N(0,2-87). So, 














15-0 

POG S Xy415)= POG = 29845) S1-<6 (==) w 1 — &(1.3258) » 1 — 0.9075 = 0.0925. 

i 4- 
(iii) py < 74) = 0(=—*) - 0.0668 ga ee 

O O 

2 146 - 146 - 
P(Y > 146) =1-6( H) = 0.0668 = o/ e H) = 0.9332 5 ee 

O O O 


146-4 74- 72 
1 5-(-1.5) 233 <> o=24 and p= 110. 
Oo Oo O 


Since o = 8a and pw = 50a+ 6, a=3 and b= —40. 
A846 (9233 N2008/I/1). 3 ways to arrange the 3 groups of books. And within each 
group of books, we can permute them as usual. So there are 3!6!5!4! = 12441600 ways. 


A847 (9233 N2008/II/23). By independence, pang = paps. Also paup = PA+PB-PAnB = 
pat+pp-papp. Plugging in the given numbers, we have 0.4 = 0.2+ pg -0.2pp, so pp = 0.25. 





pBpc = 0.25-0.4 = 0.1 = pgac, so that by definition, B and C are indeed independent. 


A848 (9233 N2008/II/26)(i) Let X ~ Po(3). P(X > 2) = 1-P(X < 0) =1- 
e3(1+34+9/2) =1-8.5e% ~ 1 — 0.423 = 0.577. 


(ii) Let Y be the number of times the machine will break down in a period of four weeks. 
Then Y ~ Po(12). 
P(Y <3) =e" (1412+ 12?/2 + 123/6) » 0.00229. 


(iii) Let Z be the number of times the machine will break down in a period of 16 weeks. 
Then Z ~ Po(48). Since Xz is large, a suitable approximation for Z is the normal distribu- 
tion A ~ N (48,48). Hence, using also the continuity correction, 


50.5 — 48 
V8 


A849 (9233 N2008/II/27)(i) Let the mass after the adjustment be X ~ N(uo,07). It 
is necessary to assume that these masses remain normally distributed. ‘The null hypothesis 
is Ho: Uo = 32.40 and the alternative hypothesis is Hy : uo + 32.40. Now, 


_E-po _ 32.00- 32.40 _ 


Siva \/2.892/80 


Since |t| > t79,9.025 * 1.99, we can reject the null hypothesis. 


P(Z > 50) « P(A>50.5)=1-@ ( x 1 — 6(0.3608) » 1 - 0.6409 = 0.3591. 


—2.104. 





(ii) This means that if Howere true and we tested infinitely many size-80 samples (as done 
above), we’d reject Hy in 5% of the samples. 
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(iii) The one-tailed p-value is ~ 0.0193. So the least level of significance is 1.93%. 


A850 (9233 N2008/II/29)(i) Let X ~ N(50,4”). The probability that Mr Sim is late 
on any given day is 





P(X > 55) =1-6(= 2 


= 1- (1.25) x 1- 0.8944 = 0.1056. 


Assuming that the probability that he’s late each day is independent of whether he was 
late on any other day, the probability that he will be late no more than once in 5 days is 


( : }o.s056%0 sour a ( ; Jose sou ~ 0.910. 


(ii) Let Y~N (40, a) Our desired probability is PLY - Y —5 <0). Assuming the journey 
times of Messrs Sim and Lee are independent, X —- Y -5~N (5, A? + 5): Thus, 

0-5 
V4? + 5? 


(iii) Assume that the journey times of Messrs Sim and Lee each day are independent. Then 
the desired probability is 


P(X -Y-5<0)=6 ( x 1 - ©(0.7809) » 1 - 0.7826 = 0.2174. 


( : Joarre 07s 4. ( j }oarrao ea - ( ° Joarre0 7 ~ 0.0722. 


A851 (9233 N2008/II/30)(i) Let M ~N(p,07). P(M < 86.50) = ® (———) = 0.12 
Oo 


gas, COUT ize 
O 


2.29 — 92.25 — 92.25 — 
P(M > 92.25) =1-© (———*) -0.2 <> © (=——") 208. a. <2 6 ey: 
O O O 
a — 9.19 
=minus = yields —— = 2.017 <— > o » 2.85. And now p » 89.85. 
O 


2 
(ii) Let X ~N(y,07). P(w-2<5X <pt2)=0.8 => P(X <p+2)=09 <— o(=)- 
Oo 
2 
0.9 —> —81.281 — o# 1.56. 
Oo 


0.50 
of\/n 
<> 0.50/n>2 —= n>I16. 


. 2 _ 
(iy te x N(u,}. Then P(X > +0.50)<0.1 <> 1 -0/ 
1) 


foe a (eve) , 0:50Vn 0.50/n , 2 
O O O O 


sou — 


> 1.281 << 


A852 (9740 N2007/II/5)(i) Consider a survey of whether students like a particular 
teacher. A quota of 10 students is to be chosen. Take a list of the teacher’s students, sort 
their names alphabetically, and pick the first 10 students on the list. 


One disadvantage is that this sample of 10 students might not be representative. For 
example, they might all be siblings from the same family of Angs. 
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(ii) Yes. If say the teacher teaches 10 different classes, we could stratify our sample by 
class and pick 1 student from each class. 


A853 (9740 N2007/II/6). 
( : }o20o70" ( : Joavoz — | Jo2so. ~ 0.933. 


(i) Let X ~ B(1000, 0.24) be the number of people in a sample of 1000 that have gene A. 
Since np = 240 >5 and n(1-p) = 760 >5 are both large, a suitable approximation for X is 
the normal distribution Y ~ N (240, 182.4). Hence, using also the continuity correction, 


260.5 — 240 229.5 — 240 
P(230 < X < 260) « P(229.5 < Y < 260.5) =© (| ~& (a) 


J182.4 J182.4 
x ©(1.5179) — ©(-0.7775) x 0.9355 — 0.2180 « 0.7175. 


(ii) Let Z ~ B(1000, 0.003) be the number of people in a sample of 1000 that have gene B. 
Since n is large and p is small, a suitable approximation for Y is the Poisson distribution 
A~ Po(3). Hence, 
P(2<24<5)sP(2<Y <5) =P(Y =2)+P(¥ =3)+P(Y =4) 
= 7? (37/2 + 33/6 + 34/24) » 0.616. 
A854 (9740 N2007/II/7) (i) 


462 2 ‘ 
pare epee end 


x 33.7259. 

n 150 n—-l 
(ii) Let Ho : 19 = 30 and Hy: po > 30 be the null and alternative hypotheses. Now, 
_ Bp _  30.84-30 


= yy a 1.772. 
s//n — \/33.7259/150 


Since Z > Zo.o5 = 1.645, we can reject the null hypothesis. 


Z 





(iii) We used the Z-test. The sample size is large, so the normal distribution is a good 
approximation provided the underlying distribution is “nice enough”. 


A855 (9740 N2007/II/8)(i) Let C be the weight of a randomly chosen chicken. Then 
C ~N(2.2,0.57). Then 3C ~ N (3-2.2,3?-0.5?) = N(6.6, 1.5°) and 
(=65 


4 
P =1-6{——-} =1-4(—] «0.3949. 
et ( 1.5 (<5) als 


(ii) Let T be the weight of a randomly chosen turkey. Then T ~ N (10.5, 2.17). Then 
57 ~ N (5- 10.5, 5? - 2.1?) = N (52.5, 10.5”) and 


50 — 52.5 


P(ST > 55) = 1 - & | ——_— 
eae ( 10.5 


5 
=1-®{ — } x 0.405904. 
}-1-#(57) 


Thus, P(3C’ > 7)-P(5T > 55) * 0.160. 
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(iii) 3C + 5T ~ N (6.6 + 52.5, 1.5? + 10.5) = N (59.1, 112.5). So, 


~59.1 
P(3C +5T > 62) = 1-0( = 1-0(=)] x 0.392. 
J112.5 21 


(iv) The event “both chicken costs more than $7 and turkey costs more than $55” is a 
proper subset of the event “chicken and turkey together cost $62”. By the monotonicity of 
probability, the probability of the latter is greater than the latter. 


A856 (9740 N2007/II/9)(i)(a) 12! (b) 6!- 2° 

(ii)(a) 11! 

(ii)(b) Fix any man. Then we must have to his right: Woman, man, woman, man, etc. So 
615! 

(ii)(c) Fix any man A. Then we must have 

¢ To his right: “Wife A, some other man, that some other man’s wife, etc.”; OR 


e To his left: “Wife A, some other man, that some other man’s wife, etc.”. 


In the first scenario, we have 5! possible arrangements. Likewise in the second. Altogether 
2-5! possible arrangements. 


A857 (9740 N2007/II/10). 


Figure to be 


inserted here. 





111.1 
842 64 
(ii) P(1,1) + P(,0,1) + P(0, 1,1) = 





(i) PCL Ay = 
Pil P37 1 1_8+2-3+7_ 21 
84°84 4°88 4 256 256 
(iii) Let E and F be the events that “the third throw is successful” and “exactly two of 


the three throws are successful”. 


13 1 ae 11. 13 
Pen) = P10, 1)+P(0, 1,1) = s— ++ ase 
(ENF) =P(1,0,1)+PO1 1) =3-5-545-5-5= 5 
Ty 
P(f)= P(En F) +P(B'n F) = = + P(1,1,0) = 


Thus, P(E|F) = P(En F’) = P(F) = 13/17. 
A858 (9740 N2007/II/11)(i) In general, the estimated regression equation is y - y = 
b(x - Z), where b= » Li » ai > ce So in this case, the estimated regression equation is 


x — 131.667 » —0.260 (t - 32) or x» —0.260t + 66.194. 


(ii) x(t = 300) » -0.259701 - (300) + 66.194030 « -11.7. 
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From the scatter diagram, the linear model does not appear to be suitable. Moreover, the 
linear model predicts that at t = 300, x < 0, which is impossible. 


(iii) PMCC » —0.993839. Its magnitude is larger than —0.912 and very close to -1. It 
would appear that the regression of Inx on ¢t is a more appropriate model. 

(iv) In general, the estimated regression equation is y—y = b(a-2), where b = )° 2; ai > a. 
So in this case, the estimated regression equation is 


In x — 2.995391 » -0.0123434 (t — 131.666667) 
<> Inz x -0.0123434t + 4.620609 


(v) c=15 = >t» (4.620609 — In 15) /0.0123434 » 155. 


A859 (9233 N2007/1/4)(i) It cannot be that all three vertices are collinear. Thus, one 
vertex must be chosen from the upper line segment and the other must be chosen from the 
lower line segment. Hence, there are 3 x 6 = 18 possible triangles. 


(ii) Consider triangles that do not have A as a vertex. Two vertices must be chosen from one 
line segment and the third must be chosen from the other. So there are ; }roa : = 
21+ 60 = 81 possible triangles. Now, including also triangles with A as a vertex, we have 


99 possible triangles. 


A860 (9233 N2007/II/23)(i) X ~ (30,5?) == X00 ~ (30,57/100) = (30,0.25). Since 
the sample size is sufficiently large, by the Central Limit Theorem, a suitable approximation 
for Xj99 is the normal distribution Y ~ N (30,0.25). So 


P (29.2 < X199 < 30.8) » P (29.2 < Y < 30.8) ~ 0.945201 - 0.054799 x 0.890. 


(ii) The distribution is “sufficiently nice” that with a sample size of 100, it is appropriate 
to use the CLT. 


A861 (9233 N2007/II/25)(i) P(W|B) = 20/52 = 5/13 ~ 0.384615. 

(ii) P(BIW) = 20/40 = 0.5. 

(iii) P(BUW) = (40 + 32)/90 = 72/100 = 0.72. 

(iv) P(W)P(B) =0.4-0.52 + P()BUW) and so W and B are not independent. 


There are men who take chemistry (equivalently, P(MM nC) # 0), so M and C are not 
mutually exclusive. 


A862 (9233 N2007/II/26)(i) Let X be the number of genuine call-outs in a randomly 
chosen two-week period. Then X ~ Po (4) and 


2 3 4 45 
7 a : + 4 + ) % 0.785130. 
2! 3! 6A! 





P(X <6)=e4(144+ 


(ii) Let Y be the total number of call-outs in a randomly chosen six-week period. Then 
Y ~ Po(15) and since Ay is large, a suitable approximation for Y is the normal distribution 
Z ~N(15,15). Hence, using also the continuity correction, 
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19.5-15 


V15 


A863 (N2007/II/27-9233)(i) L +H ~N(5+3,0.1? + 0.05?) = N (8, 0.0125). So 


P(Y > 19) = P(Z> 19) #1-9/ J ots. 


(79s L+H s82)=8 (S780) 9( 780 


V0.0125 V 0.0125 


(ii) 0.74L + 0.86H ~ N (0.74-5 + 0.86 - 3, 0.74? - 0.1? + 0.86? - 0.05) = N (6.28, 0.00728225). 


# 0.963 — 0.185 x 0.778. 


6.2 - 6.28 6.1- 6.28 
So, P(6.1<0.74L +0.86H < 6.2) - 4 \- ( 


V0.00728225 V0.00728225 


# 0.183 — 0.021 x 0.162. 


A867 (9233 N2006/II/26) (i) Let X be the number of severe floods in a randomly chosen 
100-year period. Then X ~ Po(2). So 


[P(X =1)] = (e?- 2)" = 4e~4 » 0.0733. 


(ii) Let Y be the number of severe floods in a randomly chosen 1000-year period. Then 
Y ~ Po(20). Since Ay is large, a suitable approximation for Y is the normal distribution 
Z ~N (20,20). Hence, using also the continuity correction, 

25.5 — 20 


P(Y >25) « P(Z> 2555) =1- 0 Tia 


x 0.109. 
A864 (9233 N2006/I/4). We could have 


e All three identical—1 possibility. 


e ‘Two identical—5 possibilities. 


e One entea-( ; = 10 possibilities. 


e None ensca_( ; = 10 possibilities. 


So total 26 possibilities. 

A865 (9233 N2006/II/23)(i) P(A) = 1/3. 

The sum of two scores is 9 if the dice are (3,6), (4,5), (5,4), or (6,3). So P(B) = 4/36 = 1/9. 
P(An B) = 2/36 = 1/18 + P(A)P(B), so A and B are not independent. 

(ii) P(Au B) = P(A) + P(B) - P(AnB) = 1/3+1/9-1/18= 7/18. 


A866 (9233 N2006/II/25)(i) The null hypothesis is Ho : = fp = 10000 and the alter- 
native hypothesis is H,4 : pu < 10000 
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7 Ea Ho ¥ (x — 10000)/n + 10000 = pup 
slV” \[£S(« 10000)? - [Xx - 10000) /n} / (n- 1)/ Vn 


_ —2510/80 + 10000 — 10000 ~ -1.795. 


\/ {2010203 — (-2510)” /80} /79/./80 


Since |Z| > Zo.95 = 1.645, we can reject the null hypothesis. 





(ii) If Ho is true and we conduct the above test on infinitely many size-80 samples, we’d 
(falsely) reject Ho for 5% of the samples. 


A868 (9233 N2006/II/28)(i) Let the speed of any car (in kmh!) be X ~ N(p,07). We 
are given that P(X > 125) = 1/80 and P(X < 40) = 1/10. 














1 pan). i =) 79 
BeSOc 2: 126 = b _79 
(X > 125) = 5 Ge ) 80 ( . 80 
—H 42.240 
O 
1 AO — pu ih AD - p 2 
Peete -=& @2=4)22 22 Rey 
aT, ( S 10 


85 
% minus & yields — * 3.522 <— > o x 24.1 and px 70.9. 
o 


(ii) ( a‘ }o.roa | . }o.os ( > }o.09 ( 5 Jo.os ~ 0.987. 


(iii) Let Y be the number of cars out of a random sample of 100 that are travelling at speed 
less than 40 km h'!. Then Y ~ B(100,0.1). Since np = 10 > 5 and n(1-p) = 90>5 are 
both large, a suitable approximation to Y is the normal distribution Z ~ N (10,9). Hence, 
using also the continuity correction: 


8.5 - 10 
V9 





P(Y <8)eP(Z<85)=® ( = 1- 6(0.5) x 1- 0.6915 = 0.3085. 


2166, Contents www.EconsPhDTutor.com 


Abbreviations Used in This Textbook 
We give the number of the page on which each abbreviation is first used. 
2D two-dimensional 


3D three-dimensional 


9233 — subject code of 2002-08 H2-Maths-equivalent syllabus 
9740 subject code of 2007-17 H2 Maths syllabus 
9758 — subject code of current H2 Maths syllabus (2017—?) 


A Level  Singapore-Cambridge Advanced Level 
‘A’ Maths Additional Mathematics (an O-Level subject) 


ASEAN _ Association of Southeast Asian Nations 70 
BBC British Broadcasting Corporation xi 
CLT ~— Central Limit Theorem as 
COI — constant of integration 1032 

CPFE — Correct Procedure for Finding Extrema 2? 
DOS _ disk operating system XXXVii 
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ESGS — Every School a Good School xli 


FDTE First Derivative Test for Extrema 935, 1688 
FDTI _ First Derivative Test for Inflexion Points caigmet ay 
FLC four-letter campaign pi 
FSSPFE — the Flawed Secondary School Procedure for Finding 2? 
Extrema 


FTCs Fundamental Theorems of Calculus 


GCT = Goh Chok Tong 


GDP _ gross domestic product xl 


GUI _ graphical user interface 
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Hi, He. H3 
HCI 


IET 


IMHO 


IMO 


i, 


JC 


JSYK 


LHL 


List MF26 


LKY 


MOE 
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Higher 1, Higher 2, Higher 3 (see A Level) 
Hwa Chong Institution 


Interior Extremum Theorem 


in my humble opinion 
International Mathematical Olympiad 
the first and second years of junior college 


junior college 


just so you know 


Lee Hsien Loong 


List of Formulae and Statistical Tables 


Lee Kuan Yew 


Ministry of Education (Singapore) 


uf 


XXXVI 


XXXV 


XXXV 


XXXVI 
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O Level 
PAP 
PDF 
PDF 

PISA 
PM 


PSC 
PSLE 


Q&A 


SDN 


SDTE 


SDTI 


SDU 


SE 
SEAB 


TI84 


TLA 
TLLM 


TLT 


TPL 


TSLN 


Lys 
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Singapore-Cambridge Ordinary Level 


People’s Action Party 


portable document format 
probability density function 


Programme for International Student Assessment 


Prime Minister 
Public Service Commission (Singapore) 
Primary School Leaving Examination 


question and answer 
Social Development Network 
Second Derivative Test for Extrema 


Second Derivative Test for Inflexion Points 


Social Development Unit 


Stack Exchange 
Singapore Examinations and Assessment Board 


TI-84 PLUS Silver Edition calculator 


three-letter abbreviation®”” 


Teach Less, Learn More 
Tangent Line Test 


Tin Pei Ling 


Thinking Schools, Learning Nation 


Ten Year Series 


XXXVi 


944 


gs 


XXXVI 


XXXVI 


XX1xX 


xli 


965 


2? 


ait 


xl 
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Singlish Used in This Textbook 


We give the number of the page on which each Singlish expression is first used. 


Note that there is often no single standardised spelling of a Singlish word or phrase (e.g. 


angmoh and sibei might be spelt angmo and seepeh). 


angmoh — white person (Hokkien, literally “red-haired”) 


Familee — the family of Lee Kuan Yew 


Gahmen — government 


kiasu _ literally, “afraid to lose” (Hokkien) 


mug to study hard, especially in rote fashion 


promos the J1 end-of-year promotional examinations 


sibei very (literally, “dead father”) 
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366 


70 


xli 


XXXV 


xliii 


xliv 


1034 
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Notation Used in the Main Text 


We list only notation not already already listed on pp. 14-18 of your H2 Maths syllabus. 
This is a fairly short list because we’ve generally tried to stick closely to the notation used 
in your syllabus and exams. 


We give the number of the page on which each piece of notation is first used and/or defined. 


Domain f 
Codomain f 
Range f 


lim f (x) 


ra” 


lim f (x) 


ra” 


A\IV 
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because 
therefore 
ditto 


the set of non-negative integers, {0,1,2,3,...} 

the set of non-positive integers, {...,-3,-2,-1,0} 

the set of non-positive rational numbers, {7 ¢ Q: x <0} 
the set of non-positive real numbers, {x € R: x < 0} 


proper subset of (used by other writers, not used in this 
textbook) 


set minus (or set difference) 
the domain of the function f 
the codomain of the function f 


the range of the function f 


the left-hand limit of f as x tends to a 
the right-hand limit of f as x tends to a 


any inequality, i.e. any of >, <, or <. 


63 


2?, 1654 
2?, 1654 
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Notation Used (Appendices) 


We list only notation not already listed on the previous page or on pp. 14-18 of your H2 
Maths syllabus. 


We give the number of the page on which each piece of notation is first used and/or defined. 


AxB — the cartesian product of the sets A and B 1556 
N.(a) — the e-neighbourhood of a—in R, the set (a-¢,a+¢) 1652 
N- (a) _ the left e-neighbourhood of a—in R, the set (a -«,a) 1652 
Ni (a) — the right ¢-neighbourhood of a—in R, the set (a,a+¢) 1652 
M-(a) the deleted (or punctured) e-neighbourhood of a—in R, 1652 


the set (a-€,a) U(a,a+é) 
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YouTube Ad 


For a while I was making educational YouTube videos (channel name: Econ 
Cow)! Mostly on economics. Do me a favour by checking them out and 
letting me know how | can improve! Be sure to hit subscribe, like, and leave 
me a comment! 


YouTube.com/EconCow 








yy. The Fallacy 
las of the__ 
i - | Hot-Hand, vEW 








(Voting #1) 
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Tuition Ad 


I give tuition for 


- Economics 
- Mathematics 
- Writing, English, General Paper 


I have a PhD in economics and have been teach- 
ing and tutoring since 2010, at every level from 
secondary school through PhD. 


For more information, please visit 


EconsPhD'Tutor.com 


Or email 


DrChooYanMin@gmail.com 
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